This is an archive of the discontinued LLVM Phabricator instance.

[X86FixupLEAs] Turn optIncDec into a generic two address LEA optimizer. Support LEA64_32r properly.
ClosedPublic

Authored by craig.topper on May 2 2019, 4:24 PM.

Details

Reviewers

spatel
RKSimon
andreadb

Commits

rG46e5052b8e2d: [X86FixupLEAs] Turn optIncDec into a generic two address LEA optimizer. Support…
rL361691: [X86FixupLEAs] Turn optIncDec into a generic two address LEA optimizer. Support…

Summary

INC/DEC is really a special case of a more general issue. We should also turn LEAs into add reg/reg or add reg/imm regardless of the slow-LEA flags.

This also supports LEA64_32, which has 64-bit input registers and a 32-bit output register. So we need to convert the 64-bit inputs to their 32-bit equivalents to check whether either of them is equal to the destination register.
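The register-width mismatch can be illustrated with a small standalone model. This is only a sketch: the `Reg` enum, `sub32`, and `destMatchesInput` are hypothetical names covering a tiny register set, and `sub32` merely stands in for the `TRI->getSubReg(Reg, X86::sub_32bit)` call used in the patch.

```cpp
#include <cassert>

// Hypothetical register set for illustration; NoReg (0) means "no register".
enum Reg { NoReg = 0, EAX, ECX, EDI, RAX, RCX, RDI };

// Toy analogue of asking the register info for the 32-bit sub-register
// of a 64-bit register. Anything without a known sub-register maps to NoReg.
Reg sub32(Reg R) {
  switch (R) {
  case RAX: return EAX;
  case RCX: return ECX;
  case RDI: return EDI;
  default:  return NoReg;
  }
}

// An LEA64_32-style instruction writes a 32-bit register but reads 64-bit
// inputs, so the input must be narrowed before it is compared to the
// destination.
bool destMatchesInput(Reg Dest32, Reg Input64) {
  assert(Dest32 != NoReg && "an LEA always defines a destination");
  return Input64 != NoReg && sub32(Input64) == Dest32;
}
```

In this model a direct comparison such as `EAX == RAX` is false, while `destMatchesInput(EAX, RAX)` is true, which is the reason the inputs have to be converted first.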

One thing to note: the original code preserved the kill flags by adding operands to the new instruction instead of using addReg. But I think tied operands aren't supposed to have the kill flag set. I dropped the kill flags, but I could probably try to preserve them in the add reg/reg case if we think it's important. I'm not sure which operand the flag is supposed to go on for the LEA64_32r instruction due to the super reg implicit uses. Though I'm also not sure those are needed, since they were probably just created by an INSERT_SUBREG from a 32-bit input.
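As a rough illustration of the generalization described in this summary, the decision can be modelled outside of LLVM. The names `Lea`, `Rewrite`, and `classifyTwoAddrLea` are hypothetical and do not appear in the patch; registers are plain integers with 0 meaning "no register". The sketch deliberately omits the segment register, non-immediate displacements, the stack pointer exception, and the LEA64_32r sub-register handling.

```cpp
#include <cassert>
#include <cstdint>

using Reg = unsigned;
constexpr Reg NoReg = 0;

// Simplified LEA: Dest = Base + Scale * Index + Disp.
struct Lea {
  Reg Dest;
  Reg Base;
  unsigned Scale;
  Reg Index;
  std::int64_t Disp;
};

enum class Rewrite { Keep, AddRR, AddRI, Inc, Dec };

// Decide which two-address instruction, if any, could replace the LEA.
// FlagsDead models the "safe to clobber EFLAGS" check: ADD/INC/DEC write
// the flags while LEA does not.
Rewrite classifyTwoAddrLea(const Lea &L, bool OptIncDec, bool FlagsDead) {
  assert(L.Dest != NoReg && "an LEA always defines a destination");
  if (!FlagsDead || L.Scale > 1)
    return Rewrite::Keep;

  // lea (%r1,%r2), %r1 or lea (%r2,%r1), %r1  ->  add %r2, %r1
  if (L.Base != NoReg && L.Index != NoReg && L.Disp == 0 &&
      (L.Dest == L.Base || L.Dest == L.Index))
    return Rewrite::AddRR;

  // lea disp(%r1), %r1  ->  inc/dec %r1 or add $disp, %r1
  if (L.Dest == L.Base && L.Index == NoReg) {
    if (OptIncDec && L.Disp == 1)
      return Rewrite::Inc;
    if (OptIncDec && L.Disp == -1)
      return Rewrite::Dec;
    return Rewrite::AddRI;
  }
  return Rewrite::Keep;
}
```

In this model, an LEA whose destination differs from both inputs is left alone, since only the three-address form can express it in one instruction.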

Diff Detail

Repository: rL LLVM

Event Timeline

craig.topper created this revision. May 2 2019, 4:24 PM

Herald added a project: Restricted Project. May 2 2019, 4:24 PM

Herald added a subscriber: hiraditya.

RKSimon added a reviewer: andreadb. May 3 2019, 1:33 AM

RKSimon mentioned this in rL359886: [X86] Add X64 common prefixes and regenerate mul i64 tests. May 3 2019, 7:05 AM

RKSimon mentioned this in rG4d4f779fa2e5: [X86] Add X64 common prefixes and regenerate mul i64 tests.

Overall, the patch looks good to me.

I also don't think that those tied operands should have had the kill flag set. But then, I don't know how important it is to accurately preserve kill flags in FixupLEAs; my understanding is that FixupLEAs runs quite late in the codegen pipeline (since it is a pre-emit pass). So, I don't know if passes running after it still check for the presence of those flags.

Rebase

Herald added a project: Restricted Project. May 5 2019, 10:08 PM

Herald added a subscriber: cfe-commits.

Harbormaster completed remote builds in B31429: Diff 198220. May 5 2019, 10:09 PM

Where did all the clang diffs come from?

Not sure. I'll fix it.

Get rid of clang stuff.

Ping

I haven't looked closely at the series of transforms that gets us here, so let me ask: would it be more efficient to produce the add/inc/dec machine instructions directly rather than LEA? Or do we do this because the 3-address opportunity helps register allocation, so this late reversal is a special-case of the more generally useful transforms that produce LEA in the 1st place?

llvm/lib/Target/X86/X86FixupLEAs.cpp
11 ↗	(On Diff #198303)	Should update this comment to better match the logic: "It replaces LEAs with ADD/INC/DEC when that is better for size/speed." ?

In D61472#1515994, @spatel wrote:

I haven't looked closely at the series of transforms that gets us here, so let me ask: would it be more efficient to produce the add/inc/dec machine instructions directly rather than LEA? Or do we do this because the 3-address opportunity helps register allocation, so this late reversal is a special-case of the more generally useful transforms that produce LEA in the 1st place?

I believe a lot of these are caused by TwoAddressInstructionPass converting to three address sometimes when it maybe shouldn't. I think there's some overlap with the test diffs in D52109. But I don't think the change I made in that diff is correct. There appear to be some test changes here that aren't affected by that patch either. So maybe there are other issues in the TwoAddressInstruction heuristics.

In D61472#1516642, @craig.topper wrote:

In D61472#1515994, @spatel wrote:

I haven't looked closely at the series of transforms that gets us here, so let me ask: would it be more efficient to produce the add/inc/dec machine instructions directly rather than LEA? Or do we do this because the 3-address opportunity helps register allocation, so this late reversal is a special-case of the more generally useful transforms that produce LEA in the 1st place?

I believe a lot of these are caused by TwoAddressInstructionPass converting to three address sometimes when it maybe shouldn't. I think there's some overlap with the test diffs in D52109. But I don't think the change I made in that diff is correct. There appear to be some test changes here that aren't affected by that patch either. So maybe there are other issues in the TwoAddressInstruction heuristics.

Ok - these diffs look alright, so LGTM. Maybe we can adjust the earlier transforms to avoid some back-and-forth though as follow-up work.

This revision is now accepted and ready to land. May 24 2019, 1:46 PM

Closed by commit rL361691: [X86FixupLEAs] Turn optIncDec into a generic two address LEA optimizer. Support… (authored by ctopper). May 24 2019, 11:15 PM

This revision was automatically updated to reflect the committed changes.

Herald added a subscriber: qcolombet. May 24 2019, 11:15 PM

sidorovd mentioned this in rGea4b5f93cfbc: [X86] Add X64 common prefixes and regenerate mul i64 tests. May 30 2019, 8:18 AM

sidorovd mentioned this in rGd8e44eab6b87: [X86] Add X64 common prefixes and regenerate mul i64 tests. May 30 2019, 9:30 AM

Revision Contents

Path

Size

llvm/

trunk/

lib/

Target/

X86/

X86FixupLEAs.cpp

154 lines

test/

CodeGen/

X86/

GlobalISel/

12 lines

2 lines

14 lines

2 lines

MergeConsecutiveStores.ll

4 lines

atomic-unordered.ll

2 lines

avx512vl-intrinsics-upgrade.ll

12 lines

bitreverse.ll

4 lines

bswap_tree2.ll

2 lines

bypass-slow-division-32.ll

2 lines

2 lines

4 lines

2 lines

4 lines

10 lines

2 lines

4 lines

2 lines

4 lines

26 lines

2 lines

6 lines

ragreedy-hoist-spill.ll

2 lines

2 lines

4 lines

6 lines

2 lines

4 lines

win_coreclr_chkstk.ll

2 lines

Transforms/

LoopStrengthReduce/

X86/

ivchain-X86.ll

8 lines

Diff 201392

llvm/trunk/lib/Target/X86/X86FixupLEAs.cpp

//===-- X86FixupLEAs.cpp - use or replace LEA instructions -----------===//		//===-- X86FixupLEAs.cpp - use or replace LEA instructions -----------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
// This file defines the pass that finds instructions that can be		// This file defines the pass that finds instructions that can be
// re-written as LEA instructions in order to reduce pipeline delays.		// re-written as LEA instructions in order to reduce pipeline delays.
// When optimizing for size it replaces suitable LEAs with INC or DEC.		// It replaces LEAs with ADD/INC/DEC when that is better for size/speed.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "X86.h"		#include "X86.h"
#include "X86InstrInfo.h"		#include "X86InstrInfo.h"
#include "X86Subtarget.h"		#include "X86Subtarget.h"
#include "llvm/ADT/Statistic.h"		#include "llvm/ADT/Statistic.h"
#include "llvm/CodeGen/MachineFunctionPass.h"		#include "llvm/CodeGen/MachineFunctionPass.h"
[45 lines not shown]	class FixupLEAPass : public MachineFunctionPass {
/// - LEA that uses base and index registers where the base is EBP, RBP,		/// - LEA that uses base and index registers where the base is EBP, RBP,
/// or R13		/// or R13
/// - LEA that uses RIP relative addressing mode		/// - LEA that uses RIP relative addressing mode
/// - LEA that uses 16-bit addressing mode "		/// - LEA that uses 16-bit addressing mode "
/// This function currently handles the first 2 cases only.		/// This function currently handles the first 2 cases only.
MachineInstr *processInstrForSlow3OpLEA(MachineInstr &MI,		MachineInstr *processInstrForSlow3OpLEA(MachineInstr &MI,
MachineBasicBlock &MBB);		MachineBasicBlock &MBB);

/// Look for LEAs that add 1 to reg or subtract 1 from reg		/// Look for LEAs that are really two address LEAs that we might be able to
/// and convert them to INC or DEC respectively.		/// turn into regular ADD instructions.
bool fixupIncDec(MachineBasicBlock::iterator &I,		bool optTwoAddrLEA(MachineBasicBlock::iterator &I,
MachineBasicBlock &MBB) const;		MachineBasicBlock &MBB, bool OptIncDec,
		bool UseLEAForSP) const;

/// Determine if an instruction references a machine register		/// Determine if an instruction references a machine register
/// and, if so, whether it reads or writes the register.		/// and, if so, whether it reads or writes the register.
RegUsageState usesRegister(MachineOperand &p, MachineBasicBlock::iterator I);		RegUsageState usesRegister(MachineOperand &p, MachineBasicBlock::iterator I);

/// Step backwards through a basic block, looking		/// Step backwards through a basic block, looking
/// for an instruction which writes a register within		/// for an instruction which writes a register within
/// a maximum of INSTR_DISTANCE_THRESHOLD instruction latency cycles.		/// a maximum of INSTR_DISTANCE_THRESHOLD instruction latency cycles.
[24 lines not shown]	public:
// This pass runs after regalloc and doesn't support VReg operands.		// This pass runs after regalloc and doesn't support VReg operands.
MachineFunctionProperties getRequiredProperties() const override {		MachineFunctionProperties getRequiredProperties() const override {
return MachineFunctionProperties().set(		return MachineFunctionProperties().set(
MachineFunctionProperties::Property::NoVRegs);		MachineFunctionProperties::Property::NoVRegs);
}		}

private:		private:
TargetSchedModel TSM;		TargetSchedModel TSM;
const X86InstrInfo *TII; // Machine instruction info.		const X86InstrInfo *TII;
		const X86RegisterInfo *TRI;
};		};
}		}

char FixupLEAPass::ID = 0;		char FixupLEAPass::ID = 0;

INITIALIZE_PASS(FixupLEAPass, FIXUPLEA_NAME, FIXUPLEA_DESC, false, false)		INITIALIZE_PASS(FixupLEAPass, FIXUPLEA_NAME, FIXUPLEA_DESC, false, false)

MachineInstr *		MachineInstr *
[66 lines not shown]	if (skipFunction(MF.getFunction()))
return false;		return false;

const X86Subtarget &ST = MF.getSubtarget<X86Subtarget>();		const X86Subtarget &ST = MF.getSubtarget<X86Subtarget>();
bool IsSlowLEA = ST.slowLEA();		bool IsSlowLEA = ST.slowLEA();
bool IsSlow3OpsLEA = ST.slow3OpsLEA();		bool IsSlow3OpsLEA = ST.slow3OpsLEA();
bool LEAUsesAG = ST.LEAusesAG();		bool LEAUsesAG = ST.LEAusesAG();

bool OptIncDec = !ST.slowIncDec() || MF.getFunction().hasOptSize();		bool OptIncDec = !ST.slowIncDec() || MF.getFunction().hasOptSize();
bool OptLEA = LEAUsesAG || IsSlowLEA || IsSlow3OpsLEA;		bool UseLEAForSP = ST.useLeaForSP();

if (!OptLEA && !OptIncDec)
return false;

TSM.init(&ST);		TSM.init(&ST);
TII = ST.getInstrInfo();		TII = ST.getInstrInfo();
		TRI = ST.getRegisterInfo();

LLVM_DEBUG(dbgs() << "Start X86FixupLEAs\n";);		LLVM_DEBUG(dbgs() << "Start X86FixupLEAs\n";);
for (MachineBasicBlock &MBB : MF) {		for (MachineBasicBlock &MBB : MF) {
// First pass. Try to remove or optimize existing LEAs.		// First pass. Try to remove or optimize existing LEAs.
for (MachineBasicBlock::iterator I = MBB.begin(); I != MBB.end(); ++I) {		for (MachineBasicBlock::iterator I = MBB.begin(); I != MBB.end(); ++I) {
if (!isLEA(I->getOpcode()))		if (!isLEA(I->getOpcode()))
continue;		continue;

if (OptIncDec && fixupIncDec(I, MBB))		if (optTwoAddrLEA(I, MBB, OptIncDec, UseLEAForSP))
continue;		continue;

if (IsSlowLEA) {		if (IsSlowLEA) {
processInstructionForSlowLEA(I, MBB);		processInstructionForSlowLEA(I, MBB);
} else if (IsSlow3OpsLEA) {		} else if (IsSlow3OpsLEA) {
if (auto NewMI = processInstrForSlow3OpLEA(I, MBB)) {		if (auto NewMI = processInstrForSlow3OpLEA(I, MBB)) {
MBB.erase(I);		MBB.erase(I);
I = NewMI;		I = NewMI;
[94 lines not shown]	static inline bool hasLEAOffset(const MachineOperand &Offset) {
return (Offset.isImm() && Offset.getImm() != 0) || Offset.isGlobal();		return (Offset.isImm() && Offset.getImm() != 0) || Offset.isGlobal();
}		}

static inline unsigned getADDrrFromLEA(unsigned LEAOpcode) {		static inline unsigned getADDrrFromLEA(unsigned LEAOpcode) {
switch (LEAOpcode) {		switch (LEAOpcode) {
default:		default:
llvm_unreachable("Unexpected LEA instruction");		llvm_unreachable("Unexpected LEA instruction");
case X86::LEA32r:		case X86::LEA32r:
return X86::ADD32rr;
case X86::LEA64_32r:		case X86::LEA64_32r:
		return X86::ADD32rr;
case X86::LEA64r:		case X86::LEA64r:
return X86::ADD64rr;		return X86::ADD64rr;
}		}
}		}

static inline unsigned getADDriFromLEA(unsigned LEAOpcode,		static inline unsigned getADDriFromLEA(unsigned LEAOpcode,
const MachineOperand &Offset) {		const MachineOperand &Offset) {
bool IsInt8 = Offset.isImm() && isInt<8>(Offset.getImm());		bool IsInt8 = Offset.isImm() && isInt<8>(Offset.getImm());
switch (LEAOpcode) {		switch (LEAOpcode) {
default:		default:
llvm_unreachable("Unexpected LEA instruction");		llvm_unreachable("Unexpected LEA instruction");
case X86::LEA32r:		case X86::LEA32r:
case X86::LEA64_32r:		case X86::LEA64_32r:
return IsInt8 ? X86::ADD32ri8 : X86::ADD32ri;		return IsInt8 ? X86::ADD32ri8 : X86::ADD32ri;
case X86::LEA64r:		case X86::LEA64r:
return IsInt8 ? X86::ADD64ri8 : X86::ADD64ri32;		return IsInt8 ? X86::ADD64ri8 : X86::ADD64ri32;
}		}
}		}

/// isLEASimpleIncOrDec - Does this LEA have one these forms:		static inline unsigned getINCDECFromLEA(unsigned LEAOpcode, bool IsINC) {
/// lea %reg, 1(%reg)		switch (LEAOpcode) {
/// lea %reg, -1(%reg)
static inline bool isLEASimpleIncOrDec(MachineInstr &LEA) {
unsigned SrcReg = LEA.getOperand(1 + X86::AddrBaseReg).getReg();
unsigned DstReg = LEA.getOperand(0).getReg();
const MachineOperand &AddrDisp = LEA.getOperand(1 + X86::AddrDisp);
return SrcReg == DstReg &&
LEA.getOperand(1 + X86::AddrIndexReg).getReg() == 0 &&
LEA.getOperand(1 + X86::AddrSegmentReg).getReg() == 0 &&
AddrDisp.isImm() &&
(AddrDisp.getImm() == 1 || AddrDisp.getImm() == -1);
}

bool FixupLEAPass::fixupIncDec(MachineBasicBlock::iterator &I,
MachineBasicBlock &MBB) const {
MachineInstr &MI = *I;

if (isLEASimpleIncOrDec(MI) && TII->isSafeToClobberEFLAGS(MBB, I)) {
unsigned NewOpcode;
bool isINC = MI.getOperand(1 + X86::AddrDisp).getImm() == 1;
switch (MI.getOpcode()) {
default:		default:
llvm_unreachable("Unexpected LEA instruction");		llvm_unreachable("Unexpected LEA instruction");
case X86::LEA32r:		case X86::LEA32r:
case X86::LEA64_32r:		case X86::LEA64_32r:
NewOpcode = isINC ? X86::INC32r : X86::DEC32r;		return IsINC ? X86::INC32r : X86::DEC32r;
break;
case X86::LEA64r:		case X86::LEA64r:
NewOpcode = isINC ? X86::INC64r : X86::DEC64r;		return IsINC ? X86::INC64r : X86::DEC64r;
break;		}
}		}

MachineInstr *NewMI =		bool FixupLEAPass::optTwoAddrLEA(MachineBasicBlock::iterator &I,
BuildMI(MBB, I, MI.getDebugLoc(), TII->get(NewOpcode))		MachineBasicBlock &MBB, bool OptIncDec,
.add(MI.getOperand(0))		bool UseLEAForSP) const {
.add(MI.getOperand(1 + X86::AddrBaseReg));		MachineInstr &MI = *I;
MBB.erase(I);
I = static_cast<MachineBasicBlock::iterator>(NewMI);		const MachineOperand &Base = MI.getOperand(1 + X86::AddrBaseReg);
return true;		const MachineOperand &Scale = MI.getOperand(1 + X86::AddrScaleAmt);
		const MachineOperand &Index = MI.getOperand(1 + X86::AddrIndexReg);
		const MachineOperand &Disp = MI.getOperand(1 + X86::AddrDisp);
		const MachineOperand &Segment = MI.getOperand(1 + X86::AddrSegmentReg);

		if (Segment.getReg() != 0 || !Disp.isImm() || Scale.getImm() > 1 ||
		!TII->isSafeToClobberEFLAGS(MBB, I))
		return false;

		unsigned DestReg = MI.getOperand(0).getReg();
		unsigned BaseReg = Base.getReg();
		unsigned IndexReg = Index.getReg();

		// Don't change stack adjustment LEAs.
		if (UseLEAForSP && (DestReg == X86::ESP || DestReg == X86::RSP))
		return false;

		// LEA64_32 has 64-bit operands but 32-bit result.
		if (MI.getOpcode() == X86::LEA64_32r) {
		if (BaseReg != 0)
		BaseReg = TRI->getSubReg(BaseReg, X86::sub_32bit);
		if (IndexReg != 0)
		IndexReg = TRI->getSubReg(IndexReg, X86::sub_32bit);
}		}

		MachineInstr *NewMI = nullptr;

		// Look for lea(%reg1, %reg2), %reg1 or lea(%reg2, %reg1), %reg1
		// which can be turned into add %reg2, %reg1
		if (BaseReg != 0 && IndexReg != 0 && Disp.getImm() == 0 &&
		(DestReg == BaseReg || DestReg == IndexReg)) {
		unsigned NewOpcode = getADDrrFromLEA(MI.getOpcode());
		if (DestReg != BaseReg)
		std::swap(BaseReg, IndexReg);

		if (MI.getOpcode() == X86::LEA64_32r) {
		// TODO: Do we need the super register implicit use?
		NewMI = BuildMI(MBB, I, MI.getDebugLoc(), TII->get(NewOpcode), DestReg)
		.addReg(BaseReg).addReg(IndexReg)
		.addReg(Base.getReg(), RegState::Implicit)
		.addReg(Index.getReg(), RegState::Implicit);
		} else {
		NewMI = BuildMI(MBB, I, MI.getDebugLoc(), TII->get(NewOpcode), DestReg)
		.addReg(BaseReg).addReg(IndexReg);
		}
		} else if (DestReg == BaseReg && IndexReg == 0) {
		// This is an LEA with only a base register and a displacement,
		// We can use ADDri or INC/DEC.

		// Does this LEA have one these forms:
		// lea %reg, 1(%reg)
		// lea %reg, -1(%reg)
		if (OptIncDec && (Disp.getImm() == 1 || Disp.getImm() == -1)) {
		bool IsINC = Disp.getImm() == 1;
		unsigned NewOpcode = getINCDECFromLEA(MI.getOpcode(), IsINC);

		if (MI.getOpcode() == X86::LEA64_32r) {
		// TODO: Do we need the super register implicit use?
		NewMI = BuildMI(MBB, I, MI.getDebugLoc(), TII->get(NewOpcode), DestReg)
		.addReg(BaseReg).addReg(Base.getReg(), RegState::Implicit);
		} else {
		NewMI = BuildMI(MBB, I, MI.getDebugLoc(), TII->get(NewOpcode), DestReg)
		.addReg(BaseReg);
		}
		} else {
		unsigned NewOpcode = getADDriFromLEA(MI.getOpcode(), Disp);
		if (MI.getOpcode() == X86::LEA64_32r) {
		// TODO: Do we need the super register implicit use?
		NewMI = BuildMI(MBB, I, MI.getDebugLoc(), TII->get(NewOpcode), DestReg)
		.addReg(BaseReg).addImm(Disp.getImm())
		.addReg(Base.getReg(), RegState::Implicit);
		} else {
		NewMI = BuildMI(MBB, I, MI.getDebugLoc(), TII->get(NewOpcode), DestReg)
		.addReg(BaseReg).addImm(Disp.getImm());
		}
		}
		} else
return false;		return false;

		MBB.erase(I);
		I = NewMI;
		return true;
}		}

void FixupLEAPass::processInstruction(MachineBasicBlock::iterator &I,		void FixupLEAPass::processInstruction(MachineBasicBlock::iterator &I,
MachineBasicBlock &MBB) {		MachineBasicBlock &MBB) {
// Process a load, store, or LEA instruction.		// Process a load, store, or LEA instruction.
MachineInstr &MI = *I;		MachineInstr &MI = *I;
const MCInstrDesc &Desc = MI.getDesc();		const MCInstrDesc &Desc = MI.getDesc();
int AddrOffset = X86II::getMemoryOperandNo(Desc.TSFlags);		int AddrOffset = X86II::getMemoryOperandNo(Desc.TSFlags);
[189 lines not shown]

llvm/trunk/test/CodeGen/X86/GlobalISel/add-ext.ll

	[73 lines not shown]

	; The typical use case: a 64-bit system where an 'int' is used as an index into an array.			; The typical use case: a 64-bit system where an 'int' is used as an index into an array.

	define i8* @gep8(i32 %i, i8* %x) {			define i8* @gep8(i32 %i, i8* %x) {
	; CHECK-LABEL: gep8:			; CHECK-LABEL: gep8:
	; CHECK: # %bb.0:			; CHECK: # %bb.0:
	; CHECK-NEXT: addl $5, %edi			; CHECK-NEXT: addl $5, %edi
	; CHECK-NEXT: movslq %edi, %rax			; CHECK-NEXT: movslq %edi, %rax
	; CHECK-NEXT: leaq (%rsi,%rax), %rax			; CHECK-NEXT: addq %rsi, %rax
	; CHECK-NEXT: retq			; CHECK-NEXT: retq

	%add = add nsw i32 %i, 5			%add = add nsw i32 %i, 5
	%ext = sext i32 %add to i64			%ext = sext i32 %add to i64
	%idx = getelementptr i8, i8* %x, i64 %ext			%idx = getelementptr i8, i8* %x, i64 %ext
	ret i8* %idx			ret i8* %idx
	}			}

	[70 lines not shown]
	define void @PR20134(i32* %a, i32 %i) {			define void @PR20134(i32* %a, i32 %i) {
	; CHECK-LABEL: PR20134:			; CHECK-LABEL: PR20134:
	; CHECK: # %bb.0:			; CHECK: # %bb.0:
	; CHECK-NEXT: # kill: def $esi killed $esi def $rsi			; CHECK-NEXT: # kill: def $esi killed $esi def $rsi
	; CHECK-NEXT: leal 1(%rsi), %eax			; CHECK-NEXT: leal 1(%rsi), %eax
	; CHECK-NEXT: cltq			; CHECK-NEXT: cltq
	; CHECK-NEXT: movq $4, %rcx			; CHECK-NEXT: movq $4, %rcx
	; CHECK-NEXT: imulq %rcx, %rax			; CHECK-NEXT: imulq %rcx, %rax
	; CHECK-NEXT: leaq (%rdi,%rax), %rax			; CHECK-NEXT: addq %rdi, %rax
	; CHECK-NEXT: leal 2(%rsi), %edx			; CHECK-NEXT: leal 2(%rsi), %edx
	; CHECK-NEXT: movslq %edx, %rdx			; CHECK-NEXT: movslq %edx, %rdx
	; CHECK-NEXT: imulq %rcx, %rdx			; CHECK-NEXT: imulq %rcx, %rdx
	; CHECK-NEXT: leaq (%rdi,%rdx), %rdx			; CHECK-NEXT: addq %rdi, %rdx
	; CHECK-NEXT: movl (%rdx), %edx			; CHECK-NEXT: movl (%rdx), %edx
	; CHECK-NEXT: addl (%rax), %edx			; CHECK-NEXT: addl (%rax), %edx
	; CHECK-NEXT: movslq %esi, %rax			; CHECK-NEXT: movslq %esi, %rax
	; CHECK-NEXT: imulq %rcx, %rax			; CHECK-NEXT: imulq %rcx, %rax
	; CHECK-NEXT: leaq (%rdi,%rax), %rax			; CHECK-NEXT: addq %rdi, %rax
	; CHECK-NEXT: movl %edx, (%rax)			; CHECK-NEXT: movl %edx, (%rax)
	; CHECK-NEXT: retq			; CHECK-NEXT: retq

	%add1 = add nsw i32 %i, 1			%add1 = add nsw i32 %i, 1
	%idx1 = sext i32 %add1 to i64			%idx1 = sext i32 %add1 to i64
	%gep1 = getelementptr i32, i32* %a, i64 %idx1			%gep1 = getelementptr i32, i32* %a, i64 %idx1
	%load1 = load i32, i32* %gep1, align 4			%load1 = load i32, i32* %gep1, align 4

	[12 lines not shown]
	; The same as @PR20134 but sign extension is replaced with zero extension			; The same as @PR20134 but sign extension is replaced with zero extension
	define void @PR20134_zext(i32* %a, i32 %i) {			define void @PR20134_zext(i32* %a, i32 %i) {
	; CHECK-LABEL: PR20134_zext:			; CHECK-LABEL: PR20134_zext:
	; CHECK: # %bb.0:			; CHECK: # %bb.0:
	; CHECK-NEXT: # kill: def $esi killed $esi def $rsi			; CHECK-NEXT: # kill: def $esi killed $esi def $rsi
	; CHECK-NEXT: leal 1(%rsi), %eax			; CHECK-NEXT: leal 1(%rsi), %eax
	; CHECK-NEXT: movq $4, %rcx			; CHECK-NEXT: movq $4, %rcx
	; CHECK-NEXT: imulq %rcx, %rax			; CHECK-NEXT: imulq %rcx, %rax
	; CHECK-NEXT: leaq (%rdi,%rax), %rax			; CHECK-NEXT: addq %rdi, %rax
	; CHECK-NEXT: leal 2(%rsi), %edx			; CHECK-NEXT: leal 2(%rsi), %edx
	; CHECK-NEXT: imulq %rcx, %rdx			; CHECK-NEXT: imulq %rcx, %rdx
	; CHECK-NEXT: leaq (%rdi,%rdx), %rdx			; CHECK-NEXT: addq %rdi, %rdx
	; CHECK-NEXT: movl (%rdx), %edx			; CHECK-NEXT: movl (%rdx), %edx
	; CHECK-NEXT: addl (%rax), %edx			; CHECK-NEXT: addl (%rax), %edx
	; CHECK-NEXT: imulq %rcx, %rsi			; CHECK-NEXT: imulq %rcx, %rsi
	; CHECK-NEXT: leaq (%rdi,%rsi), %rax			; CHECK-NEXT: leaq (%rdi,%rsi), %rax
	; CHECK-NEXT: movl %edx, (%rax)			; CHECK-NEXT: movl %edx, (%rax)
	; CHECK-NEXT: retq			; CHECK-NEXT: retq

	%add1 = add nuw i32 %i, 1			%add1 = add nuw i32 %i, 1
	[15 lines not shown]

llvm/trunk/test/CodeGen/X86/GlobalISel/callingconv.ll

	[403 lines not shown]
	; X32-NEXT: .cfi_def_cfa_offset 16			; X32-NEXT: .cfi_def_cfa_offset 16
	; X32-NEXT: movl {{[0-9]+}}(%esp), %eax			; X32-NEXT: movl {{[0-9]+}}(%esp), %eax
	; X32-NEXT: movl {{[0-9]+}}(%esp), %ecx			; X32-NEXT: movl {{[0-9]+}}(%esp), %ecx
	; X32-NEXT: movl (%eax), %eax			; X32-NEXT: movl (%eax), %eax
	; X32-NEXT: movl (%ecx), %edx			; X32-NEXT: movl (%ecx), %edx
	; X32-NEXT: movl 4(%ecx), %ecx			; X32-NEXT: movl 4(%ecx), %ecx
	; X32-NEXT: movl %eax, (%esp)			; X32-NEXT: movl %eax, (%esp)
	; X32-NEXT: movl $4, %eax			; X32-NEXT: movl $4, %eax
	; X32-NEXT: leal (%esp,%eax), %eax			; X32-NEXT: addl %esp, %eax
	; X32-NEXT: movl %edx, {{[0-9]+}}(%esp)			; X32-NEXT: movl %edx, {{[0-9]+}}(%esp)
	; X32-NEXT: movl %ecx, 4(%eax)			; X32-NEXT: movl %ecx, 4(%eax)
	; X32-NEXT: calll variadic_callee			; X32-NEXT: calll variadic_callee
	; X32-NEXT: addl $12, %esp			; X32-NEXT: addl $12, %esp
	; X32-NEXT: .cfi_def_cfa_offset 4			; X32-NEXT: .cfi_def_cfa_offset 4
	; X32-NEXT: retl			; X32-NEXT: retl
	;			;
	; X64-LABEL: test_variadic_call_2:			; X64-LABEL: test_variadic_call_2:
	[17 lines not shown]

llvm/trunk/test/CodeGen/X86/GlobalISel/gep.ll

; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py		; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
; RUN: llc -mtriple=x86_64-linux-gnu -global-isel -verify-machineinstrs < %s -o - | FileCheck %s --check-prefix=ALL --check-prefix=X64_GISEL		; RUN: llc -mtriple=x86_64-linux-gnu -global-isel -verify-machineinstrs < %s -o - | FileCheck %s --check-prefix=ALL --check-prefix=X64_GISEL
; RUN: llc -mtriple=x86_64-linux-gnu -verify-machineinstrs < %s -o - | FileCheck %s --check-prefix=ALL --check-prefix=X64		; RUN: llc -mtriple=x86_64-linux-gnu -verify-machineinstrs < %s -o - | FileCheck %s --check-prefix=ALL --check-prefix=X64

define i32* @test_gep_i8(i32 *%arr, i8 %ind) {		define i32* @test_gep_i8(i32 *%arr, i8 %ind) {
; X64_GISEL-LABEL: test_gep_i8:		; X64_GISEL-LABEL: test_gep_i8:
; X64_GISEL: # %bb.0:		; X64_GISEL: # %bb.0:
; X64_GISEL-NEXT: # kill: def $esi killed $esi def $rsi		; X64_GISEL-NEXT: # kill: def $esi killed $esi def $rsi
; X64_GISEL-NEXT: movq $56, %rcx		; X64_GISEL-NEXT: movq $56, %rcx
; X64_GISEL-NEXT: shlq %cl, %rsi		; X64_GISEL-NEXT: shlq %cl, %rsi
; X64_GISEL-NEXT: movq $56, %rcx		; X64_GISEL-NEXT: movq $56, %rcx
; X64_GISEL-NEXT: sarq %cl, %rsi		; X64_GISEL-NEXT: sarq %cl, %rsi
; X64_GISEL-NEXT: movq $4, %rax		; X64_GISEL-NEXT: movq $4, %rax
; X64_GISEL-NEXT: imulq %rsi, %rax		; X64_GISEL-NEXT: imulq %rsi, %rax
; X64_GISEL-NEXT: leaq (%rdi,%rax), %rax		; X64_GISEL-NEXT: addq %rdi, %rax
; X64_GISEL-NEXT: retq		; X64_GISEL-NEXT: retq
;		;
; X64-LABEL: test_gep_i8:		; X64-LABEL: test_gep_i8:
; X64: # %bb.0:		; X64: # %bb.0:
; X64-NEXT: # kill: def $esi killed $esi def $rsi		; X64-NEXT: # kill: def $esi killed $esi def $rsi
; X64-NEXT: movsbq %sil, %rax		; X64-NEXT: movsbq %sil, %rax
; X64-NEXT: leaq (%rdi,%rax,4), %rax		; X64-NEXT: leaq (%rdi,%rax,4), %rax
; X64-NEXT: retq		; X64-NEXT: retq
%arrayidx = getelementptr i32, i32* %arr, i8 %ind		%arrayidx = getelementptr i32, i32* %arr, i8 %ind
ret i32* %arrayidx		ret i32* %arrayidx
}		}

define i32* @test_gep_i8_const(i32 *%arr) {		define i32* @test_gep_i8_const(i32 *%arr) {
; X64_GISEL-LABEL: test_gep_i8_const:		; X64_GISEL-LABEL: test_gep_i8_const:
; X64_GISEL: # %bb.0:		; X64_GISEL: # %bb.0:
; X64_GISEL-NEXT: movq $80, %rax		; X64_GISEL-NEXT: movq $80, %rax
; X64_GISEL-NEXT: leaq (%rdi,%rax), %rax		; X64_GISEL-NEXT: addq %rdi, %rax
; X64_GISEL-NEXT: retq		; X64_GISEL-NEXT: retq
;		;
; X64-LABEL: test_gep_i8_const:		; X64-LABEL: test_gep_i8_const:
; X64: # %bb.0:		; X64: # %bb.0:
; X64-NEXT: leaq 80(%rdi), %rax		; X64-NEXT: leaq 80(%rdi), %rax
; X64-NEXT: retq		; X64-NEXT: retq
%arrayidx = getelementptr i32, i32* %arr, i8 20		%arrayidx = getelementptr i32, i32* %arr, i8 20
ret i32* %arrayidx		ret i32* %arrayidx
}		}

define i32* @test_gep_i16(i32 *%arr, i16 %ind) {		define i32* @test_gep_i16(i32 *%arr, i16 %ind) {
; X64_GISEL-LABEL: test_gep_i16:		; X64_GISEL-LABEL: test_gep_i16:
; X64_GISEL: # %bb.0:		; X64_GISEL: # %bb.0:
; X64_GISEL-NEXT: # kill: def $esi killed $esi def $rsi		; X64_GISEL-NEXT: # kill: def $esi killed $esi def $rsi
; X64_GISEL-NEXT: movq $48, %rcx		; X64_GISEL-NEXT: movq $48, %rcx
; X64_GISEL-NEXT: shlq %cl, %rsi		; X64_GISEL-NEXT: shlq %cl, %rsi
; X64_GISEL-NEXT: movq $48, %rcx		; X64_GISEL-NEXT: movq $48, %rcx
; X64_GISEL-NEXT: sarq %cl, %rsi		; X64_GISEL-NEXT: sarq %cl, %rsi
; X64_GISEL-NEXT: movq $4, %rax		; X64_GISEL-NEXT: movq $4, %rax
; X64_GISEL-NEXT: imulq %rsi, %rax		; X64_GISEL-NEXT: imulq %rsi, %rax
; X64_GISEL-NEXT: leaq (%rdi,%rax), %rax		; X64_GISEL-NEXT: addq %rdi, %rax
; X64_GISEL-NEXT: retq		; X64_GISEL-NEXT: retq
;		;
; X64-LABEL: test_gep_i16:		; X64-LABEL: test_gep_i16:
; X64: # %bb.0:		; X64: # %bb.0:
; X64-NEXT: # kill: def $esi killed $esi def $rsi		; X64-NEXT: # kill: def $esi killed $esi def $rsi
; X64-NEXT: movswq %si, %rax		; X64-NEXT: movswq %si, %rax
; X64-NEXT: leaq (%rdi,%rax,4), %rax		; X64-NEXT: leaq (%rdi,%rax,4), %rax
; X64-NEXT: retq		; X64-NEXT: retq
%arrayidx = getelementptr i32, i32* %arr, i16 %ind		%arrayidx = getelementptr i32, i32* %arr, i16 %ind
ret i32* %arrayidx		ret i32* %arrayidx
}		}

define i32* @test_gep_i16_const(i32 *%arr) {		define i32* @test_gep_i16_const(i32 *%arr) {
; X64_GISEL-LABEL: test_gep_i16_const:		; X64_GISEL-LABEL: test_gep_i16_const:
; X64_GISEL: # %bb.0:		; X64_GISEL: # %bb.0:
; X64_GISEL-NEXT: movq $80, %rax		; X64_GISEL-NEXT: movq $80, %rax
; X64_GISEL-NEXT: leaq (%rdi,%rax), %rax		; X64_GISEL-NEXT: addq %rdi, %rax
; X64_GISEL-NEXT: retq		; X64_GISEL-NEXT: retq
;		;
; X64-LABEL: test_gep_i16_const:		; X64-LABEL: test_gep_i16_const:
; X64: # %bb.0:		; X64: # %bb.0:
; X64-NEXT: leaq 80(%rdi), %rax		; X64-NEXT: leaq 80(%rdi), %rax
; X64-NEXT: retq		; X64-NEXT: retq
%arrayidx = getelementptr i32, i32* %arr, i16 20		%arrayidx = getelementptr i32, i32* %arr, i16 20
ret i32* %arrayidx		ret i32* %arrayidx
[16 lines not shown]	; X64-NEXT: retq
%arrayidx = getelementptr i32, i32* %arr, i32 %ind		%arrayidx = getelementptr i32, i32* %arr, i32 %ind
ret i32* %arrayidx		ret i32* %arrayidx
}		}

define i32* @test_gep_i32_const(i32 *%arr) {		define i32* @test_gep_i32_const(i32 *%arr) {
; X64_GISEL-LABEL: test_gep_i32_const:		; X64_GISEL-LABEL: test_gep_i32_const:
; X64_GISEL: # %bb.0:		; X64_GISEL: # %bb.0:
; X64_GISEL-NEXT: movq $20, %rax		; X64_GISEL-NEXT: movq $20, %rax
; X64_GISEL-NEXT: leaq (%rdi,%rax), %rax		; X64_GISEL-NEXT: addq %rdi, %rax
; X64_GISEL-NEXT: retq		; X64_GISEL-NEXT: retq
;		;
; X64-LABEL: test_gep_i32_const:		; X64-LABEL: test_gep_i32_const:
; X64: # %bb.0:		; X64: # %bb.0:
; X64-NEXT: leaq 20(%rdi), %rax		; X64-NEXT: leaq 20(%rdi), %rax
; X64-NEXT: retq		; X64-NEXT: retq
%arrayidx = getelementptr i32, i32* %arr, i32 5		%arrayidx = getelementptr i32, i32* %arr, i32 5
ret i32* %arrayidx		ret i32* %arrayidx
}		}

define i32* @test_gep_i64(i32 *%arr, i64 %ind) {		define i32* @test_gep_i64(i32 *%arr, i64 %ind) {
; X64_GISEL-LABEL: test_gep_i64:		; X64_GISEL-LABEL: test_gep_i64:
; X64_GISEL: # %bb.0:		; X64_GISEL: # %bb.0:
; X64_GISEL-NEXT: movq $4, %rax		; X64_GISEL-NEXT: movq $4, %rax
; X64_GISEL-NEXT: imulq %rsi, %rax		; X64_GISEL-NEXT: imulq %rsi, %rax
; X64_GISEL-NEXT: leaq (%rdi,%rax), %rax		; X64_GISEL-NEXT: addq %rdi, %rax
; X64_GISEL-NEXT: retq		; X64_GISEL-NEXT: retq
;		;
; X64-LABEL: test_gep_i64:		; X64-LABEL: test_gep_i64:
; X64: # %bb.0:		; X64: # %bb.0:
; X64-NEXT: leaq (%rdi,%rsi,4), %rax		; X64-NEXT: leaq (%rdi,%rsi,4), %rax
; X64-NEXT: retq		; X64-NEXT: retq
%arrayidx = getelementptr i32, i32* %arr, i64 %ind		%arrayidx = getelementptr i32, i32* %arr, i64 %ind
ret i32* %arrayidx		ret i32* %arrayidx
}		}

define i32* @test_gep_i64_const(i32 *%arr) {		define i32* @test_gep_i64_const(i32 *%arr) {
; X64_GISEL-LABEL: test_gep_i64_const:		; X64_GISEL-LABEL: test_gep_i64_const:
; X64_GISEL: # %bb.0:		; X64_GISEL: # %bb.0:
; X64_GISEL-NEXT: movq $20, %rax		; X64_GISEL-NEXT: movq $20, %rax
; X64_GISEL-NEXT: leaq (%rdi,%rax), %rax		; X64_GISEL-NEXT: addq %rdi, %rax
; X64_GISEL-NEXT: retq		; X64_GISEL-NEXT: retq
;		;
; X64-LABEL: test_gep_i64_const:		; X64-LABEL: test_gep_i64_const:
; X64: # %bb.0:		; X64: # %bb.0:
; X64-NEXT: leaq 20(%rdi), %rax		; X64-NEXT: leaq 20(%rdi), %rax
; X64-NEXT: retq		; X64-NEXT: retq
%arrayidx = getelementptr i32, i32* %arr, i64 5		%arrayidx = getelementptr i32, i32* %arr, i64 5
ret i32* %arrayidx		ret i32* %arrayidx
}		}

llvm/trunk/test/CodeGen/X86/GlobalISel/memop-scalar.ll

[175 lines not shown]
; ALL-NEXT: retq		; ALL-NEXT: retq
ret i32 %r		ret i32 %r
}		}

; check that gep index doesn't folded into memory operand		; check that gep index doesn't folded into memory operand
define i32 @test_gep_folding_largeGepIndex(i32* %arr, i32 %val) {		define i32 @test_gep_folding_largeGepIndex(i32* %arr, i32 %val) {
; ALL-LABEL: test_gep_folding_largeGepIndex:		; ALL-LABEL: test_gep_folding_largeGepIndex:
; ALL: # %bb.0:		; ALL: # %bb.0:
; ALL-NEXT: movabsq $228719476720, %rax # imm = 0x3540BE3FF0		; ALL-NEXT: movabsq $228719476720, %rax # imm = 0x3540BE3FF0
; ALL-NEXT: leaq (%rdi,%rax), %rax		; ALL-NEXT: addq %rdi, %rax
; ALL-NEXT: movl %esi, (%rax)		; ALL-NEXT: movl %esi, (%rax)
; ALL-NEXT: movl (%rax), %eax		; ALL-NEXT: movl (%rax), %eax
; ALL-NEXT: retq		; ALL-NEXT: retq
%arrayidx = getelementptr i32, i32* %arr, i64 57179869180		%arrayidx = getelementptr i32, i32* %arr, i64 57179869180
store i32 %val, i32* %arrayidx		store i32 %val, i32* %arrayidx
%r = load i32, i32* %arrayidx		%r = load i32, i32* %arrayidx
ret i32 %r		ret i32 %r
}		}

llvm/trunk/test/CodeGen/X86/MergeConsecutiveStores.ll

	[626 lines not shown]
	; BWON-LABEL: loadStoreBaseIndexOffsetSextNoSex:			; BWON-LABEL: loadStoreBaseIndexOffsetSextNoSex:
	; BWON: # %bb.0:			; BWON: # %bb.0:
	; BWON-NEXT: movl %ecx, %r8d			; BWON-NEXT: movl %ecx, %r8d
	; BWON-NEXT: xorl %ecx, %ecx			; BWON-NEXT: xorl %ecx, %ecx
	; BWON-NEXT: .p2align 4, 0x90			; BWON-NEXT: .p2align 4, 0x90
	; BWON-NEXT: .LBB12_1: # =>This Inner Loop Header: Depth=1			; BWON-NEXT: .LBB12_1: # =>This Inner Loop Header: Depth=1
	; BWON-NEXT: movsbq (%rdi,%rcx), %rax			; BWON-NEXT: movsbq (%rdi,%rcx), %rax
	; BWON-NEXT: movzbl (%rdx,%rax), %r9d			; BWON-NEXT: movzbl (%rdx,%rax), %r9d
	; BWON-NEXT: leal 1(%rax), %eax			; BWON-NEXT: incl %eax
	; BWON-NEXT: movsbq %al, %rax			; BWON-NEXT: movsbq %al, %rax
	; BWON-NEXT: movzbl (%rdx,%rax), %eax			; BWON-NEXT: movzbl (%rdx,%rax), %eax
	; BWON-NEXT: movb %r9b, (%rsi,%rcx,2)			; BWON-NEXT: movb %r9b, (%rsi,%rcx,2)
	; BWON-NEXT: movb %al, 1(%rsi,%rcx,2)			; BWON-NEXT: movb %al, 1(%rsi,%rcx,2)
	; BWON-NEXT: incq %rcx			; BWON-NEXT: incq %rcx
	; BWON-NEXT: cmpl %ecx, %r8d			; BWON-NEXT: cmpl %ecx, %r8d
	; BWON-NEXT: jne .LBB12_1			; BWON-NEXT: jne .LBB12_1
	; BWON-NEXT: # %bb.2:			; BWON-NEXT: # %bb.2:
	; BWON-NEXT: retq			; BWON-NEXT: retq
	;			;
	; BWOFF-LABEL: loadStoreBaseIndexOffsetSextNoSex:			; BWOFF-LABEL: loadStoreBaseIndexOffsetSextNoSex:
	; BWOFF: # %bb.0:			; BWOFF: # %bb.0:
	; BWOFF-NEXT: movl %ecx, %r8d			; BWOFF-NEXT: movl %ecx, %r8d
	; BWOFF-NEXT: xorl %ecx, %ecx			; BWOFF-NEXT: xorl %ecx, %ecx
	; BWOFF-NEXT: .p2align 4, 0x90			; BWOFF-NEXT: .p2align 4, 0x90
	; BWOFF-NEXT: .LBB12_1: # =>This Inner Loop Header: Depth=1			; BWOFF-NEXT: .LBB12_1: # =>This Inner Loop Header: Depth=1
	; BWOFF-NEXT: movsbq (%rdi,%rcx), %rax			; BWOFF-NEXT: movsbq (%rdi,%rcx), %rax
	; BWOFF-NEXT: movb (%rdx,%rax), %r9b			; BWOFF-NEXT: movb (%rdx,%rax), %r9b
	; BWOFF-NEXT: leal 1(%rax), %eax			; BWOFF-NEXT: incl %eax
	; BWOFF-NEXT: movsbq %al, %rax			; BWOFF-NEXT: movsbq %al, %rax
	; BWOFF-NEXT: movb (%rdx,%rax), %al			; BWOFF-NEXT: movb (%rdx,%rax), %al
	; BWOFF-NEXT: movb %r9b, (%rsi,%rcx,2)			; BWOFF-NEXT: movb %r9b, (%rsi,%rcx,2)
	; BWOFF-NEXT: movb %al, 1(%rsi,%rcx,2)			; BWOFF-NEXT: movb %al, 1(%rsi,%rcx,2)
	; BWOFF-NEXT: incq %rcx			; BWOFF-NEXT: incq %rcx
	; BWOFF-NEXT: cmpl %ecx, %r8d			; BWOFF-NEXT: cmpl %ecx, %r8d
	; BWOFF-NEXT: jne .LBB12_1			; BWOFF-NEXT: jne .LBB12_1
	; BWOFF-NEXT: # %bb.2:			; BWOFF-NEXT: # %bb.2:
	[260 lines not shown]

llvm/trunk/test/CodeGen/X86/atomic-unordered.ll

	[766 lines not shown]
	; CHECK-O3-NEXT: movq (%rdi), %rcx			; CHECK-O3-NEXT: movq (%rdi), %rcx
	; CHECK-O3-NEXT: movabsq $-8608480567731124087, %rdx # imm = 0x8888888888888889			; CHECK-O3-NEXT: movabsq $-8608480567731124087, %rdx # imm = 0x8888888888888889
	; CHECK-O3-NEXT: movq %rcx, %rax			; CHECK-O3-NEXT: movq %rcx, %rax
	; CHECK-O3-NEXT: imulq %rdx			; CHECK-O3-NEXT: imulq %rdx
	; CHECK-O3-NEXT: addq %rcx, %rdx			; CHECK-O3-NEXT: addq %rcx, %rdx
	; CHECK-O3-NEXT: movq %rdx, %rax			; CHECK-O3-NEXT: movq %rdx, %rax
	; CHECK-O3-NEXT: shrq $63, %rax			; CHECK-O3-NEXT: shrq $63, %rax
	; CHECK-O3-NEXT: sarq $3, %rdx			; CHECK-O3-NEXT: sarq $3, %rdx
	; CHECK-O3-NEXT: leaq (%rdx,%rax), %rax			; CHECK-O3-NEXT: addq %rdx, %rax
	; CHECK-O3-NEXT: retq			; CHECK-O3-NEXT: retq
	%v = load atomic i64, i64* %p unordered, align 8			%v = load atomic i64, i64* %p unordered, align 8
	%ret = sdiv i64 %v, 15			%ret = sdiv i64 %v, 15
	ret i64 %ret			ret i64 %ret
	}			}

	; Legal to fold (TODO)			; Legal to fold (TODO)
	define i64 @load_fold_sdiv2(i64* %p, i64 %v2) {			define i64 @load_fold_sdiv2(i64* %p, i64 %v2) {
	[1,765 lines not shown]

llvm/trunk/test/CodeGen/X86/avx512vl-intrinsics-upgrade.ll

	[9,262 lines not shown]
	;			;
	; X64-LABEL: test_int_x86_avx512_ptestm_d_128:			; X64-LABEL: test_int_x86_avx512_ptestm_d_128:
	; X64: # %bb.0:			; X64: # %bb.0:
	; X64-NEXT: vptestmd %xmm1, %xmm0, %k0 # encoding: [0x62,0xf2,0x7d,0x08,0x27,0xc1]			; X64-NEXT: vptestmd %xmm1, %xmm0, %k0 # encoding: [0x62,0xf2,0x7d,0x08,0x27,0xc1]
	; X64-NEXT: kmovw %edi, %k1 # encoding: [0xc5,0xf8,0x92,0xcf]			; X64-NEXT: kmovw %edi, %k1 # encoding: [0xc5,0xf8,0x92,0xcf]
	; X64-NEXT: kandw %k1, %k0, %k1 # encoding: [0xc5,0xfc,0x41,0xc9]			; X64-NEXT: kandw %k1, %k0, %k1 # encoding: [0xc5,0xfc,0x41,0xc9]
	; X64-NEXT: kmovw %k1, %eax # encoding: [0xc5,0xf8,0x93,0xc1]			; X64-NEXT: kmovw %k1, %eax # encoding: [0xc5,0xf8,0x93,0xc1]
	; X64-NEXT: kmovw %k0, %ecx # encoding: [0xc5,0xf8,0x93,0xc8]			; X64-NEXT: kmovw %k0, %ecx # encoding: [0xc5,0xf8,0x93,0xc8]
	; X64-NEXT: leal (%rcx,%rax), %eax # encoding: [0x8d,0x04,0x01]			; X64-NEXT: addl %ecx, %eax # encoding: [0x01,0xc8]
	; X64-NEXT: # kill: def $al killed $al killed $eax			; X64-NEXT: # kill: def $al killed $al killed $eax
	; X64-NEXT: retq # encoding: [0xc3]			; X64-NEXT: retq # encoding: [0xc3]
	%res = call i8 @llvm.x86.avx512.ptestm.d.128(<4 x i32> %x0, <4 x i32> %x1, i8 %x2)			%res = call i8 @llvm.x86.avx512.ptestm.d.128(<4 x i32> %x0, <4 x i32> %x1, i8 %x2)
	%res1 = call i8 @llvm.x86.avx512.ptestm.d.128(<4 x i32> %x0, <4 x i32> %x1, i8-1)			%res1 = call i8 @llvm.x86.avx512.ptestm.d.128(<4 x i32> %x0, <4 x i32> %x1, i8-1)
	%res2 = add i8 %res, %res1			%res2 = add i8 %res, %res1
	ret i8 %res2			ret i8 %res2
	}			}

	[42 lines not shown]
	;			;
	; X64-LABEL: test_int_x86_avx512_ptestm_q_128:			; X64-LABEL: test_int_x86_avx512_ptestm_q_128:
	; X64: # %bb.0:			; X64: # %bb.0:
	; X64-NEXT: vptestmq %xmm1, %xmm0, %k0 # encoding: [0x62,0xf2,0xfd,0x08,0x27,0xc1]			; X64-NEXT: vptestmq %xmm1, %xmm0, %k0 # encoding: [0x62,0xf2,0xfd,0x08,0x27,0xc1]
	; X64-NEXT: kmovw %edi, %k1 # encoding: [0xc5,0xf8,0x92,0xcf]			; X64-NEXT: kmovw %edi, %k1 # encoding: [0xc5,0xf8,0x92,0xcf]
	; X64-NEXT: kandw %k1, %k0, %k1 # encoding: [0xc5,0xfc,0x41,0xc9]			; X64-NEXT: kandw %k1, %k0, %k1 # encoding: [0xc5,0xfc,0x41,0xc9]
	; X64-NEXT: kmovw %k1, %eax # encoding: [0xc5,0xf8,0x93,0xc1]			; X64-NEXT: kmovw %k1, %eax # encoding: [0xc5,0xf8,0x93,0xc1]
	; X64-NEXT: kmovw %k0, %ecx # encoding: [0xc5,0xf8,0x93,0xc8]			; X64-NEXT: kmovw %k0, %ecx # encoding: [0xc5,0xf8,0x93,0xc8]
	; X64-NEXT: leal (%rcx,%rax), %eax # encoding: [0x8d,0x04,0x01]			; X64-NEXT: addl %ecx, %eax # encoding: [0x01,0xc8]
	; X64-NEXT: # kill: def $al killed $al killed $eax			; X64-NEXT: # kill: def $al killed $al killed $eax
	; X64-NEXT: retq # encoding: [0xc3]			; X64-NEXT: retq # encoding: [0xc3]
	%res = call i8 @llvm.x86.avx512.ptestm.q.128(<2 x i64> %x0, <2 x i64> %x1, i8 %x2)			%res = call i8 @llvm.x86.avx512.ptestm.q.128(<2 x i64> %x0, <2 x i64> %x1, i8 %x2)
	%res1 = call i8 @llvm.x86.avx512.ptestm.q.128(<2 x i64> %x0, <2 x i64> %x1, i8-1)			%res1 = call i8 @llvm.x86.avx512.ptestm.q.128(<2 x i64> %x0, <2 x i64> %x1, i8-1)
	%res2 = add i8 %res, %res1			%res2 = add i8 %res, %res1
	ret i8 %res2			ret i8 %res2
	}			}

	[15 lines not shown]
	;			;
	; X64-LABEL: test_int_x86_avx512_ptestm_q_256:			; X64-LABEL: test_int_x86_avx512_ptestm_q_256:
	; X64: # %bb.0:			; X64: # %bb.0:
	; X64-NEXT: vptestmq %ymm1, %ymm0, %k0 # encoding: [0x62,0xf2,0xfd,0x28,0x27,0xc1]			; X64-NEXT: vptestmq %ymm1, %ymm0, %k0 # encoding: [0x62,0xf2,0xfd,0x28,0x27,0xc1]
	; X64-NEXT: kmovw %edi, %k1 # encoding: [0xc5,0xf8,0x92,0xcf]			; X64-NEXT: kmovw %edi, %k1 # encoding: [0xc5,0xf8,0x92,0xcf]
	; X64-NEXT: kandw %k1, %k0, %k1 # encoding: [0xc5,0xfc,0x41,0xc9]			; X64-NEXT: kandw %k1, %k0, %k1 # encoding: [0xc5,0xfc,0x41,0xc9]
	; X64-NEXT: kmovw %k1, %eax # encoding: [0xc5,0xf8,0x93,0xc1]			; X64-NEXT: kmovw %k1, %eax # encoding: [0xc5,0xf8,0x93,0xc1]
	; X64-NEXT: kmovw %k0, %ecx # encoding: [0xc5,0xf8,0x93,0xc8]			; X64-NEXT: kmovw %k0, %ecx # encoding: [0xc5,0xf8,0x93,0xc8]
	; X64-NEXT: leal (%rcx,%rax), %eax # encoding: [0x8d,0x04,0x01]			; X64-NEXT: addl %ecx, %eax # encoding: [0x01,0xc8]
	; X64-NEXT: # kill: def $al killed $al killed $eax			; X64-NEXT: # kill: def $al killed $al killed $eax
	; X64-NEXT: vzeroupper # encoding: [0xc5,0xf8,0x77]			; X64-NEXT: vzeroupper # encoding: [0xc5,0xf8,0x77]
	; X64-NEXT: retq # encoding: [0xc3]			; X64-NEXT: retq # encoding: [0xc3]
	%res = call i8 @llvm.x86.avx512.ptestm.q.256(<4 x i64> %x0, <4 x i64> %x1, i8 %x2)			%res = call i8 @llvm.x86.avx512.ptestm.q.256(<4 x i64> %x0, <4 x i64> %x1, i8 %x2)
	%res1 = call i8 @llvm.x86.avx512.ptestm.q.256(<4 x i64> %x0, <4 x i64> %x1, i8-1)			%res1 = call i8 @llvm.x86.avx512.ptestm.q.256(<4 x i64> %x0, <4 x i64> %x1, i8-1)
	%res2 = add i8 %res, %res1			%res2 = add i8 %res, %res1
	ret i8 %res2			ret i8 %res2
	}			}
	[15 lines not shown]
	;			;
	; X64-LABEL: test_int_x86_avx512_ptestnm_d_128:			; X64-LABEL: test_int_x86_avx512_ptestnm_d_128:
	; X64: # %bb.0:			; X64: # %bb.0:
	; X64-NEXT: vptestnmd %xmm1, %xmm0, %k0 # encoding: [0x62,0xf2,0x7e,0x08,0x27,0xc1]			; X64-NEXT: vptestnmd %xmm1, %xmm0, %k0 # encoding: [0x62,0xf2,0x7e,0x08,0x27,0xc1]
	; X64-NEXT: kmovw %edi, %k1 # encoding: [0xc5,0xf8,0x92,0xcf]			; X64-NEXT: kmovw %edi, %k1 # encoding: [0xc5,0xf8,0x92,0xcf]
	; X64-NEXT: kandw %k1, %k0, %k1 # encoding: [0xc5,0xfc,0x41,0xc9]			; X64-NEXT: kandw %k1, %k0, %k1 # encoding: [0xc5,0xfc,0x41,0xc9]
	; X64-NEXT: kmovw %k1, %eax # encoding: [0xc5,0xf8,0x93,0xc1]			; X64-NEXT: kmovw %k1, %eax # encoding: [0xc5,0xf8,0x93,0xc1]
	; X64-NEXT: kmovw %k0, %ecx # encoding: [0xc5,0xf8,0x93,0xc8]			; X64-NEXT: kmovw %k0, %ecx # encoding: [0xc5,0xf8,0x93,0xc8]
	; X64-NEXT: leal (%rcx,%rax), %eax # encoding: [0x8d,0x04,0x01]			; X64-NEXT: addl %ecx, %eax # encoding: [0x01,0xc8]
	; X64-NEXT: # kill: def $al killed $al killed $eax			; X64-NEXT: # kill: def $al killed $al killed $eax
	; X64-NEXT: retq # encoding: [0xc3]			; X64-NEXT: retq # encoding: [0xc3]
	%res = call i8 @llvm.x86.avx512.ptestnm.d.128(<4 x i32> %x0, <4 x i32> %x1, i8 %x2)			%res = call i8 @llvm.x86.avx512.ptestnm.d.128(<4 x i32> %x0, <4 x i32> %x1, i8 %x2)
	%res1 = call i8 @llvm.x86.avx512.ptestnm.d.128(<4 x i32> %x0, <4 x i32> %x1, i8-1)			%res1 = call i8 @llvm.x86.avx512.ptestnm.d.128(<4 x i32> %x0, <4 x i32> %x1, i8-1)
	%res2 = add i8 %res, %res1			%res2 = add i8 %res, %res1
	ret i8 %res2			ret i8 %res2
	}			}

	[42 lines not shown]
	;			;
	; X64-LABEL: test_int_x86_avx512_ptestnm_q_128:			; X64-LABEL: test_int_x86_avx512_ptestnm_q_128:
	; X64: # %bb.0:			; X64: # %bb.0:
	; X64-NEXT: vptestnmq %xmm1, %xmm0, %k0 # encoding: [0x62,0xf2,0xfe,0x08,0x27,0xc1]			; X64-NEXT: vptestnmq %xmm1, %xmm0, %k0 # encoding: [0x62,0xf2,0xfe,0x08,0x27,0xc1]
	; X64-NEXT: kmovw %edi, %k1 # encoding: [0xc5,0xf8,0x92,0xcf]			; X64-NEXT: kmovw %edi, %k1 # encoding: [0xc5,0xf8,0x92,0xcf]
	; X64-NEXT: kandw %k1, %k0, %k1 # encoding: [0xc5,0xfc,0x41,0xc9]			; X64-NEXT: kandw %k1, %k0, %k1 # encoding: [0xc5,0xfc,0x41,0xc9]
	; X64-NEXT: kmovw %k1, %eax # encoding: [0xc5,0xf8,0x93,0xc1]			; X64-NEXT: kmovw %k1, %eax # encoding: [0xc5,0xf8,0x93,0xc1]
	; X64-NEXT: kmovw %k0, %ecx # encoding: [0xc5,0xf8,0x93,0xc8]			; X64-NEXT: kmovw %k0, %ecx # encoding: [0xc5,0xf8,0x93,0xc8]
	; X64-NEXT: leal (%rcx,%rax), %eax # encoding: [0x8d,0x04,0x01]			; X64-NEXT: addl %ecx, %eax # encoding: [0x01,0xc8]
	; X64-NEXT: # kill: def $al killed $al killed $eax			; X64-NEXT: # kill: def $al killed $al killed $eax
	; X64-NEXT: retq # encoding: [0xc3]			; X64-NEXT: retq # encoding: [0xc3]
	%res = call i8 @llvm.x86.avx512.ptestnm.q.128(<2 x i64> %x0, <2 x i64> %x1, i8 %x2)			%res = call i8 @llvm.x86.avx512.ptestnm.q.128(<2 x i64> %x0, <2 x i64> %x1, i8 %x2)
	%res1 = call i8 @llvm.x86.avx512.ptestnm.q.128(<2 x i64> %x0, <2 x i64> %x1, i8-1)			%res1 = call i8 @llvm.x86.avx512.ptestnm.q.128(<2 x i64> %x0, <2 x i64> %x1, i8-1)
	%res2 = add i8 %res, %res1			%res2 = add i8 %res, %res1
	ret i8 %res2			ret i8 %res2
	}			}

	[15 lines not shown]
	;			;
	; X64-LABEL: test_int_x86_avx512_ptestnm_q_256:			; X64-LABEL: test_int_x86_avx512_ptestnm_q_256:
	; X64: # %bb.0:			; X64: # %bb.0:
	; X64-NEXT: vptestnmq %ymm1, %ymm0, %k0 # encoding: [0x62,0xf2,0xfe,0x28,0x27,0xc1]			; X64-NEXT: vptestnmq %ymm1, %ymm0, %k0 # encoding: [0x62,0xf2,0xfe,0x28,0x27,0xc1]
	; X64-NEXT: kmovw %edi, %k1 # encoding: [0xc5,0xf8,0x92,0xcf]			; X64-NEXT: kmovw %edi, %k1 # encoding: [0xc5,0xf8,0x92,0xcf]
	; X64-NEXT: kandw %k1, %k0, %k1 # encoding: [0xc5,0xfc,0x41,0xc9]			; X64-NEXT: kandw %k1, %k0, %k1 # encoding: [0xc5,0xfc,0x41,0xc9]
	; X64-NEXT: kmovw %k1, %eax # encoding: [0xc5,0xf8,0x93,0xc1]			; X64-NEXT: kmovw %k1, %eax # encoding: [0xc5,0xf8,0x93,0xc1]
	; X64-NEXT: kmovw %k0, %ecx # encoding: [0xc5,0xf8,0x93,0xc8]			; X64-NEXT: kmovw %k0, %ecx # encoding: [0xc5,0xf8,0x93,0xc8]
	; X64-NEXT: leal (%rcx,%rax), %eax # encoding: [0x8d,0x04,0x01]			; X64-NEXT: addl %ecx, %eax # encoding: [0x01,0xc8]
	; X64-NEXT: # kill: def $al killed $al killed $eax			; X64-NEXT: # kill: def $al killed $al killed $eax
	; X64-NEXT: vzeroupper # encoding: [0xc5,0xf8,0x77]			; X64-NEXT: vzeroupper # encoding: [0xc5,0xf8,0x77]
	; X64-NEXT: retq # encoding: [0xc3]			; X64-NEXT: retq # encoding: [0xc3]
	%res = call i8 @llvm.x86.avx512.ptestnm.q.256(<4 x i64> %x0, <4 x i64> %x1, i8 %x2)			%res = call i8 @llvm.x86.avx512.ptestnm.q.256(<4 x i64> %x0, <4 x i64> %x1, i8 %x2)
	%res1 = call i8 @llvm.x86.avx512.ptestnm.q.256(<4 x i64> %x0, <4 x i64> %x1, i8-1)			%res1 = call i8 @llvm.x86.avx512.ptestnm.q.256(<4 x i64> %x0, <4 x i64> %x1, i8-1)
	%res2 = add i8 %res, %res1			%res2 = add i8 %res, %res1
	ret i8 %res2			ret i8 %res2
	}			}
	[5,822 lines not shown]

llvm/trunk/test/CodeGen/X86/bitreverse.ll

	[341 lines not shown]
	; X64-NEXT: andb $-52, %dil			; X64-NEXT: andb $-52, %dil
	; X64-NEXT: shrb $2, %dil			; X64-NEXT: shrb $2, %dil
	; X64-NEXT: orb %al, %dil			; X64-NEXT: orb %al, %dil
	; X64-NEXT: movl %edi, %eax			; X64-NEXT: movl %edi, %eax
	; X64-NEXT: andb $85, %al			; X64-NEXT: andb $85, %al
	; X64-NEXT: addb %al, %al			; X64-NEXT: addb %al, %al
	; X64-NEXT: andb $-86, %dil			; X64-NEXT: andb $-86, %dil
	; X64-NEXT: shrb %dil			; X64-NEXT: shrb %dil
	; X64-NEXT: leal (%rdi,%rax), %eax			; X64-NEXT: addl %edi, %eax
	; X64-NEXT: # kill: def $al killed $al killed $eax			; X64-NEXT: # kill: def $al killed $al killed $eax
	; X64-NEXT: retq			; X64-NEXT: retq
	%b = call i8 @llvm.bitreverse.i8(i8 %a)			%b = call i8 @llvm.bitreverse.i8(i8 %a)
	ret i8 %b			ret i8 %b
	}			}

	declare i4 @llvm.bitreverse.i4(i4) readnone			declare i4 @llvm.bitreverse.i4(i4) readnone

	[27 lines not shown]
	; X64-NEXT: andb $-52, %dil			; X64-NEXT: andb $-52, %dil
	; X64-NEXT: shrb $2, %dil			; X64-NEXT: shrb $2, %dil
	; X64-NEXT: orb %al, %dil			; X64-NEXT: orb %al, %dil
	; X64-NEXT: movl %edi, %eax			; X64-NEXT: movl %edi, %eax
	; X64-NEXT: andb $80, %al			; X64-NEXT: andb $80, %al
	; X64-NEXT: addb %al, %al			; X64-NEXT: addb %al, %al
	; X64-NEXT: andb $-96, %dil			; X64-NEXT: andb $-96, %dil
	; X64-NEXT: shrb %dil			; X64-NEXT: shrb %dil
	; X64-NEXT: leal (%rdi,%rax), %eax			; X64-NEXT: addl %edi, %eax
	; X64-NEXT: shrb $4, %al			; X64-NEXT: shrb $4, %al
	; X64-NEXT: # kill: def $al killed $al killed $eax			; X64-NEXT: # kill: def $al killed $al killed $eax
	; X64-NEXT: retq			; X64-NEXT: retq
	%b = call i4 @llvm.bitreverse.i4(i4 %a)			%b = call i4 @llvm.bitreverse.i4(i4 %a)
	ret i4 %b			ret i4 %b
	}			}

	; These tests check that bitreverse(constant) calls are folded			; These tests check that bitreverse(constant) calls are folded
	[734 lines not shown]

llvm/trunk/test/CodeGen/X86/bswap_tree2.ll

	[75 lines not shown]
	; CHECK64-NEXT: movl %edi, %eax			; CHECK64-NEXT: movl %edi, %eax
	; CHECK64-NEXT: shrl $8, %eax			; CHECK64-NEXT: shrl $8, %eax
	; CHECK64-NEXT: shll $8, %edi			; CHECK64-NEXT: shll $8, %edi
	; CHECK64-NEXT: movl %eax, %ecx			; CHECK64-NEXT: movl %eax, %ecx
	; CHECK64-NEXT: andl $65280, %ecx # imm = 0xFF00			; CHECK64-NEXT: andl $65280, %ecx # imm = 0xFF00
	; CHECK64-NEXT: andl $-16777216, %edi # imm = 0xFF000000			; CHECK64-NEXT: andl $-16777216, %edi # imm = 0xFF000000
	; CHECK64-NEXT: andl $16711680, %eax # imm = 0xFF0000			; CHECK64-NEXT: andl $16711680, %eax # imm = 0xFF0000
	; CHECK64-NEXT: orl %edi, %eax			; CHECK64-NEXT: orl %edi, %eax
	; CHECK64-NEXT: leal (%rax,%rcx), %eax			; CHECK64-NEXT: addl %ecx, %eax
	; CHECK64-NEXT: retq			; CHECK64-NEXT: retq
	%byte1 = lshr i32 %x, 8			%byte1 = lshr i32 %x, 8
	%byte0 = shl i32 %x, 8			%byte0 = shl i32 %x, 8
	%byte3 = shl i32 %x, 8			%byte3 = shl i32 %x, 8
	%byte2 = lshr i32 %x, 8			%byte2 = lshr i32 %x, 8
	%tmp1 = and i32 %byte1, 65280 ; 0x0000ff00			%tmp1 = and i32 %byte1, 65280 ; 0x0000ff00
	%tmp0 = and i32 %byte0, 255 ; 0x000000ff			%tmp0 = and i32 %byte0, 255 ; 0x000000ff
	%tmp3 = and i32 %byte3, 4278190080 ; 0xff000000			%tmp3 = and i32 %byte3, 4278190080 ; 0xff000000
	[59 lines not shown]

llvm/trunk/test/CodeGen/X86/bypass-slow-division-32.ll

	[137 lines not shown]
	define i32 @Test_use_div_reg_imm(i32 %a) nounwind {			define i32 @Test_use_div_reg_imm(i32 %a) nounwind {
	; CHECK-LABEL: Test_use_div_reg_imm:			; CHECK-LABEL: Test_use_div_reg_imm:
	; CHECK: # %bb.0:			; CHECK: # %bb.0:
	; CHECK-NEXT: movl $1041204193, %eax # imm = 0x3E0F83E1			; CHECK-NEXT: movl $1041204193, %eax # imm = 0x3E0F83E1
	; CHECK-NEXT: imull {{[0-9]+}}(%esp)			; CHECK-NEXT: imull {{[0-9]+}}(%esp)
	; CHECK-NEXT: movl %edx, %eax			; CHECK-NEXT: movl %edx, %eax
	; CHECK-NEXT: shrl $31, %eax			; CHECK-NEXT: shrl $31, %eax
	; CHECK-NEXT: sarl $3, %edx			; CHECK-NEXT: sarl $3, %edx
	; CHECK-NEXT: leal (%edx,%eax), %eax			; CHECK-NEXT: addl %edx, %eax
	; CHECK-NEXT: retl			; CHECK-NEXT: retl
	%resultdiv = sdiv i32 %a, 33			%resultdiv = sdiv i32 %a, 33
	ret i32 %resultdiv			ret i32 %resultdiv
	}			}

	define i32 @Test_use_rem_reg_imm(i32 %a) nounwind {			define i32 @Test_use_rem_reg_imm(i32 %a) nounwind {
	; CHECK-LABEL: Test_use_rem_reg_imm:			; CHECK-LABEL: Test_use_rem_reg_imm:
	; CHECK: # %bb.0:			; CHECK: # %bb.0:
	[85 lines not shown]

llvm/trunk/test/CodeGen/X86/combine-srem.ll

	[55 lines not shown]
	; CHECK-LABEL: combine_srem_by_minsigned:			; CHECK-LABEL: combine_srem_by_minsigned:
	; CHECK: # %bb.0:			; CHECK: # %bb.0:
	; CHECK-NEXT: # kill: def $edi killed $edi def $rdi			; CHECK-NEXT: # kill: def $edi killed $edi def $rdi
	; CHECK-NEXT: movl %edi, %eax			; CHECK-NEXT: movl %edi, %eax
	; CHECK-NEXT: sarl $31, %eax			; CHECK-NEXT: sarl $31, %eax
	; CHECK-NEXT: shrl %eax			; CHECK-NEXT: shrl %eax
	; CHECK-NEXT: addl %edi, %eax			; CHECK-NEXT: addl %edi, %eax
	; CHECK-NEXT: andl $-2147483648, %eax # imm = 0x80000000			; CHECK-NEXT: andl $-2147483648, %eax # imm = 0x80000000
	; CHECK-NEXT: leal (%rax,%rdi), %eax			; CHECK-NEXT: addl %edi, %eax
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%1 = srem i32 %x, -2147483648			%1 = srem i32 %x, -2147483648
	ret i32 %1			ret i32 %1
	}			}

	define <4 x i32> @combine_vec_srem_by_minsigned(<4 x i32> %x) {			define <4 x i32> @combine_vec_srem_by_minsigned(<4 x i32> %x) {
	; SSE-LABEL: combine_vec_srem_by_minsigned:			; SSE-LABEL: combine_vec_srem_by_minsigned:
	; SSE: # %bb.0:			; SSE: # %bb.0:
	[362 lines not shown]

llvm/trunk/test/CodeGen/X86/dagcombine-shifts.ll

	[155 lines not shown]

	define i64 @fun11(i16 zeroext %v) {			define i64 @fun11(i16 zeroext %v) {
	; CHECK-LABEL: fun11:			; CHECK-LABEL: fun11:
	; CHECK: # %bb.0: # %entry			; CHECK: # %bb.0: # %entry
	; CHECK-NEXT: # kill: def $edi killed $edi def $rdi			; CHECK-NEXT: # kill: def $edi killed $edi def $rdi
	; CHECK-NEXT: shrl $4, %edi			; CHECK-NEXT: shrl $4, %edi
	; CHECK-NEXT: movq %rdi, %rax			; CHECK-NEXT: movq %rdi, %rax
	; CHECK-NEXT: shlq $4, %rax			; CHECK-NEXT: shlq $4, %rax
	; CHECK-NEXT: leaq (%rax,%rdi), %rax			; CHECK-NEXT: addq %rdi, %rax
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	entry:			entry:
	%shr = lshr i16 %v, 4			%shr = lshr i16 %v, 4
	%ext = zext i16 %shr to i64			%ext = zext i16 %shr to i64
	%shl = shl i64 %ext, 4			%shl = shl i64 %ext, 4
	%add = add i64 %shl, %ext			%add = add i64 %shl, %ext
	ret i64 %add			ret i64 %add
	}			}

	define i64 @fun12(i32 zeroext %v) {			define i64 @fun12(i32 zeroext %v) {
	; CHECK-LABEL: fun12:			; CHECK-LABEL: fun12:
	; CHECK: # %bb.0: # %entry			; CHECK: # %bb.0: # %entry
	; CHECK-NEXT: # kill: def $edi killed $edi def $rdi			; CHECK-NEXT: # kill: def $edi killed $edi def $rdi
	; CHECK-NEXT: shrl $4, %edi			; CHECK-NEXT: shrl $4, %edi
	; CHECK-NEXT: movq %rdi, %rax			; CHECK-NEXT: movq %rdi, %rax
	; CHECK-NEXT: shlq $4, %rax			; CHECK-NEXT: shlq $4, %rax
	; CHECK-NEXT: leaq (%rax,%rdi), %rax			; CHECK-NEXT: addq %rdi, %rax
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	entry:			entry:
	%shr = lshr i32 %v, 4			%shr = lshr i32 %v, 4
	%ext = zext i32 %shr to i64			%ext = zext i32 %shr to i64
	%shl = shl i64 %ext, 4			%shl = shl i64 %ext, 4
	%add = add i64 %shl, %ext			%add = add i64 %shl, %ext
	ret i64 %add			ret i64 %add
	}			}
	[28 lines not shown]

llvm/trunk/test/CodeGen/X86/fixup-bw-copy.ll

	[40 lines not shown]

	; Verify we don't mess with H-reg copies (only generated in 32-bit mode).			; Verify we don't mess with H-reg copies (only generated in 32-bit mode).
	define i8 @test_movb_hreg(i16 %a0) {			define i8 @test_movb_hreg(i16 %a0) {
	; X64-LABEL: test_movb_hreg:			; X64-LABEL: test_movb_hreg:
	; X64: # %bb.0:			; X64: # %bb.0:
	; X64-NEXT: # kill: def $edi killed $edi def $rdi			; X64-NEXT: # kill: def $edi killed $edi def $rdi
	; X64-NEXT: movl %edi, %eax			; X64-NEXT: movl %edi, %eax
	; X64-NEXT: shrl $8, %eax			; X64-NEXT: shrl $8, %eax
	; X64-NEXT: leal (%rax,%rdi), %eax			; X64-NEXT: addl %edi, %eax
	; X64-NEXT: # kill: def $al killed $al killed $eax			; X64-NEXT: # kill: def $al killed $al killed $eax
	; X64-NEXT: retq			; X64-NEXT: retq
	;			;
	; X32-LABEL: test_movb_hreg:			; X32-LABEL: test_movb_hreg:
	; X32: # %bb.0:			; X32: # %bb.0:
	; X32-NEXT: movl {{[0-9]+}}(%esp), %eax			; X32-NEXT: movl {{[0-9]+}}(%esp), %eax
	; X32-NEXT: addb %al, %ah			; X32-NEXT: addb %al, %ah
	; X32-NEXT: movb %ah, %al			; X32-NEXT: movb %ah, %al
	; X32-NEXT: retl			; X32-NEXT: retl
	%tmp0 = trunc i16 %a0 to i8			%tmp0 = trunc i16 %a0 to i8
	%tmp1 = lshr i16 %a0, 8			%tmp1 = lshr i16 %a0, 8
	%tmp2 = trunc i16 %tmp1 to i8			%tmp2 = trunc i16 %tmp1 to i8
	%tmp3 = add i8 %tmp0, %tmp2			%tmp3 = add i8 %tmp0, %tmp2
	ret i8 %tmp3			ret i8 %tmp3
	}			}

llvm/trunk/test/CodeGen/X86/fixup-lea.ll

	[123 lines not shown]
	;			;
	; FAST-LABEL: foo_nosize:			; FAST-LABEL: foo_nosize:
	; FAST: # %bb.0: # %entry			; FAST: # %bb.0: # %entry
	; FAST-NEXT: movw $-1, %cx			; FAST-NEXT: movw $-1, %cx
	; FAST-NEXT: .p2align 4, 0x90			; FAST-NEXT: .p2align 4, 0x90
	; FAST-NEXT: .LBB4_1: # %for.body			; FAST-NEXT: .LBB4_1: # %for.body
	; FAST-NEXT: # =>This Inner Loop Header: Depth=1			; FAST-NEXT: # =>This Inner Loop Header: Depth=1
	; FAST-NEXT: movzwl %cx, %edx			; FAST-NEXT: movzwl %cx, %edx
	; FAST-NEXT: leal -1(%ecx), %ecx			; FAST-NEXT: addl $-1, %ecx
	; FAST-NEXT: cmpl %eax, %edx			; FAST-NEXT: cmpl %eax, %edx
	; FAST-NEXT: jl .LBB4_1			; FAST-NEXT: jl .LBB4_1
	; FAST-NEXT: # %bb.2: # %for.end			; FAST-NEXT: # %bb.2: # %for.end
	; FAST-NEXT: retl			; FAST-NEXT: retl
	entry:			entry:
	br label %for.body			br label %for.body

	for.body:			for.body:
	[23 lines not shown]
	;			;
	; FAST-LABEL: bar_nosize:			; FAST-LABEL: bar_nosize:
	; FAST: # %bb.0: # %entry			; FAST: # %bb.0: # %entry
	; FAST-NEXT: movw $1, %cx			; FAST-NEXT: movw $1, %cx
	; FAST-NEXT: .p2align 4, 0x90			; FAST-NEXT: .p2align 4, 0x90
	; FAST-NEXT: .LBB5_1: # %for.body			; FAST-NEXT: .LBB5_1: # %for.body
	; FAST-NEXT: # =>This Inner Loop Header: Depth=1			; FAST-NEXT: # =>This Inner Loop Header: Depth=1
	; FAST-NEXT: movzwl %cx, %edx			; FAST-NEXT: movzwl %cx, %edx
	; FAST-NEXT: leal 1(%ecx), %ecx			; FAST-NEXT: addl $1, %ecx
	; FAST-NEXT: cmpl %eax, %edx			; FAST-NEXT: cmpl %eax, %edx
	; FAST-NEXT: jl .LBB5_1			; FAST-NEXT: jl .LBB5_1
	; FAST-NEXT: # %bb.2: # %for.end			; FAST-NEXT: # %bb.2: # %for.end
	; FAST-NEXT: retl			; FAST-NEXT: retl
	entry:			entry:
	br label %for.body			br label %for.body

	for.body:			for.body:
	%i.05 = phi i16 [ %inc, %for.body ], [ 0, %entry ]			%i.05 = phi i16 [ %inc, %for.body ], [ 0, %entry ]
	%inc = add i16 %i.05, 1			%inc = add i16 %i.05, 1
	%conv = zext i16 %inc to i32			%conv = zext i16 %inc to i32
	%cmp = icmp slt i32 %conv, %dns			%cmp = icmp slt i32 %conv, %dns
	br i1 %cmp, label %for.body, label %for.end			br i1 %cmp, label %for.body, label %for.end
	for.end:			for.end:
	ret void			ret void
	}			}

llvm/trunk/test/CodeGen/X86/imul.ll

	[214 lines not shown]
	}			}

	define i32 @mul33_32(i32 %A) {			define i32 @mul33_32(i32 %A) {
	; X64-LABEL: mul33_32:			; X64-LABEL: mul33_32:
	; X64: # %bb.0:			; X64: # %bb.0:
	; X64-NEXT: # kill: def $edi killed $edi def $rdi			; X64-NEXT: # kill: def $edi killed $edi def $rdi
	; X64-NEXT: movl %edi, %eax			; X64-NEXT: movl %edi, %eax
	; X64-NEXT: shll $5, %eax			; X64-NEXT: shll $5, %eax
	; X64-NEXT: leal (%rax,%rdi), %eax			; X64-NEXT: addl %edi, %eax
	; X64-NEXT: retq			; X64-NEXT: retq
	;			;
	; X86-LABEL: mul33_32:			; X86-LABEL: mul33_32:
	; X86: # %bb.0:			; X86: # %bb.0:
	; X86-NEXT: movl {{[0-9]+}}(%esp), %ecx			; X86-NEXT: movl {{[0-9]+}}(%esp), %ecx
	; X86-NEXT: movl %ecx, %eax			; X86-NEXT: movl %ecx, %eax
	; X86-NEXT: shll $5, %eax			; X86-NEXT: shll $5, %eax
	; X86-NEXT: addl %ecx, %eax			; X86-NEXT: addl %ecx, %eax
	[112 lines not shown]


	define i32 @test2(i32 %a) {			define i32 @test2(i32 %a) {
	; X64-LABEL: test2:			; X64-LABEL: test2:
	; X64: # %bb.0: # %entry			; X64: # %bb.0: # %entry
	; X64-NEXT: # kill: def $edi killed $edi def $rdi			; X64-NEXT: # kill: def $edi killed $edi def $rdi
	; X64-NEXT: movl %edi, %eax			; X64-NEXT: movl %edi, %eax
	; X64-NEXT: shll $5, %eax			; X64-NEXT: shll $5, %eax
	; X64-NEXT: leal (%rax,%rdi), %eax			; X64-NEXT: addl %edi, %eax
	; X64-NEXT: retq			; X64-NEXT: retq
	;			;
	; X86-LABEL: test2:			; X86-LABEL: test2:
	; X86: # %bb.0: # %entry			; X86: # %bb.0: # %entry
	; X86-NEXT: movl {{[0-9]+}}(%esp), %ecx			; X86-NEXT: movl {{[0-9]+}}(%esp), %ecx
	; X86-NEXT: movl %ecx, %eax			; X86-NEXT: movl %ecx, %eax
	; X86-NEXT: shll $5, %eax			; X86-NEXT: shll $5, %eax
	; X86-NEXT: addl %ecx, %eax			; X86-NEXT: addl %ecx, %eax
	; X86-NEXT: retl			; X86-NEXT: retl
	entry:			entry:
	%tmp3 = mul i32 %a, 33			%tmp3 = mul i32 %a, 33
	ret i32 %tmp3			ret i32 %tmp3
	}			}

	define i32 @test3(i32 %a) {			define i32 @test3(i32 %a) {
	; X64-LABEL: test3:			; X64-LABEL: test3:
	; X64: # %bb.0: # %entry			; X64: # %bb.0: # %entry
	; X64-NEXT: # kill: def $edi killed $edi def $rdi			; X64-NEXT: # kill: def $edi killed $edi def $rdi
	; X64-NEXT: movl %edi, %eax			; X64-NEXT: movl %edi, %eax
	; X64-NEXT: shll $5, %eax			; X64-NEXT: shll $5, %eax
	; X64-NEXT: leal (%rax,%rdi), %eax			; X64-NEXT: addl %edi, %eax
	; X64-NEXT: negl %eax			; X64-NEXT: negl %eax
	; X64-NEXT: retq			; X64-NEXT: retq
	;			;
	; X86-LABEL: test3:			; X86-LABEL: test3:
	; X86: # %bb.0: # %entry			; X86: # %bb.0: # %entry
	; X86-NEXT: movl {{[0-9]+}}(%esp), %ecx			; X86-NEXT: movl {{[0-9]+}}(%esp), %ecx
	; X86-NEXT: movl %ecx, %eax			; X86-NEXT: movl %ecx, %eax
	; X86-NEXT: shll $5, %eax			; X86-NEXT: shll $5, %eax
	[61 lines not shown]
	}			}


	define i64 @test6(i64 %a) {			define i64 @test6(i64 %a) {
	; X64-LABEL: test6:			; X64-LABEL: test6:
	; X64: # %bb.0: # %entry			; X64: # %bb.0: # %entry
	; X64-NEXT: movq %rdi, %rax			; X64-NEXT: movq %rdi, %rax
	; X64-NEXT: shlq $5, %rax			; X64-NEXT: shlq $5, %rax
	; X64-NEXT: leaq (%rax,%rdi), %rax			; X64-NEXT: addq %rdi, %rax
	; X64-NEXT: retq			; X64-NEXT: retq
	;			;
	; X86-LABEL: test6:			; X86-LABEL: test6:
	; X86: # %bb.0: # %entry			; X86: # %bb.0: # %entry
	; X86-NEXT: movl {{[0-9]+}}(%esp), %eax			; X86-NEXT: movl {{[0-9]+}}(%esp), %eax
	; X86-NEXT: movl %eax, %ecx			; X86-NEXT: movl %eax, %ecx
	; X86-NEXT: shll $5, %ecx			; X86-NEXT: shll $5, %ecx
	; X86-NEXT: addl %eax, %ecx			; X86-NEXT: addl %eax, %ecx
	; X86-NEXT: movl $33, %eax			; X86-NEXT: movl $33, %eax
	; X86-NEXT: mull {{[0-9]+}}(%esp)			; X86-NEXT: mull {{[0-9]+}}(%esp)
	; X86-NEXT: addl %ecx, %edx			; X86-NEXT: addl %ecx, %edx
	; X86-NEXT: retl			; X86-NEXT: retl
	entry:			entry:
	%tmp3 = mul i64 %a, 33			%tmp3 = mul i64 %a, 33
	ret i64 %tmp3			ret i64 %tmp3
	}			}

	define i64 @test7(i64 %a) {			define i64 @test7(i64 %a) {
	; X64-LABEL: test7:			; X64-LABEL: test7:
	; X64: # %bb.0: # %entry			; X64: # %bb.0: # %entry
	; X64-NEXT: movq %rdi, %rax			; X64-NEXT: movq %rdi, %rax
	; X64-NEXT: shlq $5, %rax			; X64-NEXT: shlq $5, %rax
	; X64-NEXT: leaq (%rax,%rdi), %rax			; X64-NEXT: addq %rdi, %rax
	; X64-NEXT: negq %rax			; X64-NEXT: negq %rax
	; X64-NEXT: retq			; X64-NEXT: retq
	;			;
	; X86-LABEL: test7:			; X86-LABEL: test7:
	; X86: # %bb.0: # %entry			; X86: # %bb.0: # %entry
	; X86-NEXT: pushl %esi			; X86-NEXT: pushl %esi
	; X86-NEXT: .cfi_def_cfa_offset 8			; X86-NEXT: .cfi_def_cfa_offset 8
	; X86-NEXT: .cfi_offset %esi, -8			; X86-NEXT: .cfi_offset %esi, -8
	[71 lines not shown]

llvm/trunk/test/CodeGen/X86/leaFixup32.mir

[168 lines not shown]
frameInfo:		frameInfo:
hasCalls: false		hasCalls: false
maxCallFrameSize: 0		maxCallFrameSize: 0
hasOpaqueSPAdjustment: false		hasOpaqueSPAdjustment: false
hasVAStart: false		hasVAStart: false
hasMustTailInVarArgFunc: false		hasMustTailInVarArgFunc: false
body: \|		body: \|
bb.0 (%ir-block.0):		bb.0 (%ir-block.0):
liveins: $eax, $ebp		liveins: $eax, $ebp
; CHECK: $ebp = ADD32rr $ebp, killed $eax		; CHECK: $ebp = ADD32rr $ebp, $eax

$ebp = LEA32r killed $ebp, 1, killed $eax, 0, $noreg		$ebp = LEA32r killed $ebp, 1, killed $eax, 0, $noreg
RETQ $ebp		RETQ $ebp

...		...
---		---
name: testleaadd_32		name: testleaadd_32
alignment: 4		alignment: 4
[324 lines not shown]

llvm/trunk/test/CodeGen/X86/leaFixup64.mir

frameInfo:
hasCalls: false		hasCalls: false
maxCallFrameSize: 0		maxCallFrameSize: 0
hasOpaqueSPAdjustment: false		hasOpaqueSPAdjustment: false
hasVAStart: false		hasVAStart: false
hasMustTailInVarArgFunc: false		hasMustTailInVarArgFunc: false
body: |		body: |
bb.0 (%ir-block.0):		bb.0 (%ir-block.0):
liveins: $rax, $rbp		liveins: $rax, $rbp
; CHECK: $ebp = LEA64_32r killed $rax, 1, killed $rbp, 0		; CHECK: $ebp = ADD32rr $ebp, $eax, implicit-def $eflags, implicit $rbp, implicit $rax

$ebp = LEA64_32r killed $rbp, 1, killed $rax, 0, $noreg		$ebp = LEA64_32r killed $rbp, 1, killed $rax, 0, $noreg
RETQ $ebp		RETQ $ebp

...		...
---		---
name: test2add_64		name: test2add_64
alignment: 4		alignment: 4
▲ Show 20 Lines • Show All 87 Lines • ▼ Show 20 Lines	frameInfo:
hasCalls: false		hasCalls: false
maxCallFrameSize: 0		maxCallFrameSize: 0
hasOpaqueSPAdjustment: false		hasOpaqueSPAdjustment: false
hasVAStart: false		hasVAStart: false
hasMustTailInVarArgFunc: false		hasMustTailInVarArgFunc: false
body: |		body: |
bb.0 (%ir-block.0):		bb.0 (%ir-block.0):
liveins: $rax, $rbp		liveins: $rax, $rbp
; CHECK: $rbp = ADD64rr $rbp, killed $rax		; CHECK: $rbp = ADD64rr $rbp, $rax

$rbp = LEA64r killed $rbp, 1, killed $rax, 0, $noreg		$rbp = LEA64r killed $rbp, 1, killed $rax, 0, $noreg
RETQ $ebp		RETQ $ebp

...		...
---		---
name: testleaadd_64_32		name: testleaadd_64_32
alignment: 4		alignment: 4

llvm/trunk/test/CodeGen/X86/mul-constant-i16.ll

	; X86-NEXT: # kill: def $ax killed $ax killed $eax			; X86-NEXT: # kill: def $ax killed $ax killed $eax
	; X86-NEXT: retl			; X86-NEXT: retl
	;			;
	; X64-LABEL: test_mul_by_17:			; X64-LABEL: test_mul_by_17:
	; X64: # %bb.0:			; X64: # %bb.0:
	; X64-NEXT: # kill: def $edi killed $edi def $rdi			; X64-NEXT: # kill: def $edi killed $edi def $rdi
	; X64-NEXT: movl %edi, %eax			; X64-NEXT: movl %edi, %eax
	; X64-NEXT: shll $4, %eax			; X64-NEXT: shll $4, %eax
	; X64-NEXT: leal (%rax,%rdi), %eax			; X64-NEXT: addl %edi, %eax
	; X64-NEXT: # kill: def $ax killed $ax killed $eax			; X64-NEXT: # kill: def $ax killed $ax killed $eax
	; X64-NEXT: retq			; X64-NEXT: retq
	%mul = mul nsw i16 %x, 17			%mul = mul nsw i16 %x, 17
	ret i16 %mul			ret i16 %mul
	}			}

	define i16 @test_mul_by_18(i16 %x) {			define i16 @test_mul_by_18(i16 %x) {
	; X86-LABEL: test_mul_by_18:			; X86-LABEL: test_mul_by_18:

llvm/trunk/test/CodeGen/X86/mul-constant-i32.ll

	; X86-NEXT: addl %ecx, %eax			; X86-NEXT: addl %ecx, %eax
	; X86-NEXT: retl			; X86-NEXT: retl
	;			;
	; X64-LABEL: test_mul_by_17:			; X64-LABEL: test_mul_by_17:
	; X64: # %bb.0:			; X64: # %bb.0:
	; X64-NEXT: # kill: def $edi killed $edi def $rdi			; X64-NEXT: # kill: def $edi killed $edi def $rdi
	; X64-NEXT: movl %edi, %eax			; X64-NEXT: movl %edi, %eax
	; X64-NEXT: shll $4, %eax			; X64-NEXT: shll $4, %eax
	; X64-NEXT: leal (%rax,%rdi), %eax			; X64-NEXT: addl %edi, %eax
	; X64-NEXT: retq			; X64-NEXT: retq
	;			;
	; X86-NOOPT-LABEL: test_mul_by_17:			; X86-NOOPT-LABEL: test_mul_by_17:
	; X86-NOOPT: # %bb.0:			; X86-NOOPT: # %bb.0:
	; X86-NOOPT-NEXT: imull $17, {{[0-9]+}}(%esp), %eax			; X86-NOOPT-NEXT: imull $17, {{[0-9]+}}(%esp), %eax
	; X86-NOOPT-NEXT: retl			; X86-NOOPT-NEXT: retl
	;			;
	; X64-NOOPT-LABEL: test_mul_by_17:			; X64-NOOPT-LABEL: test_mul_by_17:
	▲ Show 20 Lines • Show All 676 Lines • ▼ Show 20 Lines
	; X64-NOOPT-NEXT: imull $66, %edi, %eax			; X64-NOOPT-NEXT: imull $66, %edi, %eax
	; X64-NOOPT-NEXT: retq			; X64-NOOPT-NEXT: retq
	;			;
	; X64-SLM-LABEL: test_mul_by_66:			; X64-SLM-LABEL: test_mul_by_66:
	; X64-SLM: # %bb.0:			; X64-SLM: # %bb.0:
	; X64-SLM-NEXT: # kill: def $edi killed $edi def $rdi			; X64-SLM-NEXT: # kill: def $edi killed $edi def $rdi
	; X64-SLM-NEXT: movl %edi, %eax			; X64-SLM-NEXT: movl %edi, %eax
	; X64-SLM-NEXT: shll $6, %eax			; X64-SLM-NEXT: shll $6, %eax
	; X64-SLM-NEXT: leal (%rax,%rdi), %eax			; X64-SLM-NEXT: addl %edi, %eax
	; X64-SLM-NEXT: addl %edi, %eax			; X64-SLM-NEXT: addl %edi, %eax
	; X64-SLM-NEXT: retq			; X64-SLM-NEXT: retq
	%mul = mul nsw i32 %x, 66			%mul = mul nsw i32 %x, 66
	ret i32 %mul			ret i32 %mul
	}			}

	define i32 @test_mul_by_73(i32 %x) {			define i32 @test_mul_by_73(i32 %x) {
	; X86-LABEL: test_mul_by_73:			; X86-LABEL: test_mul_by_73:

llvm/trunk/test/CodeGen/X86/mul-constant-i64.ll

	; X86-NOOPT-LABEL: test_mul_by_17:			; X86-NOOPT-LABEL: test_mul_by_17:
	; X86-NOOPT: # %bb.0:			; X86-NOOPT: # %bb.0:
	; X86-NOOPT-NEXT: movl $17, %eax			; X86-NOOPT-NEXT: movl $17, %eax
	; X86-NOOPT-NEXT: mull {{[0-9]+}}(%esp)			; X86-NOOPT-NEXT: mull {{[0-9]+}}(%esp)
	; X86-NOOPT-NEXT: imull $17, {{[0-9]+}}(%esp), %ecx			; X86-NOOPT-NEXT: imull $17, {{[0-9]+}}(%esp), %ecx
	; X86-NOOPT-NEXT: addl %ecx, %edx			; X86-NOOPT-NEXT: addl %ecx, %edx
	; X86-NOOPT-NEXT: retl			; X86-NOOPT-NEXT: retl
	;			;
	; X64-HSW-LABEL: test_mul_by_17:			; X64-OPT-LABEL: test_mul_by_17:
	; X64-HSW: # %bb.0:			; X64-OPT: # %bb.0:
	; X64-HSW-NEXT: movq %rdi, %rax			; X64-OPT-NEXT: movq %rdi, %rax
	; X64-HSW-NEXT: shlq $4, %rax			; X64-OPT-NEXT: shlq $4, %rax
	; X64-HSW-NEXT: leaq (%rax,%rdi), %rax			; X64-OPT-NEXT: addq %rdi, %rax
	; X64-HSW-NEXT: retq			; X64-OPT-NEXT: retq
	;
	; X64-JAG-LABEL: test_mul_by_17:
	; X64-JAG: # %bb.0:
	; X64-JAG-NEXT: movq %rdi, %rax
	; X64-JAG-NEXT: shlq $4, %rax
	; X64-JAG-NEXT: leaq (%rax,%rdi), %rax
	; X64-JAG-NEXT: retq
	;
	; X64-SLM-LABEL: test_mul_by_17:
	; X64-SLM: # %bb.0:
	; X64-SLM-NEXT: movq %rdi, %rax
	; X64-SLM-NEXT: shlq $4, %rax
	; X64-SLM-NEXT: addq %rdi, %rax
	; X64-SLM-NEXT: retq
	;			;
	; X64-NOOPT-LABEL: test_mul_by_17:			; X64-NOOPT-LABEL: test_mul_by_17:
	; X64-NOOPT: # %bb.0:			; X64-NOOPT: # %bb.0:
	; X64-NOOPT-NEXT: imulq $17, %rdi, %rax			; X64-NOOPT-NEXT: imulq $17, %rdi, %rax
	; X64-NOOPT-NEXT: retq			; X64-NOOPT-NEXT: retq
	%mul = mul nsw i64 %x, 17			%mul = mul nsw i64 %x, 17
	ret i64 %mul			ret i64 %mul
	}			}

llvm/trunk/test/CodeGen/X86/mul-constant-i8.ll

	}			}

	define i8 @test_mul_by_17(i8 %x) {			define i8 @test_mul_by_17(i8 %x) {
	; X64-LABEL: test_mul_by_17:			; X64-LABEL: test_mul_by_17:
	; X64: # %bb.0:			; X64: # %bb.0:
	; X64-NEXT: # kill: def $edi killed $edi def $rdi			; X64-NEXT: # kill: def $edi killed $edi def $rdi
	; X64-NEXT: movl %edi, %eax			; X64-NEXT: movl %edi, %eax
	; X64-NEXT: shll $4, %eax			; X64-NEXT: shll $4, %eax
	; X64-NEXT: leal (%rax,%rdi), %eax			; X64-NEXT: addl %edi, %eax
	; X64-NEXT: # kill: def $al killed $al killed $eax			; X64-NEXT: # kill: def $al killed $al killed $eax
	; X64-NEXT: retq			; X64-NEXT: retq
	%m = mul i8 %x, 17			%m = mul i8 %x, 17
	ret i8 %m			ret i8 %m
	}			}

	define i8 @test_mul_by_18(i8 %x) {			define i8 @test_mul_by_18(i8 %x) {
	; X64-LABEL: test_mul_by_18:			; X64-LABEL: test_mul_by_18:

llvm/trunk/test/CodeGen/X86/popcnt.ll

	; X64-NEXT: subb %al, %dil			; X64-NEXT: subb %al, %dil
	; X64-NEXT: movl %edi, %eax			; X64-NEXT: movl %edi, %eax
	; X64-NEXT: andb $51, %al			; X64-NEXT: andb $51, %al
	; X64-NEXT: shrb $2, %dil			; X64-NEXT: shrb $2, %dil
	; X64-NEXT: andb $51, %dil			; X64-NEXT: andb $51, %dil
	; X64-NEXT: addb %al, %dil			; X64-NEXT: addb %al, %dil
	; X64-NEXT: movl %edi, %eax			; X64-NEXT: movl %edi, %eax
	; X64-NEXT: shrb $4, %al			; X64-NEXT: shrb $4, %al
	; X64-NEXT: leal (%rax,%rdi), %eax			; X64-NEXT: addl %edi, %eax
	; X64-NEXT: andb $15, %al			; X64-NEXT: andb $15, %al
	; X64-NEXT: # kill: def $al killed $al killed $eax			; X64-NEXT: # kill: def $al killed $al killed $eax
	; X64-NEXT: retq			; X64-NEXT: retq
	;			;
	; X32-POPCNT-LABEL: cnt8:			; X32-POPCNT-LABEL: cnt8:
	; X32-POPCNT: # %bb.0:			; X32-POPCNT: # %bb.0:
	; X32-POPCNT-NEXT: movzbl {{[0-9]+}}(%esp), %eax			; X32-POPCNT-NEXT: movzbl {{[0-9]+}}(%esp), %eax
	; X32-POPCNT-NEXT: popcntl %eax, %eax			; X32-POPCNT-NEXT: popcntl %eax, %eax
	▲ Show 20 Lines • Show All 171 Lines • ▼ Show 20 Lines
	; X64-NEXT: movabsq $3689348814741910323, %rax # imm = 0x3333333333333333			; X64-NEXT: movabsq $3689348814741910323, %rax # imm = 0x3333333333333333
	; X64-NEXT: movq %rdi, %rcx			; X64-NEXT: movq %rdi, %rcx
	; X64-NEXT: andq %rax, %rcx			; X64-NEXT: andq %rax, %rcx
	; X64-NEXT: shrq $2, %rdi			; X64-NEXT: shrq $2, %rdi
	; X64-NEXT: andq %rax, %rdi			; X64-NEXT: andq %rax, %rdi
	; X64-NEXT: addq %rcx, %rdi			; X64-NEXT: addq %rcx, %rdi
	; X64-NEXT: movq %rdi, %rax			; X64-NEXT: movq %rdi, %rax
	; X64-NEXT: shrq $4, %rax			; X64-NEXT: shrq $4, %rax
	; X64-NEXT: leaq (%rax,%rdi), %rax			; X64-NEXT: addq %rdi, %rax
	; X64-NEXT: movabsq $1085102592571150095, %rcx # imm = 0xF0F0F0F0F0F0F0F			; X64-NEXT: movabsq $1085102592571150095, %rcx # imm = 0xF0F0F0F0F0F0F0F
	; X64-NEXT: andq %rax, %rcx			; X64-NEXT: andq %rax, %rcx
	; X64-NEXT: movabsq $72340172838076673, %rax # imm = 0x101010101010101			; X64-NEXT: movabsq $72340172838076673, %rax # imm = 0x101010101010101
	; X64-NEXT: imulq %rcx, %rax			; X64-NEXT: imulq %rcx, %rax
	; X64-NEXT: shrq $56, %rax			; X64-NEXT: shrq $56, %rax
	; X64-NEXT: retq			; X64-NEXT: retq
	;			;
	; X32-POPCNT-LABEL: cnt64:			; X32-POPCNT-LABEL: cnt64:
	▲ Show 20 Lines • Show All 103 Lines • ▼ Show 20 Lines
	; X64-NEXT: movabsq $3689348814741910323, %rax # imm = 0x3333333333333333			; X64-NEXT: movabsq $3689348814741910323, %rax # imm = 0x3333333333333333
	; X64-NEXT: movq %rdi, %rcx			; X64-NEXT: movq %rdi, %rcx
	; X64-NEXT: andq %rax, %rcx			; X64-NEXT: andq %rax, %rcx
	; X64-NEXT: shrq $2, %rdi			; X64-NEXT: shrq $2, %rdi
	; X64-NEXT: andq %rax, %rdi			; X64-NEXT: andq %rax, %rdi
	; X64-NEXT: addq %rcx, %rdi			; X64-NEXT: addq %rcx, %rdi
	; X64-NEXT: movq %rdi, %rax			; X64-NEXT: movq %rdi, %rax
	; X64-NEXT: shrq $4, %rax			; X64-NEXT: shrq $4, %rax
	; X64-NEXT: leaq (%rax,%rdi), %rax			; X64-NEXT: addq %rdi, %rax
	; X64-NEXT: movabsq $1085102592571150095, %rcx # imm = 0xF0F0F0F0F0F0F0F			; X64-NEXT: movabsq $1085102592571150095, %rcx # imm = 0xF0F0F0F0F0F0F0F
	; X64-NEXT: andq %rax, %rcx			; X64-NEXT: andq %rax, %rcx
	; X64-NEXT: movabsq $72340172838076673, %rax # imm = 0x101010101010101			; X64-NEXT: movabsq $72340172838076673, %rax # imm = 0x101010101010101
	; X64-NEXT: imulq %rcx, %rax			; X64-NEXT: imulq %rcx, %rax
	; X64-NEXT: shrq $56, %rax			; X64-NEXT: shrq $56, %rax
	; X64-NEXT: retq			; X64-NEXT: retq
	;			;
	; X32-POPCNT-LABEL: cnt64_noimplicitfloat:			; X32-POPCNT-LABEL: cnt64_noimplicitfloat:

llvm/trunk/test/CodeGen/X86/ragreedy-hoist-spill.ll

	; CHECK-NEXT: testb %dl, %dl			; CHECK-NEXT: testb %dl, %dl
	; CHECK-NEXT: jne LBB0_29			; CHECK-NEXT: jne LBB0_29
	; CHECK-NEXT: ## %bb.33: ## %if.end517.loopexitsplit			; CHECK-NEXT: ## %bb.33: ## %if.end517.loopexitsplit
	; CHECK-NEXT: ## in Loop: Header=BB0_13 Depth=1			; CHECK-NEXT: ## in Loop: Header=BB0_13 Depth=1
	; CHECK-NEXT: incq %r13			; CHECK-NEXT: incq %r13
	; CHECK-NEXT: LBB0_34: ## %if.end517			; CHECK-NEXT: LBB0_34: ## %if.end517
	; CHECK-NEXT: ## in Loop: Header=BB0_13 Depth=1			; CHECK-NEXT: ## in Loop: Header=BB0_13 Depth=1
	; CHECK-NEXT: movq {{[-0-9]+}}(%r{{[sb]}}p), %rax ## 8-byte Reload			; CHECK-NEXT: movq {{[-0-9]+}}(%r{{[sb]}}p), %rax ## 8-byte Reload
	; CHECK-NEXT: leal -324(%rax), %eax			; CHECK-NEXT: addl $-324, %eax ## imm = 0xFEBC
	; CHECK-NEXT: cmpl $59, %eax			; CHECK-NEXT: cmpl $59, %eax
	; CHECK-NEXT: ja LBB0_35			; CHECK-NEXT: ja LBB0_35
	; CHECK-NEXT: ## %bb.57: ## %if.end517			; CHECK-NEXT: ## %bb.57: ## %if.end517
	; CHECK-NEXT: ## in Loop: Header=BB0_13 Depth=1			; CHECK-NEXT: ## in Loop: Header=BB0_13 Depth=1
	; CHECK-NEXT: movabsq $576460756598390785, %rcx ## imm = 0x800000100000001			; CHECK-NEXT: movabsq $576460756598390785, %rcx ## imm = 0x800000100000001
	; CHECK-NEXT: btq %rax, %rcx			; CHECK-NEXT: btq %rax, %rcx
	; CHECK-NEXT: jb LBB0_38			; CHECK-NEXT: jb LBB0_38
	; CHECK-NEXT: LBB0_35: ## %if.end517			; CHECK-NEXT: LBB0_35: ## %if.end517

llvm/trunk/test/CodeGen/X86/reverse_branches.ll

	; CHECK-NEXT: ## => This Inner Loop Header: Depth=2			; CHECK-NEXT: ## => This Inner Loop Header: Depth=2
	; CHECK-NEXT: incl %r13d			; CHECK-NEXT: incl %r13d
	; CHECK-NEXT: cmpl $999, %r13d ## imm = 0x3E7			; CHECK-NEXT: cmpl $999, %r13d ## imm = 0x3E7
	; CHECK-NEXT: jg LBB0_6			; CHECK-NEXT: jg LBB0_6
	; CHECK-NEXT: ## %bb.4: ## %for.body3			; CHECK-NEXT: ## %bb.4: ## %for.body3
	; CHECK-NEXT: ## in Loop: Header=BB0_3 Depth=2			; CHECK-NEXT: ## in Loop: Header=BB0_3 Depth=2
	; CHECK-NEXT: addq $1002, %rbp ## imm = 0x3EA			; CHECK-NEXT: addq $1002, %rbp ## imm = 0x3EA
	; CHECK-NEXT: movq %rbx, %rdi			; CHECK-NEXT: movq %rbx, %rdi
	; CHECK-NEXT: leaq 1001(%rbx), %rbx			; CHECK-NEXT: addq $1001, %rbx ## imm = 0x3E9
	; CHECK-NEXT: movl $1000, %edx ## imm = 0x3E8			; CHECK-NEXT: movl $1000, %edx ## imm = 0x3E8
	; CHECK-NEXT: movl $120, %esi			; CHECK-NEXT: movl $120, %esi
	; CHECK-NEXT: callq _memchr			; CHECK-NEXT: callq _memchr
	; CHECK-NEXT: cmpq %rax, %rbp			; CHECK-NEXT: cmpq %rax, %rbp
	; CHECK-NEXT: je LBB0_3			; CHECK-NEXT: je LBB0_3
	; CHECK-NEXT: jmp LBB0_5			; CHECK-NEXT: jmp LBB0_5
	; CHECK-NEXT: LBB0_7: ## %for.end11			; CHECK-NEXT: LBB0_7: ## %for.end11
	; CHECK-NEXT: leaq {{.*}}(%rip), %rdi			; CHECK-NEXT: leaq {{.*}}(%rip), %rdi

llvm/trunk/test/CodeGen/X86/rotate-extract.ll

	; X86-NEXT: retl			; X86-NEXT: retl
	;			;
	; X64-LABEL: no_extract_shl:			; X64-LABEL: no_extract_shl:
	; X64: # %bb.0:			; X64: # %bb.0:
	; X64-NEXT: movq %rdi, %rax			; X64-NEXT: movq %rdi, %rax
	; X64-NEXT: shlq $5, %rax			; X64-NEXT: shlq $5, %rax
	; X64-NEXT: shlq $10, %rdi			; X64-NEXT: shlq $10, %rdi
	; X64-NEXT: shrq $57, %rax			; X64-NEXT: shrq $57, %rax
	; X64-NEXT: leaq (%rax,%rdi), %rax			; X64-NEXT: addq %rdi, %rax
	; X64-NEXT: retq			; X64-NEXT: retq
	%lhs_mul = shl i64 %i, 5			%lhs_mul = shl i64 %i, 5
	%rhs_mul = shl i64 %i, 10			%rhs_mul = shl i64 %i, 10
	%lhs_shift = lshr i64 %lhs_mul, 57			%lhs_shift = lshr i64 %lhs_mul, 57
	%out = or i64 %lhs_shift, %rhs_mul			%out = or i64 %lhs_shift, %rhs_mul
	ret i64 %out			ret i64 %out
	}			}

	Show All 11 Lines
	;			;
	; X64-LABEL: no_extract_shrl:			; X64-LABEL: no_extract_shrl:
	; X64: # %bb.0:			; X64: # %bb.0:
	; X64-NEXT: # kill: def $edi killed $edi def $rdi			; X64-NEXT: # kill: def $edi killed $edi def $rdi
	; X64-NEXT: movl %edi, %eax			; X64-NEXT: movl %edi, %eax
	; X64-NEXT: andl $-8, %eax			; X64-NEXT: andl $-8, %eax
	; X64-NEXT: shll $25, %eax			; X64-NEXT: shll $25, %eax
	; X64-NEXT: shrl $9, %edi			; X64-NEXT: shrl $9, %edi
	; X64-NEXT: leal (%rdi,%rax), %eax			; X64-NEXT: addl %edi, %eax
	; X64-NEXT: retq			; X64-NEXT: retq
	%lhs_div = lshr i32 %i, 3			%lhs_div = lshr i32 %i, 3
	%rhs_div = lshr i32 %i, 9			%rhs_div = lshr i32 %i, 9
	%lhs_shift = shl i32 %lhs_div, 28			%lhs_shift = shl i32 %lhs_div, 28
	%out = or i32 %lhs_shift, %rhs_div			%out = or i32 %lhs_shift, %rhs_div
	ret i32 %out			ret i32 %out
	}			}


llvm/trunk/test/CodeGen/X86/sat-add.ll

	define i16 @unsigned_sat_variable_i16_using_min(i16 %x, i16 %y) {			define i16 @unsigned_sat_variable_i16_using_min(i16 %x, i16 %y) {
	; ANY-LABEL: unsigned_sat_variable_i16_using_min:			; ANY-LABEL: unsigned_sat_variable_i16_using_min:
	; ANY: # %bb.0:			; ANY: # %bb.0:
	; ANY-NEXT: # kill: def $esi killed $esi def $rsi			; ANY-NEXT: # kill: def $esi killed $esi def $rsi
	; ANY-NEXT: movl %esi, %eax			; ANY-NEXT: movl %esi, %eax
	; ANY-NEXT: notl %eax			; ANY-NEXT: notl %eax
	; ANY-NEXT: cmpw %ax, %di			; ANY-NEXT: cmpw %ax, %di
	; ANY-NEXT: cmovbl %edi, %eax			; ANY-NEXT: cmovbl %edi, %eax
	; ANY-NEXT: leal (%rax,%rsi), %eax			; ANY-NEXT: addl %esi, %eax
	; ANY-NEXT: # kill: def $ax killed $ax killed $eax			; ANY-NEXT: # kill: def $ax killed $ax killed $eax
	; ANY-NEXT: retq			; ANY-NEXT: retq
	%noty = xor i16 %y, -1			%noty = xor i16 %y, -1
	%c = icmp ult i16 %x, %noty			%c = icmp ult i16 %x, %noty
	%s = select i1 %c, i16 %x, i16 %noty			%s = select i1 %c, i16 %x, i16 %noty
	%r = add i16 %s, %y			%r = add i16 %s, %y
	ret i16 %r			ret i16 %r
	}			}
	Show All 34 Lines
	define i32 @unsigned_sat_variable_i32_using_min(i32 %x, i32 %y) {			define i32 @unsigned_sat_variable_i32_using_min(i32 %x, i32 %y) {
	; ANY-LABEL: unsigned_sat_variable_i32_using_min:			; ANY-LABEL: unsigned_sat_variable_i32_using_min:
	; ANY: # %bb.0:			; ANY: # %bb.0:
	; ANY-NEXT: # kill: def $esi killed $esi def $rsi			; ANY-NEXT: # kill: def $esi killed $esi def $rsi
	; ANY-NEXT: movl %esi, %eax			; ANY-NEXT: movl %esi, %eax
	; ANY-NEXT: notl %eax			; ANY-NEXT: notl %eax
	; ANY-NEXT: cmpl %eax, %edi			; ANY-NEXT: cmpl %eax, %edi
	; ANY-NEXT: cmovbl %edi, %eax			; ANY-NEXT: cmovbl %edi, %eax
	; ANY-NEXT: leal (%rax,%rsi), %eax			; ANY-NEXT: addl %esi, %eax
	; ANY-NEXT: retq			; ANY-NEXT: retq
	%noty = xor i32 %y, -1			%noty = xor i32 %y, -1
	%c = icmp ult i32 %x, %noty			%c = icmp ult i32 %x, %noty
	%s = select i1 %c, i32 %x, i32 %noty			%s = select i1 %c, i32 %x, i32 %noty
	%r = add i32 %s, %y			%r = add i32 %s, %y
	ret i32 %r			ret i32 %r
	}			}

	Show All 30 Lines

	define i64 @unsigned_sat_variable_i64_using_min(i64 %x, i64 %y) {			define i64 @unsigned_sat_variable_i64_using_min(i64 %x, i64 %y) {
	; ANY-LABEL: unsigned_sat_variable_i64_using_min:			; ANY-LABEL: unsigned_sat_variable_i64_using_min:
	; ANY: # %bb.0:			; ANY: # %bb.0:
	; ANY-NEXT: movq %rsi, %rax			; ANY-NEXT: movq %rsi, %rax
	; ANY-NEXT: notq %rax			; ANY-NEXT: notq %rax
	; ANY-NEXT: cmpq %rax, %rdi			; ANY-NEXT: cmpq %rax, %rdi
	; ANY-NEXT: cmovbq %rdi, %rax			; ANY-NEXT: cmovbq %rdi, %rax
	; ANY-NEXT: leaq (%rax,%rsi), %rax			; ANY-NEXT: addq %rsi, %rax
	; ANY-NEXT: retq			; ANY-NEXT: retq
	%noty = xor i64 %y, -1			%noty = xor i64 %y, -1
	%c = icmp ult i64 %x, %noty			%c = icmp ult i64 %x, %noty
	%s = select i1 %c, i64 %x, i64 %noty			%s = select i1 %c, i64 %x, i64 %noty
	%r = add i64 %s, %y			%r = add i64 %s, %y
	ret i64 %r			ret i64 %r
	}			}


llvm/trunk/test/CodeGen/X86/twoaddr-lea.ll

	bb2:			bb2:
	%tmp6 = load i32, i32* @global, align 4			%tmp6 = load i32, i32* @global, align 4
	%tmp8 = add nsw i32 %tmp6, %tmp2			%tmp8 = add nsw i32 %tmp6, %tmp2
	%tmp9 = sext i32 %tmp8 to i64			%tmp9 = sext i32 %tmp8 to i64
	br label %bb6			br label %bb6

	bb3:			bb3:
	; CHECK: subl %e[[REG0:[a-z0-9]+]],			; CHECK: subl %e[[REG0:[a-z0-9]+]],
	; CHECK: leaq 4({{%[a-z0-9]+}}), %r[[REG0]]			; CHECK: addq $4, %r[[REG0]]
	%tmp14 = phi i64 [ %tmp15, %bb5 ], [ 0, %bb1 ]			%tmp14 = phi i64 [ %tmp15, %bb5 ], [ 0, %bb1 ]
	%tmp15 = add nuw i64 %tmp14, 4			%tmp15 = add nuw i64 %tmp14, 4
	%tmp16 = trunc i64 %tmp14 to i32			%tmp16 = trunc i64 %tmp14 to i32
	%tmp17 = sub i32 %tmp2, %tmp16			%tmp17 = sub i32 %tmp2, %tmp16
	br label %bb4			br label %bb4

	bb4:			bb4:
	%tmp20 = phi i64 [ %tmp14, %bb3 ], [ %tmp34, %bb5 ]			%tmp20 = phi i64 [ %tmp14, %bb3 ], [ %tmp34, %bb5 ]

llvm/trunk/test/CodeGen/X86/vector-bitreverse.ll

	; SSE-NEXT: andb $-52, %dil			; SSE-NEXT: andb $-52, %dil
	; SSE-NEXT: shrb $2, %dil			; SSE-NEXT: shrb $2, %dil
	; SSE-NEXT: orb %al, %dil			; SSE-NEXT: orb %al, %dil
	; SSE-NEXT: movl %edi, %eax			; SSE-NEXT: movl %edi, %eax
	; SSE-NEXT: andb $85, %al			; SSE-NEXT: andb $85, %al
	; SSE-NEXT: addb %al, %al			; SSE-NEXT: addb %al, %al
	; SSE-NEXT: andb $-86, %dil			; SSE-NEXT: andb $-86, %dil
	; SSE-NEXT: shrb %dil			; SSE-NEXT: shrb %dil
	; SSE-NEXT: leal (%rdi,%rax), %eax			; SSE-NEXT: addl %edi, %eax
	; SSE-NEXT: # kill: def $al killed $al killed $eax			; SSE-NEXT: # kill: def $al killed $al killed $eax
	; SSE-NEXT: retq			; SSE-NEXT: retq
	;			;
	; AVX-LABEL: test_bitreverse_i8:			; AVX-LABEL: test_bitreverse_i8:
	; AVX: # %bb.0:			; AVX: # %bb.0:
	; AVX-NEXT: # kill: def $edi killed $edi def $rdi			; AVX-NEXT: # kill: def $edi killed $edi def $rdi
	; AVX-NEXT: rolb $4, %dil			; AVX-NEXT: rolb $4, %dil
	; AVX-NEXT: movl %edi, %eax			; AVX-NEXT: movl %edi, %eax
	; AVX-NEXT: andb $51, %al			; AVX-NEXT: andb $51, %al
	; AVX-NEXT: shlb $2, %al			; AVX-NEXT: shlb $2, %al
	; AVX-NEXT: andb $-52, %dil			; AVX-NEXT: andb $-52, %dil
	; AVX-NEXT: shrb $2, %dil			; AVX-NEXT: shrb $2, %dil
	; AVX-NEXT: orb %al, %dil			; AVX-NEXT: orb %al, %dil
	; AVX-NEXT: movl %edi, %eax			; AVX-NEXT: movl %edi, %eax
	; AVX-NEXT: andb $85, %al			; AVX-NEXT: andb $85, %al
	; AVX-NEXT: addb %al, %al			; AVX-NEXT: addb %al, %al
	; AVX-NEXT: andb $-86, %dil			; AVX-NEXT: andb $-86, %dil
	; AVX-NEXT: shrb %dil			; AVX-NEXT: shrb %dil
	; AVX-NEXT: leal (%rdi,%rax), %eax			; AVX-NEXT: addl %edi, %eax
	; AVX-NEXT: # kill: def $al killed $al killed $eax			; AVX-NEXT: # kill: def $al killed $al killed $eax
	; AVX-NEXT: retq			; AVX-NEXT: retq
	;			;
	; XOP-LABEL: test_bitreverse_i8:			; XOP-LABEL: test_bitreverse_i8:
	; XOP: # %bb.0:			; XOP: # %bb.0:
	; XOP-NEXT: vmovd %edi, %xmm0			; XOP-NEXT: vmovd %edi, %xmm0
	; XOP-NEXT: vpperm {{.*}}(%rip), %xmm0, %xmm0, %xmm0			; XOP-NEXT: vpperm {{.*}}(%rip), %xmm0, %xmm0, %xmm0
	; XOP-NEXT: vpextrb $0, %xmm0, %eax			; XOP-NEXT: vpextrb $0, %xmm0, %eax

llvm/trunk/test/CodeGen/X86/win_coreclr_chkstk.ll

	; WIN_X64: subq %rax, %rdx			; WIN_X64: subq %rax, %rdx
	; WIN_X64: cmovbq %rcx, %rdx			; WIN_X64: cmovbq %rcx, %rdx
	; WIN_X64: movq %gs:16, %rcx			; WIN_X64: movq %gs:16, %rcx
	; WIN_X64: cmpq %rcx, %rdx			; WIN_X64: cmpq %rcx, %rdx
	; WIN_X64: jae .LBB0_3			; WIN_X64: jae .LBB0_3
	; WIN_X64:# %bb.1:			; WIN_X64:# %bb.1:
	; WIN_X64: andq $-4096, %rdx			; WIN_X64: andq $-4096, %rdx
	; WIN_X64:.LBB0_2:			; WIN_X64:.LBB0_2:
	; WIN_X64: leaq -4096(%rcx), %rcx			; WIN_X64: addq $-4096, %rcx
	; WIN_X64: movb $0, (%rcx)			; WIN_X64: movb $0, (%rcx)
	; WIN_X64: cmpq %rcx, %rdx			; WIN_X64: cmpq %rcx, %rdx
	; WIN_X64: jne .LBB0_2			; WIN_X64: jne .LBB0_2
	; WIN_X64:.LBB0_3:			; WIN_X64:.LBB0_3:
	; WIN_X64: subq %rax, %rsp			; WIN_X64: subq %rax, %rsp
	; WIN_X64: xorl %eax, %eax			; WIN_X64: xorl %eax, %eax
	; WIN_X64: addq $4096, %rsp			; WIN_X64: addq $4096, %rsp
	; WIN_X64: retq			; WIN_X64: retq

llvm/trunk/test/Transforms/LoopStrengthReduce/X86/ivchain-X86.ll

	; X32-NEXT: movl {{[0-9]+}}(%esp), %edi			; X32-NEXT: movl {{[0-9]+}}(%esp), %edi
	; X32-NEXT: addl %esi, %edi			; X32-NEXT: addl %esi, %edi
	; X32-NEXT: shll $2, %ecx			; X32-NEXT: shll $2, %ecx
	; X32-NEXT: .p2align 4, 0x90			; X32-NEXT: .p2align 4, 0x90
	; X32-NEXT: .LBB2_2: # %for.body			; X32-NEXT: .LBB2_2: # %for.body
	; X32-NEXT: # =>This Inner Loop Header: Depth=1			; X32-NEXT: # =>This Inner Loop Header: Depth=1
	; X32-NEXT: movl (%ebx,%esi), %ebp			; X32-NEXT: movl (%ebx,%esi), %ebp
	; X32-NEXT: addl (%ebx), %ebp			; X32-NEXT: addl (%ebx), %ebp
	; X32-NEXT: leal (%ebx,%esi), %ebx			; X32-NEXT: addl %esi, %ebx
	; X32-NEXT: addl (%esi,%ebx), %ebp			; X32-NEXT: addl (%esi,%ebx), %ebp
	; X32-NEXT: leal (%ebx,%esi), %ebx			; X32-NEXT: addl %esi, %ebx
	; X32-NEXT: addl (%esi,%ebx), %ebp			; X32-NEXT: addl (%esi,%ebx), %ebp
	; X32-NEXT: leal (%ebx,%esi), %ebx			; X32-NEXT: addl %esi, %ebx
	; X32-NEXT: addl (%esi,%ebx), %ebp			; X32-NEXT: addl (%esi,%ebx), %ebp
	; X32-NEXT: movl %ebp, (%edx)			; X32-NEXT: movl %ebp, (%edx)
	; X32-NEXT: leal (%ebx,%esi), %ebx			; X32-NEXT: addl %esi, %ebx
	; X32-NEXT: addl %edi, %ebx			; X32-NEXT: addl %edi, %ebx
	; X32-NEXT: addl %ecx, %edx			; X32-NEXT: addl %ecx, %edx
	; X32-NEXT: decl %eax			; X32-NEXT: decl %eax
	; X32-NEXT: jne .LBB2_2			; X32-NEXT: jne .LBB2_2
	; X32-NEXT: .LBB2_3: # %for.end			; X32-NEXT: .LBB2_3: # %for.end
	; X32-NEXT: popl %esi			; X32-NEXT: popl %esi
	; X32-NEXT: popl %edi			; X32-NEXT: popl %edi
	; X32-NEXT: popl %ebx			; X32-NEXT: popl %ebx
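
The test updates above all follow one pattern: an LEA whose destination is also one of its inputs becomes a two-address ADD. The following is a minimal, self-contained sketch of that rule as a toy model. It is not the code in X86FixupLEAs.cpp and uses no LLVM APIs; `Lea`, `to32`, `rewriteTwoAddrLea`, and the `narrow` and `flagsDead` parameters are hypothetical names introduced only for illustration. The `flagsDead` parameter reflects the assumption that the rewrite is only safe when EFLAGS is not live, since ADD writes flags and LEA does not. The INC/DEC special case is left out.

```cpp
#include <cassert>
#include <optional>
#include <string>

// Toy model of an x86 LEA: dst = base + scale * index + disp.
// Registers are plain names; "" stands in for $noreg.
struct Lea {
    std::string dst;
    std::string base;
    std::string index;
    int scale;
    long disp;
};

// Hypothetical helper: map a 64-bit GPR name to its 32-bit sub-register,
// e.g. "rax" -> "eax", "r13" -> "r13d". Other names are returned unchanged.
std::string to32(const std::string &reg) {
    if (reg.size() < 2 || reg[0] != 'r') return reg;
    if (reg[1] >= '0' && reg[1] <= '9') return reg + "d";
    return "e" + reg.substr(1);
}

// Returns the AT&T operands ("src, dst") of an equivalent two-address ADD,
// or std::nullopt if the LEA should be left alone.
//   narrow    - models the LEA64_32r form: 64-bit inputs, 32-bit destination,
//               so inputs are mapped to 32 bits before comparing with dst.
//   flagsDead - assumption: the caller has established EFLAGS is not live.
std::optional<std::string> rewriteTwoAddrLea(const Lea &lea, bool narrow,
                                             bool flagsDead) {
    if (!flagsDead || lea.scale != 1) return std::nullopt;

    const std::string base = narrow ? to32(lea.base) : lea.base;
    const std::string index = narrow ? to32(lea.index) : lea.index;

    // lea (base,index), dst  ->  add other, dst   when dst is base or index.
    if (lea.disp == 0 && !base.empty() && !index.empty()) {
        if (base == lea.dst) return "%" + index + ", %" + lea.dst;
        if (index == lea.dst) return "%" + base + ", %" + lea.dst;
        return std::nullopt;
    }

    // lea disp(base), dst  ->  add $disp, dst   when dst is base.
    if (lea.disp != 0 && index.empty() && base == lea.dst)
        return "$" + std::to_string(lea.disp) + ", %" + lea.dst;

    return std::nullopt;
}
```

For example, under this model `Lea{"eax", "rdi", "rax", 1, 0}` with `narrow` set corresponds to `leal (%rdi,%rax), %eax` and yields the operands of `addl %edi, %eax`, matching the rotate-extract.ll change above.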

Revision Contents

Diff 201392

llvm/trunk/lib/Target/X86/X86FixupLEAs.cpp

llvm/trunk/test/CodeGen/X86/GlobalISel/add-ext.ll

llvm/trunk/test/CodeGen/X86/GlobalISel/callingconv.ll

llvm/trunk/test/CodeGen/X86/GlobalISel/gep.ll

llvm/trunk/test/CodeGen/X86/GlobalISel/memop-scalar.ll

llvm/trunk/test/CodeGen/X86/MergeConsecutiveStores.ll

llvm/trunk/test/CodeGen/X86/atomic-unordered.ll

llvm/trunk/test/CodeGen/X86/avx512vl-intrinsics-upgrade.ll

llvm/trunk/test/CodeGen/X86/bitreverse.ll

llvm/trunk/test/CodeGen/X86/bswap_tree2.ll

llvm/trunk/test/CodeGen/X86/bypass-slow-division-32.ll

llvm/trunk/test/CodeGen/X86/combine-srem.ll

llvm/trunk/test/CodeGen/X86/dagcombine-shifts.ll

llvm/trunk/test/CodeGen/X86/fixup-bw-copy.ll

llvm/trunk/test/CodeGen/X86/fixup-lea.ll

llvm/trunk/test/CodeGen/X86/imul.ll

llvm/trunk/test/CodeGen/X86/leaFixup32.mir

llvm/trunk/test/CodeGen/X86/leaFixup64.mir

llvm/trunk/test/CodeGen/X86/mul-constant-i16.ll

llvm/trunk/test/CodeGen/X86/mul-constant-i32.ll

llvm/trunk/test/CodeGen/X86/mul-constant-i64.ll

llvm/trunk/test/CodeGen/X86/mul-constant-i8.ll

llvm/trunk/test/CodeGen/X86/popcnt.ll

llvm/trunk/test/CodeGen/X86/ragreedy-hoist-spill.ll

llvm/trunk/test/CodeGen/X86/reverse_branches.ll

llvm/trunk/test/CodeGen/X86/rotate-extract.ll

llvm/trunk/test/CodeGen/X86/sat-add.ll

llvm/trunk/test/CodeGen/X86/twoaddr-lea.ll

llvm/trunk/test/CodeGen/X86/vector-bitreverse.ll

llvm/trunk/test/CodeGen/X86/win_coreclr_chkstk.ll

llvm/trunk/test/Transforms/LoopStrengthReduce/X86/ivchain-X86.ll