As discussed in PR38938, we fail to emit BEXTR if the mask is shifted.
We can't handle that in X86DAGToDAGISel before the address mode for the `inc` is selected,
and we can't really do it in the generic DAGCombine either, because there is no generic ISD::BitFieldExtract node,
and if we simply turn the shifted mask into a normal mask plus a shift-left, that will just be folded back.
So X86ISelLowering seems to be the place to handle this.
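For concreteness, a minimal scalar model of the pattern in question (illustrative constants, not taken from the PR): an AND with a shifted mask is a BEXTR followed by a shift-left, and it is that trailing shift-left which can later fold into an address computation.

```cpp
#include <cassert>
#include <cstdint>

// Scalar model of BEXTR: extract Len bits of X starting at bit Start.
static uint64_t bextr(uint64_t X, unsigned Start, unsigned Len) {
  uint64_t Mask = Len >= 64 ? ~0ULL : (1ULL << Len) - 1;
  return (X >> Start) & Mask;
}

int main() {
  uint64_t X = 0xDEADBEEFCAFEBABEULL;
  // "Shifted mask": keep bits [10, 18) of X in place.
  uint64_t ShiftedMask = X & (0xFFULL << 10);
  // Equivalent form: extract the field, then shift it back up.
  assert((bextr(X, 10, 8) << 10) == ShiftedMask);
  return 0;
}
```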
Diff Detail: Repository rL LLVM
Event Timeline
| test/CodeGen/X86/tbm_patterns.ll | | |
|---|---|---|
| 52–64 | (On Diff #166697) | This seems to be a miscompile; BEXTR does not touch EFLAGS? |
| test/CodeGen/X86/tbm_patterns.ll | | |
|---|---|---|
| 52–64 | (On Diff #166697) | Aha, it is not; I just can't read. |
| lib/Target/X86/X86ISelLowering.cpp | | |
|---|---|---|
| 35252 | (On Diff #166697) | Remove the call to dump(). |
| 35308 | (On Diff #166697) | Remove the call to dump(). |
| test/CodeGen/X86/extract-bits.ll | | |
|---|---|---|
| 5797 | (On Diff #166697) | This is not an improvement. We traded a shift-right plus an AND for a move-immediate, a 2-uop BEXTR, and a shift-left, so we went from 2 uops to 4, at least on Haswell. |
| test/CodeGen/X86/extract-bits.ll | | |
|---|---|---|
| 5797 | (On Diff #166697) | It's still an increase in instruction count even on AMD in the BMI1 case; we went from 2 uops to 3. We'd only be OK with the TBM BEXTRI instruction. In the case from PR38938 we were able to fold the SHL into an addressing calculation, which made it beneficial. |
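To make the trade-off concrete, here is the kind of before/after the uop counts refer to, modeled in C++ with illustrative constants (this is not the exact test case from extract-bits.ll):

```cpp
#include <cstdint>

// Before: the shifted mask is matched directly. Roughly:
//   shrq $2, %rdi; andq $0x3FC, %rdi             -> 2 uops
uint64_t viaShrAnd(uint64_t X) {
  return (X >> 2) & (0xFFULL << 2);
}

// After: BEXTR plus a rebuilt shift. Roughly:
//   movl $0x804, %eax        (control: start=4, len=8) -> 1 uop
//   bextrq %rax, %rdi, %rax                            -> 2 uops on Haswell
//   shlq $2, %rax                                      -> 1 uop
// 4 uops on Haswell; 3 on AMD, where BEXTR is a single uop.
uint64_t viaBextrShl(uint64_t X) {
  return ((X >> 4) & 0xFF) << 2; // same bits [4, 12) of X, kept at [2, 10)
}
```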
I'm not sure matchBEXTRFromAnd() for BMI was ever a good idea for Intel CPUs. I took it out and got performance improvements on several benchmarks in our internal list, with only a couple of regressions, one of which was on a test that's really sensitive to code layout.
Okay, so these simple cases are fine if there is TBM.
But the target from PR38938, AMD Jaguar, does not have TBM.
There really isn't any fundamental difference in the IR between this and D52293,
so I would have guessed they should use the *same* profitability check, correct?
So what would it be? "Has TBM, or is shifting a newly loaded value"?
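One possible shape for that combined check, assuming it lives next to the BEXTR matching code (isBEXTRProfitable is a hypothetical name; this is the heuristic being floated here, not existing code):

```cpp
// Sketch against LLVM's SelectionDAG types, not actual code.
static bool isBEXTRProfitable(SDValue Shifted, const X86Subtarget &Subtarget) {
  // TBM's BEXTRI takes the control word as an immediate: no extra
  // move-immediate uop, so it is always at least break-even.
  if (Subtarget.hasTBM())
    return true;
  // Otherwise only profitable when the shifted value comes straight from
  // a load, so the rebuilt SHL can fold into the address computation.
  return ISD::isNormalLoad(Shifted.getNode());
}
```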
@RKSimon Can you remove the D52293 dependency and just move the current code to the DAG combine so we can fix PR38938?
| lib/Target/X86/X86ISelLowering.cpp | | |
|---|---|---|
| 35407 | (On Diff #167788) | This should probably be below the LegalizeOps check. We should give ample opportunity for AND-based DAG combines to optimize this. |
Move to after isBeforeLegalizeOps().
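For clarity, the placement being asked for, reduced to just this one combine (a sketch only: combineAndIntoBEXTR is a hypothetical helper standing in for the moved matchBEXTRFromAnd logic, and the real combineAnd in X86ISelLowering.cpp does much more than this):

```cpp
// Context: lib/Target/X86/X86ISelLowering.cpp (sketch, not actual code).
static SDValue combineAnd(SDNode *N, SelectionDAG &DAG,
                          TargetLowering::DAGCombinerInfo &DCI,
                          const X86Subtarget &Subtarget) {
  // Give the generic AND-based DAG combines ample opportunity first:
  // bail while we are still before LegalizeOps and retry later.
  if (DCI.isBeforeLegalizeOps())
    return SDValue();

  // Only now try to form BEXTR from the AND.
  return combineAndIntoBEXTR(N, DAG, Subtarget); // hypothetical helper
}
```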
| lib/Target/X86/X86ISelLowering.cpp | | |
|---|---|---|
| 35407 | (On Diff #167788) | I'm not sure about the test for this. |
| test/CodeGen/X86/bmi-x86_64.ll | | |
|---|---|---|
| 105 | (On Diff #167790) | That is how you named it when adding it in rL232580 / https://github.com/llvm-mirror/llvm/commit/cbaefea0c0c1792390375b20c31b7c1fe8d0d2c7. |
| 105 | (On Diff #167790) | Whoops, typo: s/you//. |
I'm a bit worried about these test changes; I thought this patch was about moving the existing code, not altering the pattern-matching features.
I guess I mislabelled it, then; please see the description of this differential for the context.
Would it be better to split this into an NFC code move plus a non-NFC "support shifted mask" change?
For the PR we're trying to fix, perhaps we should be looking at these three similar functions in X86ISelDAGToDAG.cpp that are called when an address computation comes from a SHL+AND:
```cpp
// Try to fold the mask and shift into an extract and scale.
if (!foldMaskAndShiftToExtract(*CurDAG, N, Mask, Shift, X, AM))
  return false;

// Try to fold the mask and shift directly into the scale.
if (!foldMaskAndShiftToScale(*CurDAG, N, Mask, Shift, X, AM))
  return false;

// Try to swap the mask and shift to place shifts which can be done as
// a scale on the outside of the mask.
if (!foldMaskedShiftToScaledMask(*CurDAG, N, Mask, Shift, X, AM))
  return false;
```
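As a sketch of why those folds matter for PR38938 (hypothetical constants, not the code from the PR): an index formed by an AND with a shifted mask is a BEXTR whose trailing shift-left can become the addressing-mode scale, which is what makes the transform profitable there.

```cpp
#include <cstdint>

// X & (0xFF << 3) keeps bits [3, 11) of X in place; that is
// bextr(X, 3, 8) << 3, and the shift-left by 3 maps onto an
// addressing-mode scale of 8, e.g. mov (%base, %idx, 8)-style codegen.
uint8_t loadElement(const uint8_t *Base, uint64_t X) {
  uint64_t Idx = (X >> 3) & 0xFF; // BEXTR: start = 3, len = 8
  return Base[Idx << 3];          // SHL folded into the scale
}
```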