Download Raw Diff

Details

Reviewers

qcolombet
sanjoy
craig.topper
sunfish
sebpop
kparzysz
junbuml
javed.absar
delena

Commits

rG7b88bab386db: [LSR] Combine unfolded offset into invariant register
rG5012e7f6acaf: [LSR] Combine unfolded offset into invariant register
rL346390: [LSR] Combine unfolded offset into invariant register
rL345114: [LSR] Combine unfolded offset into invariant register

Summary

LSR reassociates constants as unfolded offsets when the constants fit as immediate add operands, which currently prevents such constants from being combined later with loop invariant registers.

This patch modifies GenerateCombinations() to generate a second formula which includes the unfolded offset in the combined loop-invariant register.

Diff Detail

Repository: rL LLVM

Event Timeline

gilr created this revision.Sep 10 2018, 8:57 AM

Herald added a reviewer: javed.absar. · View Herald TranscriptSep 10 2018, 8:57 AM

Herald added a subscriber: llvm-commits. · View Herald Transcript

Ping

LGTM.

One question, what is the impact on compile time?
Put differently are the heuristics to prune the search space able to still reduce the space to something manageable?

This revision is now accepted and ready to land.Oct 8 2018, 3:26 PM

Thanks Quentin!
Regarding effect on compile time please see attached CTMark results on x86.

compile-time-diff-merge-min.txt6 KBDownload

Thanks for the detailed numbers!

Closed by commit rL345114: [LSR] Combine unfolded offset into invariant register (authored by gilr). · Explain WhyOct 24 2018, 12:10 AM

This revision was automatically updated to reflect the committed changes.

gilr reopened this revision.Oct 28 2018, 5:55 AM

This revision is now accepted and ready to land.Oct 28 2018, 5:55 AM

Bug fix: ScalarEvolution::getAddExpr() may modify the vector of SCEVs it is given as argument, causing 2nd formula to be created with only part of the SCEVs in Ops.
This patch fixes this by using a temporary copy of Ops for the 1st formula.

gilr added a reviewer: delena.Oct 28 2018, 6:04 AM

gilr requested review of this revision.Oct 28 2018, 1:01 PM

Bug fix: ScalarEvolution::getAddExpr() may modify the vector of SCEVs it is given as argument, causing 2nd formula to be created with only part of the SCEVs in Ops.
This patch fixes this by using a temporary copy of Ops for the 1st formula.

Is this covered by an existing test case?
If not, could you add one?

In D51861#1280671, @qcolombet wrote:

Is this covered by an existing test case?
If not, could you add one?

The original patch failed several runtime tests of the LLVM test-suite: n-body, oggenc, HPCCG and PENNANT, where LSR produced an incorrect yet legal formula based on the modified vector of SCEVs, leading to semantically wrong generated code.
Fully covering this bug by a LIT test requires a test case where (a) LSR generates both combinations, (b) SCEV erases one of the registers from the Ops vector and (c) the second combination is included in the solution. The LIT tests included in the original patch do (a) + (c) but I couldn't get them to also do (b). The failing runtime tests do (a) and (b) but not (c).

The failing runtime tests do (a) and (b) but not (c).

I am confused. If they don't do (c) how can they fail?

The original patch failed several runtime tests of the LLVM test-suite: [...]

Can't we extract a test from there?

I am confused. If they don't do (c) how can they fail?

The bad formula is cheap (due to the missing register) and therefore enters the solution and produces legal yet incorrect code which fails at runtime. With the fix, the correct formula doesn't make it to the solution and doesn't affect the generated code.

Can't we extract a test from there?

Since testing for optimized or illegal code in these cases isn't an option in these cases, such a LIT could check that the incorrect code is not generated, that the incorrect formula is not generated, or that the correct formula is generated - which all seem fragile and overfitted to this specific bug, so I'm reluctant to add any of them (especially since the bug is covered by 4 runtime tests). What do you think?

Since testing for optimized or illegal code in these cases isn't an option in these cases, such a LIT could check that the incorrect code is not generated, that the incorrect formula is not generated, or that the correct formula is generated - which all seem fragile and overfitted to this specific bug, so I'm reluctant to add any of them (especially since the bug is covered by 4 runtime tests). What do you think?

As long as what we check if that produces the correct sequence for the problematic case, that doesn't seem more fragile than any of the existing test cases.
I would rather have this small test than looking in runtime issues in general.

Added a test case to cover the bug introduced by the first patch and fixed in the second.

qcolombet added inline comments.Nov 5 2018, 8:55 AM

test/Transforms/LoopStrengthReduce/two-combinations-bug.ll
2 ↗	(On Diff #172576)	Could you reduce the test even more? E.g., I usually had some success by adding an assert for the case that interests me and bugpoint on this specific assert (via custom script) Could we check that the transformation looks good instead of checking the debug output? Put differently can we write a test that works for release as well?

Test case for the two-combinations bug greatly reduced by bugpoint (thanks, Quentin!) and checks the generated IR instead of LSR's debug prints.

Hi Gil,

Looks good to me.

Nitpicks inlined, no need for another round of review.

Cheers,
-Quentin

test/Transforms/LoopStrengthReduce/two-combinations-bug.ll
3 ↗	(On Diff #172961)	adopted => adapted?
43 ↗	(On Diff #172961)	Could you get rid of the implicit variable name? (%[0-9]+) Those are painful when manually editing tests. `opt -instnamer` does that for you, but given you have just one, you may want to do it by hand.
43 ↗	(On Diff #172961)	Can we get rid of the metadata? There is an `opt -stripSomethingSomething` if you don't want to do it manually.

This revision is now accepted and ready to land.Nov 7 2018, 10:42 AM

Closed by commit rL346390: [LSR] Combine unfolded offset into invariant register (authored by gilr). · Explain WhyNov 8 2018, 1:04 AM

This revision was automatically updated to reflect the committed changes.

Diff 173126

llvm/trunk/lib/Transforms/Scalar/LoopStrengthReduce.cpp

Show First 20 Lines • Show All 3,632 Lines • ▼ Show 20 Lines	GenerateReassociationsImpl(LU, LUIdx, Base, Depth,
/* Idx / -1, / IsScaledReg */ true);		/* Idx / -1, / IsScaledReg */ true);
}		}

/// Generate a formula consisting of all of the loop-dominating registers added		/// Generate a formula consisting of all of the loop-dominating registers added
/// into a single register.		/// into a single register.
void LSRInstance::GenerateCombinations(LSRUse &LU, unsigned LUIdx,		void LSRInstance::GenerateCombinations(LSRUse &LU, unsigned LUIdx,
Formula Base) {		Formula Base) {
// This method is only interesting on a plurality of registers.		// This method is only interesting on a plurality of registers.
if (Base.BaseRegs.size() + (Base.Scale == 1) <= 1)		if (Base.BaseRegs.size() + (Base.Scale == 1) +
		(Base.UnfoldedOffset != 0) <= 1)
return;		return;

// Flatten the representation, i.e., reg1 + 1*reg2 => reg1 + reg2, before		// Flatten the representation, i.e., reg1 + 1*reg2 => reg1 + reg2, before
// processing the formula.		// processing the formula.
Base.unscale();		Base.unscale();
Formula F = Base;
F.BaseRegs.clear();
SmallVector<const SCEV *, 4> Ops;		SmallVector<const SCEV *, 4> Ops;
		Formula NewBase = Base;
		NewBase.BaseRegs.clear();
		Type *CombinedIntegerType = nullptr;
for (const SCEV *BaseReg : Base.BaseRegs) {		for (const SCEV *BaseReg : Base.BaseRegs) {
if (SE.properlyDominates(BaseReg, L->getHeader()) &&		if (SE.properlyDominates(BaseReg, L->getHeader()) &&
!SE.hasComputableLoopEvolution(BaseReg, L))		!SE.hasComputableLoopEvolution(BaseReg, L)) {
		if (!CombinedIntegerType)
		CombinedIntegerType = SE.getEffectiveSCEVType(BaseReg->getType());
Ops.push_back(BaseReg);		Ops.push_back(BaseReg);
		}
else		else
F.BaseRegs.push_back(BaseReg);		NewBase.BaseRegs.push_back(BaseReg);
}		}
if (Ops.size() > 1) {
const SCEV *Sum = SE.getAddExpr(Ops);		// If no register is relevant, we're done.
		if (Ops.size() == 0)
		return;

		// Utility function for generating the required variants of the combined
		// registers.
		auto GenerateFormula = [&](const SCEV *Sum) {
		Formula F = NewBase;

// TODO: If Sum is zero, it probably means ScalarEvolution missed an		// TODO: If Sum is zero, it probably means ScalarEvolution missed an
// opportunity to fold something. For now, just ignore such cases		// opportunity to fold something. For now, just ignore such cases
// rather than proceed with zero in a register.		// rather than proceed with zero in a register.
if (!Sum->isZero()) {		if (Sum->isZero())
		return;

F.BaseRegs.push_back(Sum);		F.BaseRegs.push_back(Sum);
F.canonicalize(*L);		F.canonicalize(*L);
(void)InsertFormula(LU, LUIdx, F);		(void)InsertFormula(LU, LUIdx, F);
		};

		// If we collected at least two registers, generate a formula combining them.
		if (Ops.size() > 1) {
		SmallVector<const SCEV *, 4> OpsCopy(Ops); // Don't let SE modify Ops.
		GenerateFormula(SE.getAddExpr(OpsCopy));
}		}

		// If we have an unfolded offset, generate a formula combining it with the
		// registers collected.
		if (NewBase.UnfoldedOffset) {
		assert(CombinedIntegerType && "Missing a type for the unfolded offset");
		Ops.push_back(SE.getConstant(CombinedIntegerType, NewBase.UnfoldedOffset,
		true));
		NewBase.UnfoldedOffset = 0;
		GenerateFormula(SE.getAddExpr(Ops));
}		}
}		}

/// Helper function for LSRInstance::GenerateSymbolicOffsets.		/// Helper function for LSRInstance::GenerateSymbolicOffsets.
void LSRInstance::GenerateSymbolicOffsetsImpl(LSRUse &LU, unsigned LUIdx,		void LSRInstance::GenerateSymbolicOffsetsImpl(LSRUse &LU, unsigned LUIdx,
const Formula &Base, size_t Idx,		const Formula &Base, size_t Idx,
bool IsScaledReg) {		bool IsScaledReg) {
const SCEV *G = IsScaledReg ? Base.ScaledReg : Base.BaseRegs[Idx];		const SCEV *G = IsScaledReg ? Base.ScaledReg : Base.BaseRegs[Idx];
▲ Show 20 Lines • Show All 1,924 Lines • Show Last 20 Lines

llvm/trunk/test/Transforms/LoopStrengthReduce/AArch64/small-constant.ll

	; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py

	; RUN: llc < %s -mtriple=aarch64-unknown-unknown \| FileCheck %s			; RUN: llc < %s -mtriple=aarch64-unknown-unknown \| FileCheck %s

	; LSR doesn't consider bumping a pointer by constants outside the loop when the			; Test LSR for giving small constants, which get re-associated as unfolded
	; constants fit as immediate add operands. The constants are re-associated as an			; offset, a chance to get combined with loop-invariant registers (same as
	; unfolded offset rather than a register and are not combined later with			; large constants which do not fit as add immediate operands). LSR
	; loop-invariant registers. For large-enough constants LSR produces better			; favors here to bump the base pointer outside the loop.
	; solutions for these test cases, with test1 switching from:
	;
	; The chosen solution requires 2 instructions 2 regs, with addrec cost 1, plus 1 scale cost, plus 4 imm cost, plus 1 setup cost:
	; LSR Use: Kind=ICmpZero, Offsets={0}, widest fixup type: i64
	; -7 + reg({(7 + %start)<nsw>,+,1}<nsw><%for.body>)
	; LSR Use: Kind=Address of float in addrspace(0), Offsets={0}, widest fixup type: float*
	; reg(%arr) + 4*reg({(7 + %start)<nsw>,+,1}<nsw><%for.body>)
	;
	; to:
	;
	; The chosen solution requires 1 instruction 2 regs, with addrec cost 1, plus 1 scale cost, plus 1 setup cost:
	; LSR Use: Kind=ICmpZero, Offsets={0}, widest fixup type: i64
	; reg({%start,+,1}<nsw><%for.body>)
	; LSR Use: Kind=Address of float in addrspace(0), Offsets={0}, widest fixup type: float*
	; reg((88888 + %arr)) + 4*reg({%start,+,1}<nsw><%for.body>)
	;
	; and test2 switching from:
	;
	; The chosen solution requires 2 instructions 2 regs, with addrec cost 1, plus 1 base add, plus 1 scale cost:
	; LSR Use: Kind=ICmpZero, Offsets={0}, widest fixup type: i64
	; reg({%start,+,1}<nsw><%for.body>)
	; LSR Use: Kind=Basic, Offsets={0}, widest fixup type: i64
	; reg({%start,+,1}<nsw><%for.body>)
	; LSR Use: Kind=Address of float in addrspace(0), Offsets={0}, widest fixup type: float*
	; reg(%arr) + 4*reg({%start,+,1}<nsw><%for.body>) + imm(28)
	;
	; to:
	;
	; The chosen solution requires 1 instruction 2 regs, with addrec cost 1, plus 1 scale cost, plus 1 setup cost:
	; LSR Use: Kind=ICmpZero, Offsets={0}, widest fixup type: i64
	; reg({%start,+,1}<nsw><%for.body>)
	; LSR Use: Kind=Basic, Offsets={0}, widest fixup type: i64
	; reg({%start,+,1}<nsw><%for.body>)
	; LSR Use: Kind=Address of float in addrspace(0), Offsets={0}, widest fixup type: float*
	; reg((88888 + %arr)) + 4*reg({%start,+,1}<nsw><%for.body>)

	; float test(float *arr, long long start, float threshold) {			; float test(float *arr, long long start, float threshold) {
	; for (long long i = start; i != 0; ++i) {			; for (long long i = start; i != 0; ++i) {
	; float x = arr[i + 7];			; float x = arr[i + 7];
	; if (x > threshold)			; if (x > threshold)
	; return x;			; return x;
	; }			; }
	; return -7;			; return -7;
	; }			; }
	define float @test1(float* nocapture readonly %arr, i64 %start, float %threshold) {			define float @test1(float* nocapture readonly %arr, i64 %start, float %threshold) {
	; CHECK-LABEL: test1:			; CHECK-LABEL: test1:
	; CHECK: // %bb.0: // %entry			; CHECK: // %bb.0: // %entry
	; CHECK-NEXT: fmov s2, #-7.00000000			; CHECK-NEXT: fmov s2, #-7.00000000
	; CHECK-NEXT: cbz x1, .LBB0_5			; CHECK-NEXT: cbz x1, .LBB0_5
	; CHECK-NEXT: // %bb.1: // %for.body.preheader			; CHECK-NEXT: // %bb.1: // %for.body.preheader
	; CHECK-NEXT: add x8, x1, #7 // =7			; CHECK-NEXT: add x8, x0, #28 // =28
	; CHECK-NEXT: .LBB0_2: // %for.body			; CHECK-NEXT: .LBB0_2: // %for.body
	; CHECK-NEXT: // =>This Inner Loop Header: Depth=1			; CHECK-NEXT: // =>This Inner Loop Header: Depth=1
	; CHECK-NEXT: ldr s1, [x0, x8, lsl #2]			; CHECK-NEXT: ldr s1, [x8, x1, lsl #2]
	; CHECK-NEXT: fcmp s1, s0			; CHECK-NEXT: fcmp s1, s0
	; CHECK-NEXT: b.gt .LBB0_6			; CHECK-NEXT: b.gt .LBB0_6
	; CHECK-NEXT: // %bb.3: // %for.cond			; CHECK-NEXT: // %bb.3: // %for.cond
	; CHECK-NEXT: // in Loop: Header=BB0_2 Depth=1			; CHECK-NEXT: // in Loop: Header=BB0_2 Depth=1
	; CHECK-NEXT: add x8, x8, #1 // =1			; CHECK-NEXT: add x1, x1, #1 // =1
	; CHECK-NEXT: cmp x8, #7 // =7			; CHECK-NEXT: cbnz x1, .LBB0_2
	; CHECK-NEXT: b.ne .LBB0_2
	; CHECK-NEXT: // %bb.4:			; CHECK-NEXT: // %bb.4:
	; CHECK-NEXT: mov v0.16b, v2.16b			; CHECK-NEXT: mov v0.16b, v2.16b
	; CHECK-NEXT: ret			; CHECK-NEXT: ret
	; CHECK-NEXT: .LBB0_5:			; CHECK-NEXT: .LBB0_5:
	; CHECK-NEXT: mov v0.16b, v2.16b			; CHECK-NEXT: mov v0.16b, v2.16b
	; CHECK-NEXT: ret			; CHECK-NEXT: ret
	; CHECK-NEXT: .LBB0_6: // %cleanup2			; CHECK-NEXT: .LBB0_6: // %cleanup2
	; CHECK-NEXT: mov v0.16b, v1.16b			; CHECK-NEXT: mov v0.16b, v1.16b
	Show All 21 Lines
	}			}

	; Same as test1, except i has another use:			; Same as test1, except i has another use:
	; if (x > threshold) ---> if (x > threshold + i)			; if (x > threshold) ---> if (x > threshold + i)
	define float @test2(float* nocapture readonly %arr, i64 %start, float %threshold) {			define float @test2(float* nocapture readonly %arr, i64 %start, float %threshold) {
	; CHECK-LABEL: test2:			; CHECK-LABEL: test2:
	; CHECK: // %bb.0: // %entry			; CHECK: // %bb.0: // %entry
	; CHECK-NEXT: fmov s2, #-7.00000000			; CHECK-NEXT: fmov s2, #-7.00000000
	; CHECK-NEXT: cbz x1, .LBB1_4			; CHECK-NEXT: cbz x1, .LBB1_5
	; CHECK-NEXT: .LBB1_1: // %for.body			; CHECK-NEXT: // %bb.1: // %for.body.preheader
				; CHECK-NEXT: add x8, x0, #28 // =28
				; CHECK-NEXT: .LBB1_2: // %for.body
	; CHECK-NEXT: // =>This Inner Loop Header: Depth=1			; CHECK-NEXT: // =>This Inner Loop Header: Depth=1
	; CHECK-NEXT: add x8, x0, x1, lsl #2			; CHECK-NEXT: ldr s1, [x8, x1, lsl #2]
	; CHECK-NEXT: ldr s1, [x8, #28]
	; CHECK-NEXT: scvtf s3, x1			; CHECK-NEXT: scvtf s3, x1
	; CHECK-NEXT: fadd s3, s3, s0			; CHECK-NEXT: fadd s3, s3, s0
	; CHECK-NEXT: fcmp s1, s3			; CHECK-NEXT: fcmp s1, s3
	; CHECK-NEXT: b.gt .LBB1_5			; CHECK-NEXT: b.gt .LBB1_6
	; CHECK-NEXT: // %bb.2: // %for.cond			; CHECK-NEXT: // %bb.3: // %for.cond
	; CHECK-NEXT: // in Loop: Header=BB1_1 Depth=1			; CHECK-NEXT: // in Loop: Header=BB1_2 Depth=1
	; CHECK-NEXT: add x1, x1, #1 // =1			; CHECK-NEXT: add x1, x1, #1 // =1
	; CHECK-NEXT: cbnz x1, .LBB1_1			; CHECK-NEXT: cbnz x1, .LBB1_2
	; CHECK-NEXT: // %bb.3:			; CHECK-NEXT: // %bb.4:
	; CHECK-NEXT: mov v0.16b, v2.16b			; CHECK-NEXT: mov v0.16b, v2.16b
	; CHECK-NEXT: ret			; CHECK-NEXT: ret
	; CHECK-NEXT: .LBB1_4:			; CHECK-NEXT: .LBB1_5:
	; CHECK-NEXT: mov v0.16b, v2.16b			; CHECK-NEXT: mov v0.16b, v2.16b
	; CHECK-NEXT: ret			; CHECK-NEXT: ret
	; CHECK-NEXT: .LBB1_5: // %cleanup4			; CHECK-NEXT: .LBB1_6: // %cleanup4
	; CHECK-NEXT: mov v0.16b, v1.16b			; CHECK-NEXT: mov v0.16b, v1.16b
	; CHECK-NEXT: ret			; CHECK-NEXT: ret
	entry:			entry:
	%cmp14 = icmp eq i64 %start, 0			%cmp14 = icmp eq i64 %start, 0
	br i1 %cmp14, label %cleanup4, label %for.body			br i1 %cmp14, label %cleanup4, label %for.body

	for.cond: ; preds = %for.body			for.cond: ; preds = %for.body
	%cmp = icmp eq i64 %inc, 0			%cmp = icmp eq i64 %inc, 0
	Show All 17 Lines

llvm/trunk/test/Transforms/LoopStrengthReduce/two-combinations-bug.ll

				; RUN: opt < %s -loop-reduce -S \| FileCheck %s

				; This test is adapted from the n-body test of the LLVM test-suite: A bug in
				; r345114 caused LSR to generate incorrect code. The test verifies that the
				; induction variable generated for the inner loop depends on the induction
				; variable of the outer loop.

				target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
				target triple = "x86_64-unknown-linux-gnu"

				%struct.planet.0.3.6.11.12.15.16.17.24.25.26.33.44 = type { double, double, double, double, double, double, double }

				; Function Attrs: nounwind uwtable
				define dso_local void @advance(i32 %nbodies, %struct.planet.0.3.6.11.12.15.16.17.24.25.26.33.44* nocapture %bodies) local_unnamed_addr #0 {
				; CHECK-LABEL: @advance(
				; CHECK: for.cond.loopexit:
				; CHECK: [[LSR_IV_NEXT:%.]] = add i64 [[LSR_IV:%.]], -1
				; CHECK: br label %for.body
				; CHECK: for.body:
				; CHECK: [[LSR_IV]] = phi i64 [ [[LSR_IV_NEXT]]
				; CHECK: br label %for.body3
				; CHECK: for.body3:
				; CHECK: [[LSR_IV1:%.]] = phi i64 [ [[LSR_IV_NEXT2:%.]], %for.body3 ], [ [[LSR_IV]], %for.body ]
				; CHECK: [[LSR_IV_NEXT2]] = add i64 [[LSR_IV1]], -1
				; CHECK: [[EXITCOND:%.*]] = icmp eq i64 [[LSR_IV_NEXT2]], 0
				; CHECK: br i1 [[EXITCOND]], label %for.cond.loopexit, label %for.body3
				;
				entry:
				%wide.trip.count = zext i32 %nbodies to i64
				br label %for.body

				for.cond.loopexit: ; preds = %for.body3
				%indvars.iv.next = add nuw nsw i64 %indvars.iv, 1
				br label %for.body

				for.body: ; preds = %for.cond.loopexit, %entry
				%indvars.iv = phi i64 [ 1, %entry ], [ %indvars.iv.next, %for.cond.loopexit ]
				br label %for.body3

				for.body3: ; preds = %for.body3, %for.body
				%indvars.iv98 = phi i64 [ %indvars.iv, %for.body ], [ %indvars.iv.next99, %for.body3 ]
				%z9 = getelementptr inbounds %struct.planet.0.3.6.11.12.15.16.17.24.25.26.33.44, %struct.planet.0.3.6.11.12.15.16.17.24.25.26.33.44* %bodies, i64 %indvars.iv98, i32 2
				%tmp = load double, double* %z9, align 8, !tbaa !0
				%indvars.iv.next99 = add nuw nsw i64 %indvars.iv98, 1
				%exitcond = icmp eq i64 %indvars.iv.next99, %wide.trip.count
				br i1 %exitcond, label %for.cond.loopexit, label %for.body3
				}

				attributes #0 = { nounwind uwtable "correctly-rounded-divide-sqrt-fp-math"="false" "disable-tail-calls"="false" "less-precise-fpmad"="false" "no-frame-pointer-elim"="false" "no-infs-fp-math"="false" "no-jump-tables"="false" "no-nans-fp-math"="false" "no-signed-zeros-fp-math"="false" "no-trapping-math"="false" "stack-protector-buffer-size"="8" "target-cpu"="x86-64" "target-features"="+fxsr,+mmx,+sse,+sse2,+x87" "unsafe-fp-math"="false" "use-soft-float"="false" }

				!0 = !{!1, !2, i64 16}
				!1 = !{!"planet", !2, i64 0, !2, i64 8, !2, i64 16, !2, i64 24, !2, i64 32, !2, i64 40, !2, i64 48}
				!2 = !{!"double", !3, i64 0}
				!3 = !{!"omnipotent char", !4, i64 0}
				!4 = !{!"Simple C/C++ TBAA"}

This is an archive of the discontinued LLVM Phabricator instance.

[LSR] Combine unfolded offset into invariant register
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 173126

llvm/trunk/lib/Transforms/Scalar/LoopStrengthReduce.cpp

llvm/trunk/test/Transforms/LoopStrengthReduce/AArch64/small-constant.ll

llvm/trunk/test/Transforms/LoopStrengthReduce/two-combinations-bug.ll

This is an archive of the discontinued LLVM Phabricator instance.

[LSR] Combine unfolded offset into invariant registerClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 173126

llvm/trunk/lib/Transforms/Scalar/LoopStrengthReduce.cpp

llvm/trunk/test/Transforms/LoopStrengthReduce/AArch64/small-constant.ll

llvm/trunk/test/Transforms/LoopStrengthReduce/two-combinations-bug.ll

[LSR] Combine unfolded offset into invariant register
ClosedPublic