This is an archive of the discontinued LLVM Phabricator instance.

Differential D77058

[Clang] Add llvm.loop.unroll.disable to loops with -fno-unroll-loops.
Closed, Public

Authored by fhahn on Mar 30 2020, 7:11 AM.

Details

Reviewers

Meinersbur
hfinkel
dexonsmith
tejohnson

Commits

rG338be9c59527: [Clang] Add llvm.loop.unroll.disable to loops with -fno-unroll-loops.

Summary

Currently Clang does not respect -fno-unroll-loops during LTO. During the
review of D76916 it was suggested to respect -fno-unroll-loops on a per-TU basis.

This patch uses the existing llvm.loop.unroll.disable metadata to
disable loop unrolling explicitly for each loop in the TU if
unrolling is disabled. This should ensure that loops from TUs compiled
with -fno-unroll-loops are skipped by the unroller during LTO.
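
The per-loop rule described above can be modelled in isolation. The following is a minimal, self-contained sketch of that decision, based on the condition added to CGLoopInfo.cpp in this revision. The names UnrollState, LoopRequest, shouldAddUnrollDisable and checkExamples are hypothetical and are not part of Clang; the struct fields only model the CGOpts and StagedAttrs members that appear in the diff.

```cpp
#include <cassert>

// Hypothetical model of the unroll state a pragma can request. It mirrors the
// LoopAttributes::Unspecified / LoopAttributes::Disable values seen in the diff.
enum class UnrollState { Unspecified, Enable, Disable };

// Hypothetical bundle of the inputs the patch consults for one loop.
struct LoopRequest {
  unsigned OptimizationLevel; // models CGOpts.OptimizationLevel
  bool UnrollLoops;           // models CGOpts.UnrollLoops
  UnrollState PragmaState;    // models StagedAttrs.UnrollEnable
  unsigned PragmaCount;       // models StagedAttrs.UnrollCount (0 = none)
};

// Returns true when the modelled loop would be tagged with
// llvm.loop.unroll.disable under the rule added by this patch.
bool shouldAddUnrollDisable(const LoopRequest &R) {
  if (R.OptimizationLevel == 0)
    return false; // nothing is added at -O0
  if (R.UnrollLoops)
    return false; // unrolling was not disabled for this TU
  // Only act when no pragma has already decided the unroll behaviour.
  return R.PragmaState == UnrollState::Unspecified && R.PragmaCount == 0;
}

// Worked examples of the rule.
void checkExamples() {
  LoopRequest Plain{2, false, UnrollState::Unspecified, 0};
  LoopRequest AtO0{0, false, UnrollState::Unspecified, 0};
  LoopRequest UnrollAllowed{2, true, UnrollState::Unspecified, 0};
  LoopRequest PragmaEnable{2, false, UnrollState::Enable, 0};
  LoopRequest PragmaCount{2, false, UnrollState::Unspecified, 4};

  assert(shouldAddUnrollDisable(Plain));
  assert(!shouldAddUnrollDisable(AtO0));
  assert(!shouldAddUnrollDisable(UnrollAllowed));
  assert(!shouldAddUnrollDisable(PragmaEnable));
  assert(!shouldAddUnrollDisable(PragmaCount));
}
```

Because the decision is recorded on each loop rather than on the function or the TU, it is expected to travel with the loop for as long as later transforms keep the metadata.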

This also means that if a loop from a TU with -fno-unroll-loops
gets inlined into a TU without this option, the loop won't be
unrolled.

Because some transforms might drop loop metadata, there are potentially
cases in which we still unroll loops from TUs compiled with
-fno-unroll-loops. I think we should fix those issues rather than
introduce a function attribute to disable loop unrolling during LTO.
Improving the metadata handling will benefit other use cases, like
various loop pragmas, too. It is also an improvement over Clang completely
ignoring -fno-unroll-loops during LTO.
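
As a small illustration of how the flag and the loop pragmas are meant to interact, here is a hedged sketch of a source file with one plain loop and one loop carrying an explicit pragma. The function names fillDoubled and fillDoubledUnrolled are hypothetical. The comments describe the behaviour expected from the tests in this revision and are not verified compiler output.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// No pragma: when this TU is built at -O1 or higher with -fno-unroll-loops,
// the patch is expected to tag this loop with llvm.loop.unroll.disable.
void fillDoubled(std::vector<int> &List) {
  for (std::size_t I = 0; I < List.size(); ++I)
    List[I] = static_cast<int>(I) * 2;
}

// Explicit pragma: the pragma's decision is expected to win, so this loop
// should keep the metadata requested by the pragma even when the TU is built
// with -fno-unroll-loops.
void fillDoubledUnrolled(std::vector<int> &List) {
  std::size_t I = 0;
#if defined(__clang__)
#pragma unroll
#endif
  while (I < List.size()) {
    List[I] = static_cast<int>(I) * 2;
    ++I;
  }
}
```

Both functions compute the same result; only the loop metadata Clang is expected to attach differs. The pragma is guarded so that the file also builds cleanly with compilers that do not recognise it.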

If that direction looks good, we can use a similar approach to also
respect -fno-vectorize during LTO, at least for LoopVectorize.

In the future, this might also allow us to remove the UnrollLoops option
from LLVM's PassManagerBuilder.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

fhahn created this revision. Mar 30 2020, 7:11 AM

Herald added a project: Restricted Project. Mar 30 2020, 7:11 AM

Herald added a subscriber: zzheng.

fhahn mentioned this in D76916: [Darwin] Respect -fno-unroll-loops during LTO. Mar 30 2020, 7:43 AM

I think this is a good approach, rather than a per-function attribute, since as mentioned this will be preserved through inlining.
@dexonsmith, does that seem reasonable to you? I missed the original patch and agree with you that we don't want to fix this in LTO by passing the option through to LTO.

Harbormaster failed remote builds in B50956: Diff 253589! Mar 30 2020, 8:38 AM

In D77058#1950019, @tejohnson wrote:

I think this is a good approach, rather than a per-function attribute, since as mentioned this will be preserved through inlining.
@dexonsmith, does that seem reasonable to you? I missed the original patch and agree with you that we don't want to fix this in LTO by passing the option through to LTO.

SGTM!

ping.

@tejohnson are you happy with this approach, given that it sounds good to @dexonsmith as well?

yep, LGTM

This revision is now accepted and ready to land. Apr 6 2020, 8:55 AM

Note that loop-metadata is best-effort only and may be forgotten in the optimization pipeline.

Do we also need an equivalent to -Xclang -disable-O0-optnone?

Personally, I don't like the optnone approach: There have been many posts on llvm-dev from people using clang -emit-llvm and being surprised that opt has no effect.

In D77058#1964427, @Meinersbur wrote:

Note that loop-metadata is best-effort only and may be forgotten in the optimization pipeline.

Agreed, that can be a potential issue (I tried to note that in the description), but I think that's pretty much the same issue we have with the loop related pragmas. Ideally there would be even more incentive now to fix the offending transforms.

Do we also need an equivalent to -Xclang -disable-O0-optnone?

Personally, I don't like the optnone approach: There have been many posts on llvm-dev from people using clang -emit-llvm and being surprised that opt has no effect.

Ah yes, optnone is a common pitfall :(

IIUC we don't need a patch similar to this one for optnone, as it already gets added to the function attributes (for -O0) and has an option to disable adding it (-Xclang -disable-O0-optnone) on a per-TU basis.

LGTM, since it continues current practice. optnone will always be the more annoying.

In D77058#1964714, @fhahn wrote:

IIUC we don't need a patch similar to this one for optnone, as it already gets added to the function attributes (for -O0) and has an option to disable adding it (-Xclang -disable-O0-optnone) on a per-TU basis.

My question was the other way around: Do we need something like -Xclang -disable-fno-unroll-loops-metadata?

I documented for Polly how to get the IR for further processing. One way is clang -Xclang -disable-O0-optnone and another is clang -O1 -Xclang -disable-llvm-passes. Both avoid the optnone attribute, but will yield different results with -fno-unroll-loops. Which is the 'correct' way?
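
To make the difference concrete, here is a hedged sketch of a source file that could be dumped both ways. The file name example.cpp and the function sumBelow are hypothetical. The expected results in the comments are inferred from the CGOpts.OptimizationLevel > 0 check in this revision and from the RUN lines of its new test; they are not verified output.

```cpp
#include <cassert>

// Two ways of obtaining unoptimized IR without the optnone attribute:
//
//   (1) clang++ -S -emit-llvm -o - -Xclang -disable-O0-optnone \
//         -fno-unroll-loops example.cpp
//       Runs at -O0, so the patch's OptimizationLevel > 0 check fails and
//       no llvm.loop.unroll.disable metadata is expected on the loop.
//
//   (2) clang++ -S -emit-llvm -o - -O1 -Xclang -disable-llvm-passes \
//         -fno-unroll-loops example.cpp
//       Runs at -O1, so the loop's back-edge branch is expected to carry
//       !llvm.loop metadata containing llvm.loop.unroll.disable.

// Sums 0 + 1 + ... + (N - 1) with a plain do-while loop and no pragmas.
int sumBelow(int N) {
  assert(N > 0 && "the do-while body runs at least once");
  int Sum = 0;
  int I = 0;
  do {
    Sum += I;
    ++I;
  } while (I < N);
  return Sum;
}
```

If those expectations hold, IR produced by (2) would resist unrolling in a later opt run, while IR produced by (1) would not, which is the discrepancy the question is about.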

Meinersbur accepted this revision. Apr 7 2020, 4:37 AM

Closed by commit rG338be9c59527: [Clang] Add llvm.loop.unroll.disable to loops with -fno-unroll-loops. (authored by fhahn). Apr 7 2020, 6:28 AM

This revision was automatically updated to reflect the committed changes.

tejohnson mentioned this in D77989: Allow disabling of vectorization using internal options. Apr 12 2020, 7:41 PM

tejohnson mentioned this in rG33ffb62e23e7: Allow disabling of vectorization using internal options. Apr 14 2020, 6:31 PM

Revision Contents

Path                                                   Size
clang/lib/CodeGen/CGLoopInfo.h                         2 lines
clang/lib/CodeGen/CGLoopInfo.cpp                       10 lines
clang/lib/CodeGen/CGStmt.cpp                           10 lines
clang/test/CodeGenCXX/fno-unroll-loops-metadata.cpp    48 lines
clang/test/CodeGenCXX/pragma-unroll.cpp                3 lines

Diff 255662

clang/lib/CodeGen/CGLoopInfo.h

@@ (23 lines not shown) @@
 class BasicBlock;
 class Instruction;
 class MDNode;
 } // end namespace llvm

 namespace clang {
 class Attr;
 class ASTContext;
+class CodeGenOptions;
 namespace CodeGen {

 /// Attributes that may be specified on loops.
 struct LoopAttributes {
   explicit LoopAttributes(bool IsParallel = false);
   void clear();

   /// Generate llvm.loop.parallel metadata for loads and stores.
@@ (157 lines not shown) @@ public:
   /// Begin a new structured loop. The set of staged attributes will be
   /// applied to the loop and then cleared.
   void push(llvm::BasicBlock *Header, const llvm::DebugLoc &StartLoc,
             const llvm::DebugLoc &EndLoc);

   /// Begin a new structured loop. Stage attributes from the Attrs list.
   /// The staged attributes are applied to the loop and then cleared.
   void push(llvm::BasicBlock *Header, clang::ASTContext &Ctx,
+            const clang::CodeGenOptions &CGOpts,
             llvm::ArrayRef<const Attr *> Attrs, const llvm::DebugLoc &StartLoc,
             const llvm::DebugLoc &EndLoc);

   /// End the current loop.
   void pop();

   /// Return the top loop id metadata.
   llvm::MDNode *getCurLoopID() const { return getInfo().getLoopID(); }
@@ (76 lines not shown) @@

clang/lib/CodeGen/CGLoopInfo.cpp

 //===---- CGLoopInfo.cpp - LLVM CodeGen for loop metadata -*- C++ -*-------===//
 //
 // Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
 // See https://llvm.org/LICENSE.txt for license information.
 // SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
 //
 //===----------------------------------------------------------------------===//

 #include "CGLoopInfo.h"
 #include "clang/AST/ASTContext.h"
 #include "clang/AST/Attr.h"
 #include "clang/AST/Expr.h"
+#include "clang/Basic/CodeGenOptions.h"
 #include "llvm/IR/BasicBlock.h"
 #include "llvm/IR/CFG.h"
 #include "llvm/IR/Constants.h"
 #include "llvm/IR/InstrTypes.h"
 #include "llvm/IR/Instructions.h"
 #include "llvm/IR/Metadata.h"
 using namespace clang::CodeGen;
 using namespace llvm;
@@ (547 lines not shown) @@ void LoopInfoStack::push(BasicBlock *Header, const llvm::DebugLoc &StartLoc,
   Active.emplace_back(
       new LoopInfo(Header, StagedAttrs, StartLoc, EndLoc,
                    Active.empty() ? nullptr : Active.back().get()));
   // Clear the attributes so nested loops do not inherit them.
   StagedAttrs.clear();
 }

 void LoopInfoStack::push(BasicBlock *Header, clang::ASTContext &Ctx,
+                         const clang::CodeGenOptions &CGOpts,
                          ArrayRef<const clang::Attr *> Attrs,
                          const llvm::DebugLoc &StartLoc,
                          const llvm::DebugLoc &EndLoc) {

   // Identify loop hint attributes from Attrs.
   for (const auto *Attr : Attrs) {
     const LoopHintAttr *LH = dyn_cast<LoopHintAttr>(Attr);
     const OpenCLUnrollHintAttr *OpenCLHint =
@@ (164 lines not shown) @@ case LoopHintAttr::Numeric:
       case LoopHintAttr::PipelineDisabled:
         llvm_unreachable("Options cannot be assigned a value.");
         break;
       }
       break;
     }
   }

+  if (CGOpts.OptimizationLevel > 0)
+    // Disable unrolling for the loop, if unrolling is disabled (via
+    // -fno-unroll-loops) and no pragmas override the decision.
+    if (!CGOpts.UnrollLoops &&
+        (StagedAttrs.UnrollEnable == LoopAttributes::Unspecified &&
+         StagedAttrs.UnrollCount == 0))
+      setUnrollState(LoopAttributes::Disable);
+
   /// Stage the attributes.
   push(Header, StartLoc, EndLoc);
 }

 void LoopInfoStack::pop() {
   assert(!Active.empty() && "No active loops to pop");
   Active.back()->finish();
   Active.pop_back();
@@ (34 lines not shown) @@

clang/lib/CodeGen/CGStmt.cpp

@@ (722 lines not shown) @@
 void CodeGenFunction::EmitWhileStmt(const WhileStmt &S,
                                     ArrayRef<const Attr *> WhileAttrs) {
   // Emit the header for the loop, which will also become
   // the continue target.
   JumpDest LoopHeader = getJumpDestInCurrentScope("while.cond");
   EmitBlock(LoopHeader.getBlock());

   const SourceRange &R = S.getSourceRange();
-  LoopStack.push(LoopHeader.getBlock(), CGM.getContext(), WhileAttrs,
-                 SourceLocToDebugLoc(R.getBegin()),
+  LoopStack.push(LoopHeader.getBlock(), CGM.getContext(), CGM.getCodeGenOpts(),
+                 WhileAttrs, SourceLocToDebugLoc(R.getBegin()),
                  SourceLocToDebugLoc(R.getEnd()));

   // Create an exit block for when the condition fails, which will
   // also become the break target.
   JumpDest LoopExit = getJumpDestInCurrentScope("while.end");

   // Store the blocks to use for break and continue.
   BreakContinueStack.push_back(BreakContinue(LoopExit, LoopHeader));
@@ (84 lines not shown) @@ void CodeGenFunction::EmitDoStmt(const DoStmt &S,
   {
     RunCleanupsScope BodyScope(*this);
     EmitStmt(S.getBody());
   }

   EmitBlock(LoopCond.getBlock());

   const SourceRange &R = S.getSourceRange();
-  LoopStack.push(LoopBody, CGM.getContext(), DoAttrs,
+  LoopStack.push(LoopBody, CGM.getContext(), CGM.getCodeGenOpts(), DoAttrs,
                  SourceLocToDebugLoc(R.getBegin()),
                  SourceLocToDebugLoc(R.getEnd()));

   // C99 6.8.5.2: "The evaluation of the controlling expression takes place
   // after each execution of the loop body."

   // Evaluate the conditional in the while header.
   // C99 6.8.5p2/p4: The first substatement is executed if the expression
@@ (41 lines not shown) @@ void CodeGenFunction::EmitForStmt(const ForStmt &S,
   // Start the loop with a block that tests the condition.
   // If there's an increment, the continue scope will be overwritten
   // later.
   JumpDest Continue = getJumpDestInCurrentScope("for.cond");
   llvm::BasicBlock *CondBlock = Continue.getBlock();
   EmitBlock(CondBlock);

   const SourceRange &R = S.getSourceRange();
-  LoopStack.push(CondBlock, CGM.getContext(), ForAttrs,
+  LoopStack.push(CondBlock, CGM.getContext(), CGM.getCodeGenOpts(), ForAttrs,
                  SourceLocToDebugLoc(R.getBegin()),
                  SourceLocToDebugLoc(R.getEnd()));

   // If the for loop doesn't have an increment we can just use the
   // condition as the continue block. Otherwise we'll need to create
   // a block for it (in the current scope, i.e. in the scope of the
   // condition), and that we will become our continue block.
   if (S.getInc())
@@ (84 lines not shown) @@ CodeGenFunction::EmitCXXForRangeStmt(const CXXForRangeStmt &S,

   // Start the loop with a block that tests the condition.
   // If there's an increment, the continue scope will be overwritten
   // later.
   llvm::BasicBlock *CondBlock = createBasicBlock("for.cond");
   EmitBlock(CondBlock);

   const SourceRange &R = S.getSourceRange();
-  LoopStack.push(CondBlock, CGM.getContext(), ForAttrs,
+  LoopStack.push(CondBlock, CGM.getContext(), CGM.getCodeGenOpts(), ForAttrs,
                  SourceLocToDebugLoc(R.getBegin()),
                  SourceLocToDebugLoc(R.getEnd()));

   // If there are any cleanups between here and the loop-exit scope,
   // create a block to stage a loop exit along.
   llvm::BasicBlock *ExitBlock = LoopExit.getBlock();
   if (ForScope.requiresCleanups())
     ExitBlock = createBasicBlock("for.cond.cleanup");
@@ (1,477 lines not shown) @@

clang/test/CodeGenCXX/fno-unroll-loops-metadata.cpp

This file was added.

// RUN: %clang_cc1 -triple x86_64-apple-darwin -std=c++11 -emit-llvm -o - %s -O0 -disable-llvm-optzns -fno-unroll-loops | FileCheck --check-prefix=NO_UNROLL_MD %s
// RUN: %clang_cc1 -triple x86_64-apple-darwin -std=c++11 -emit-llvm -o - %s -O1 -disable-llvm-optzns -fno-unroll-loops | FileCheck --check-prefix=UNROLL_DISABLED_MD %s
// RUN: %clang_cc1 -triple x86_64-apple-darwin -std=c++11 -emit-llvm -o - %s -O2 -disable-llvm-optzns -fno-unroll-loops | FileCheck --check-prefix=UNROLL_DISABLED_MD %s
// RUN: %clang_cc1 -triple x86_64-apple-darwin -std=c++11 -emit-llvm -o - %s -O3 -disable-llvm-optzns -fno-unroll-loops | FileCheck --check-prefix=UNROLL_DISABLED_MD %s
// RUN: %clang_cc1 -triple x86_64-apple-darwin -std=c++11 -emit-llvm -o - %s -O3 -disable-llvm-optzns | FileCheck --check-prefix=NO_UNROLL_MD %s

// NO_UNROLL_MD-NOT: llvm.loop

// Verify unroll.disable metadata is added to while loop with -fno-unroll-loops
// and optlevel > 0.
void while_test(int *List, int Length) {
  // UNROLL_DISABLED_MD: define {{.*}} @_Z10while_test
  int i = 0;

  while (i < Length) {
    // UNROLL_DISABLED_MD: br label {{.*}}, !llvm.loop ![[LOOP_1:.*]]
    List[i] = i * 2;
    i++;
  }
}

// Verify unroll.disable metadata is added to do-while loop with
// -fno-unroll-loops and optlevel > 0.
void do_test(int *List, int Length) {
  // UNROLL_DISABLED_MD: define {{.*}} @_Z7do_test
  int i = 0;

  do {
    // UNROLL_DISABLED_MD: br i1 {{.*}}, label {{.*}}, label {{.*}}, !llvm.loop ![[LOOP_2:.*]]
    List[i] = i * 2;
    i++;
  } while (i < Length);
}

// Verify unroll.disable metadata is added to while loop with -fno-unroll-loops
// and optlevel > 0.
void for_test(int *List, int Length) {
  // UNROLL_DISABLED_MD: define {{.*}} @_Z8for_test
  for (int i = 0; i < Length; i++) {
    // UNROLL_DISABLED_MD: br label {{.*}}, !llvm.loop ![[LOOP_3:.*]]
    List[i] = i * 2;
  }
}

// UNROLL_DISABLED_MD: ![[LOOP_1]] = distinct !{![[LOOP_1]], ![[UNROLL_DISABLE:.*]]}
// UNROLL_DISABLED_MD: ![[UNROLL_DISABLE]] = !{!"llvm.loop.unroll.disable"}
// UNROLL_DISABLED_MD: ![[LOOP_2]] = distinct !{![[LOOP_2:.*]], ![[UNROLL_DISABLE:.*]]}
// UNROLL_DISABLED_MD: ![[LOOP_3]] = distinct !{![[LOOP_3]], ![[UNROLL_DISABLE:.*]]}

clang/test/CodeGenCXX/pragma-unroll.cpp

 // RUN: %clang_cc1 -triple x86_64-apple-darwin -std=c++11 -emit-llvm -o - %s | FileCheck %s

+// Check that passing -fno-unroll-loops does not impact the decision made using pragmas.
+// RUN: %clang_cc1 -triple x86_64-apple-darwin -std=c++11 -emit-llvm -o - -O1 -disable-llvm-optzns -fno-unroll-loops %s | FileCheck %s
+
 // Verify while loop is recognized after unroll pragma.
 void while_test(int *List, int Length) {
   // CHECK: define {{.*}} @_Z10while_test
   int i = 0;

 #pragma unroll
   while (i < Length) {
     // CHECK: br label {{.*}}, !llvm.loop ![[LOOP_1:.*]]
@@ (96 lines not shown) @@