This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/Transforms/Vectorize/
-
Transforms/
-
Vectorize/
2
LoopVectorize.cpp
1
VPlan.h
3/7
VPlan.cpp
-
test/Transforms/LoopVectorize/
-
Transforms/
-
LoopVectorize/
2/3
icmp-uniforms.ll
1
vplan-printing.ll
-
unittests/Transforms/Vectorize/
-
Transforms/
-
Vectorize/
-
VPlanHCFGTest.cpp
-
VPlanTest.cpp

Differential D96628

[VPlan] Add plain text (not DOT's digraph) dumps
ClosedPublic

Authored by a.elovikov on Feb 12 2021, 12:42 PM.

Download Raw Diff

Details

Reviewers

fhahn
gilr
Ayal

Commits

rG93a9d2de8f4f: [VPlan] Add plain text (not DOT's digraph) dumps
rG6b053c9867a3: [VPlan] Add plain text (not DOT's digraph) dumps

Summary

I foresee two uses for this:

It's easier to use those in debugger.
Once we start implementing more VPlan-to-VPlan transformations (especially inner loop massaging stuff), using the vectorized LLVM IR as CHECK targets in LIT test would become too obscure. I can imagine that we'd want to CHECK against VPlan dumps after multiple transformations instead. That would be easier with plain text dumps than with DOT format.

Diff Detail

Repository: rG LLVM Github Monorepo

Unit TestsFailed

	Time	Test
	290 ms	x64 debian > MemProfiler-x86_64-linux-dynamic.TestCases::test_malloc_load_store.c
	450 ms	x64 debian > MemProfiler-x86_64-linux.TestCases::test_malloc_load_store.c

Event Timeline

a.elovikov created this revision.Feb 12 2021, 12:42 PM

Herald added subscribers: tschuett, psnobl, rogfer01 and 2 others. · View Herald TranscriptFeb 12 2021, 12:42 PM

a.elovikov requested review of this revision.Feb 12 2021, 12:42 PM

Herald added a project: Restricted Project. · View Herald TranscriptFeb 12 2021, 12:42 PM

Herald added subscribers: llvm-commits, vkmr. · View Herald Transcript

Harbormaster completed remote builds in B89053: Diff 323452.Feb 12 2021, 3:51 PM

Ping

Thanks for working on this! Could you split off the changes not directly related to printing in non-DOT mode (like respecting Indent in the individual print() implementations)? Then we can focus the review here on the details how to move to support non-DOT printing, which is very valuable IMO and should probably become the default in the long run.

llvm/lib/Transforms/Vectorize/VPlan.cpp
907	Could you split off the changes not directly related to printing in non-DOT mode (like respecting `Indent` in the individual print() implementations)?

Rebased on top of D97787.

Harbormaster completed remote builds in B91621: Diff 327519.Mar 2 2021, 10:56 AM

a.elovikov added a parent revision: D97787: [NFCI][VPlan] Modify Recipes' print methods to honor Indent parameter.Mar 2 2021, 10:56 AM

Ping.

fhahn added inline comments.Mar 9 2021, 1:39 PM

llvm/lib/Transforms/Vectorize/VPlan.cpp
53	I think having an option to just print the plans make sense, but as a first step, should we start with an option to just toggle between dot printing and regular printing for when using `-debug`?
849	Could you add a comment explaining that we first print the block and then split up/reconstruct the output with the .dot syntax>

fhahn added inline comments.Mar 9 2021, 1:40 PM

llvm/test/Transforms/LoopVectorize/icmp-uniforms.ll
40	Did the value here change because the plan gets printed later?

a.elovikov added inline comments.Mar 9 2021, 2:02 PM

llvm/lib/Transforms/Vectorize/VPlan.cpp
53	I preferred dedicated option for several reasons: Toggling is harder to implement, especially when caring about consistent interfaces. For example, I think that `VPBasicBlock`\|`VPValue`\|`VPDef`::`print` should be dedicated for text prints only (and `VPBasicBlock` can't even be printed as a digraph before this patch), so the only logical place for triggering would be `operator<<(ostream&)`. On the other hand, there is no guarantees that none of the LLVM_DEBUG prints call the print method directly (and not via stream operator). I personally don't like using -debug/-debug-only for LIT testing purposes. Having a more limited/specific output is, IMO, preferable. I'm hoping that option(s) to print VPlan at a given place in the pipeline would be convenient tools for writing LIT tests. As such, I think having this option from the very first plain dump patch makes sense. If going through the toggling approach, do you expect existing LIT tests (non-unittest) to continue using digraph output or be switched to plain dumps in the first patch? If the former, what would be your suggestion to test the plain dumps?
849	Sure, will update in the next patch set.
llvm/test/Transforms/LoopVectorize/icmp-uniforms.ll
40	I didn't study in details, but I'd expect it to be so. Is that something expected for you, or do you want me to study that change in details (my knowledge of the VPlan pipeline is still limited)?

Improve comments.
Don't add a new dump point via cl::opt. Instead, add cl::opt to toggle behavior (dot/plain dumps) for the LVP::printPlans method.

Herald added a subscriber: bmahjour. · View Herald TranscriptMar 11 2021, 12:36 PM

a.elovikov added inline comments.Mar 11 2021, 12:36 PM

llvm/test/Transforms/LoopVectorize/icmp-uniforms.ll
40	Yes, it gets changed in the middle of InnerLoopVectorizer::createVectorizedLoopSkeleton/InnerLoopVectorizer::createInductionResumeValues. We seem to be using original LLVM IR value through `VPlanIngredient` when printing `VPWidenIntOrFpInductionRecipe`, so modifying original loop when creating the skeleton changes printing.

Harbormaster completed remote builds in B93358: Diff 330054.Mar 11 2021, 6:32 PM

Thanks for the latest updates!

I just have a few remaining comments. After those, I think this is good to go. If there are any concerns with the direction by others, it would be great to hear them soon.

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
7810	I think it would make sense to flip things here, as in have the operator for `raw_ostream` use the non-dot style (because that's what most useful in debuggers too) and then have a `Plan::printWithDot()` or soemthing for the DOT logic.
llvm/lib/Transforms/Vectorize/VPlan.cpp
393	I think it would be good to separate the block names by a comma or something like that. You could either use `join_items` or `ListSeparator` from `StringExtras.h`
llvm/lib/Transforms/Vectorize/VPlan.h
1856	nit: comment, something like `/// Print the plan to \p O.`. Some for other instances.
llvm/test/Transforms/LoopVectorize/vplan-printing.ll
3	I think we should have at least a LIT test that also checks `-vplan-print-in-dot-format=true`. Perhaps this file would a good candidate to do so?

Address review comments.

llvm/lib/Transforms/Vectorize/VPlan.cpp
393	`ListSeparator` is great! Wasn't aware of it before.

Harbormaster completed remote builds in B93574: Diff 330333.Mar 12 2021, 2:10 PM

LGTM, thanks! Please wait a day or 2 with committing, in case there are any remaining concerns with the direction. But I think it is a nice improvement to the user experience.

llvm/test/Transforms/LoopVectorize/vplan-dot-printing.ll
3 ↗	(On Diff #330333)	`-prefer-inloop-reductions` should not be needed
26 ↗	(On Diff #330333)	Nit: it would be good to at least one additional BB in the loop, to make sure the edges between blocks are still printed.

This revision is now accepted and ready to land.Mar 14 2021, 2:52 PM

a.elovikov added inline comments.Mar 14 2021, 7:29 PM

llvm/test/Transforms/LoopVectorize/vplan-dot-printing.ll
26 ↗	(On Diff #330333)	This functionality is being tested in unittests already, so I'm only using this file to target the toggle switch, not the digraph dump by itself.

This revision was landed with ongoing or failed builds.Mar 18 2021, 11:46 AM

Closed by commit rG6b053c9867a3: [VPlan] Add plain text (not DOT's digraph) dumps (authored by a.elovikov). · Explain Why

This revision was automatically updated to reflect the committed changes.

a.elovikov added a commit: rG6b053c9867a3: [VPlan] Add plain text (not DOT's digraph) dumps.

mehdi_amini added a reverting change: rG3614df3537f9: Revert "[VPlan] Add plain text (not DOT's digraph) dumps".Mar 18 2021, 12:21 PM

Push a revert, it seems like it broke the link of clang itself:

FAILED: bin/clang-13
: && /usr/bin/clang++-8 -fPIC -fvisibility-inlines-hidden -Werror=date-time -Werror=unguarded-availability-new -Wall -Wextra -Wno-unused-parameter -Wwrite-strings -Wcast-qual -Wmissing-field-initializers -pedantic -Wno-long-long -Wimplicit-fallthrough -Wcovered-switch-default -Wno-noexcept-type -Wnon-virtual-dtor -Wdelete-non-virtual-dtor -Wstring-conversion -fdiagnostics-color -ffunction-sections -fdata-sections -fno-common -Woverloaded-virtual -Wno-nested-anon-types -O3 -DNDEBUG -fuse-ld=lld -Wl,--color-diagnostics   -Wl,--export-dynamic  -Wl,-O3 tools/clang/tools/driver/CMakeFiles/clang.dir/driver.cpp.o tools/clang/tools/driver/CMakeFiles/clang.dir/cc1_main.cpp.o tools/clang/tools/driver/CMakeFiles/clang.dir/cc1as_main.cpp.o tools/clang/tools/driver/CMakeFiles/clang.dir/cc1gen_reproducer_main.cpp.o -o bin/clang-13  -Wl,-rpath,"\$ORIGIN/../lib"  lib/libLLVMX86CodeGen.a  lib/libLLVMX86AsmParser.a  lib/libLLVMX86Desc.a  lib/libLLVMX86Disassembler.a  lib/libLLVMX86Info.a  lib/libLLVMNVPTXCodeGen.a  lib/libLLVMNVPTXDesc.a  lib/libLLVMNVPTXInfo.a  lib/libLLVMAMDGPUCodeGen.a  lib/libLLVMAMDGPUAsmParser.a  lib/libLLVMAMDGPUDesc.a  lib/libLLVMAMDGPUDisassembler.a  lib/libLLVMAMDGPUInfo.a  lib/libLLVMAMDGPUUtils.a  lib/libLLVMAnalysis.a  lib/libLLVMCodeGen.a  lib/libLLVMCore.a  lib/libLLVMipo.a  lib/libLLVMAggressiveInstCombine.a  lib/libLLVMInstCombine.a  lib/libLLVMInstrumentation.a  lib/libLLVMMC.a  lib/libLLVMMCParser.a  lib/libLLVMObjCARCOpts.a  lib/libLLVMOption.a  lib/libLLVMScalarOpts.a  lib/libLLVMSupport.a  lib/libLLVMTransformUtils.a  lib/libLLVMVectorize.a  -lpthread  lib/libclangBasic.a  lib/libclangCodeGen.a  lib/libclangDriver.a  lib/libclangFrontend.a  lib/libclangFrontendTool.a  lib/libclangSerialization.a  lib/libLLVMCFGuard.a  lib/libLLVMAsmPrinter.a  lib/libLLVMDebugInfoDWARF.a  lib/libLLVMDebugInfoMSF.a  lib/libLLVMGlobalISel.a  lib/libLLVMSelectionDAG.a  lib/libLLVMMIRParser.a  lib/libLLVMAMDGPUDesc.a  lib/libLLVMAMDGPUInfo.a  lib/libLLVMAMDGPUUtils.a  lib/libLLVMMCDisassembler.a  lib/libclangCodeGen.a  lib/libLLVMCoverage.a  lib/libLLVMLTO.a  lib/libLLVMCodeGen.a  lib/libLLVMPasses.a  lib/libLLVMObjCARCOpts.a  lib/libLLVMTarget.a  lib/libLLVMCoroutines.a  lib/libLLVMipo.a  lib/libLLVMInstrumentation.a  lib/libLLVMVectorize.a  lib/libLLVMIRReader.a  lib/libLLVMAsmParser.a  lib/libLLVMScalarOpts.a  lib/libLLVMAggressiveInstCombine.a  lib/libLLVMInstCombine.a  lib/libLLVMBitWriter.a  lib/libLLVMLinker.a  lib/libLLVMExtensions.a  lib/libclangRewriteFrontend.a  lib/libclangARCMigrate.a  lib/libclangStaticAnalyzerFrontend.a  lib/libclangStaticAnalyzerCheckers.a  lib/libclangStaticAnalyzerCore.a  lib/libclangCrossTU.a  lib/libclangIndex.a  lib/libclangFrontend.a  lib/libclangDriver.a  lib/libLLVMOption.a  lib/libclangParse.a  lib/libclangSerialization.a  lib/libclangSema.a  lib/libclangAnalysis.a  lib/libclangASTMatchers.a  lib/libclangEdit.a  lib/libclangAST.a  lib/libLLVMFrontendOpenMP.a  lib/libLLVMTransformUtils.a  lib/libLLVMAnalysis.a  lib/libLLVMProfileData.a  lib/libLLVMObject.a  lib/libLLVMMCParser.a  lib/libLLVMBitReader.a  lib/libLLVMTextAPI.a  lib/libclangFormat.a  lib/libclangToolingInclusions.a  lib/libclangToolingCore.a  lib/libclangRewrite.a  lib/libclangLex.a  lib/libclangBasic.a  lib/libLLVMCore.a  lib/libLLVMRemarks.a  lib/libLLVMBitstreamReader.a  lib/libLLVMMC.a  lib/libLLVMBinaryFormat.a  lib/libLLVMDebugInfoCodeView.a  lib/libLLVMSupport.a  -lrt  -ldl  -lpthread  -lm  /usr/lib/x86_64-linux-gnu/libz.so  /usr/lib/x86_64-linux-gnu/libtinfo.so  lib/libLLVMDemangle.a && :
ld.lld: error: undefined symbol: llvm::VPlan::printDOT(llvm::raw_ostream&) const
>>> referenced by LoopVectorize.cpp
>>>               LoopVectorize.cpp.o:(llvm::LoopVectorizationPlanner::printPlans(llvm::raw_ostream&)) in archive lib/libLLVMVectorize.a
 
ld.lld: error: undefined symbol: llvm::VPlan::print(llvm::raw_ostream&) const
>>> referenced by LoopVectorize.cpp
>>>               LoopVectorize.cpp.o:(llvm::LoopVectorizationPlanner::printPlans(llvm::raw_ostream&)) in archive lib/libLLVMVectorize.a
clang: error: linker command failed with exit code 1 (use -v to see invocation)

FYI: this seems to break the Clang target when building MLIR: https://buildkite.com/mlir/mlir-core/builds/12351#2ce52a15-5694-4dac-b661-bf77b3348461

Reading the logic a bit more, it looks to me that we need to put #if !defined(NDEBUG) || defined(LLVM_ENABLE_DUMP) inside LoopVectorizationPlanner::printPlans.

Sorry for the troubles and thanks for reverting that for me - I didn't receive notifications from buildbots for it. Yes, it's probably related to those #ifs. I'm starting a Release build to reproduce and create a fix.

mehdi_amini added inline comments.Mar 18 2021, 12:40 PM

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
7813	I think this ought to be guarded by `#if !defined(NDEBUG) \|\| defined(LLVM_ENABLE_DUMP)` as well

It seems the code wasn't properly guarded before this change, and the proper fix would require changing too many places. I'd prefer to do it in separate patch.

I see two options:

"Unguard" the print routines here and commit, work on the cleanup as the next patch.
Delay this patch, develop cleanup first, rebase/reland this patch once cleanup is committed. Note that this patch doesn't make things worse - we had all operator<< accessible in release builds before it.

I'd prefer to go with the first one, but I can understand if some people would prefer two. What do you think?

In D96628#2635549, @a.elovikov wrote:

I'd prefer to go with the first one, but I can understand if some people would prefer two. What do you think?

Whichever order works for you seems reasonable to me.

a.elovikov reopened this revision.Mar 18 2021, 1:30 PM

This revision is now accepted and ready to land.Mar 18 2021, 1:30 PM

a.elovikov updated this revision to Diff 331673.Mar 18 2021, 1:30 PM

a.elovikov added a child revision: D98897: [NFC][VPlan] Guard print routines with "#if !defined(NDEBUG) || defined(LLVM_ENABLE_DUMP)".Mar 18 2021, 2:12 PM

Harbormaster completed remote builds in B94543: Diff 331673.Mar 18 2021, 3:12 PM

This revision was landed with ongoing or failed builds.Mar 19 2021, 10:50 AM

Closed by commit rG93a9d2de8f4f: [VPlan] Add plain text (not DOT's digraph) dumps (authored by a.elovikov). · Explain Why

This revision was automatically updated to reflect the committed changes.

a.elovikov added a commit: rG93a9d2de8f4f: [VPlan] Add plain text (not DOT's digraph) dumps.

bmahjour removed a subscriber: bmahjour.Mar 19 2021, 5:43 PM

Revision Contents

Path

Size

llvm/

lib/

Transforms/

Vectorize/

LoopVectorize.cpp

4 lines

VPlan.h

27 lines

VPlan.cpp

151 lines

test/

Transforms/

LoopVectorize/

icmp-uniforms.ll

18 lines

vplan-printing.ll

131 lines

unittests/

Transforms/

Vectorize/

VPlanHCFGTest.cpp

27 lines

VPlanTest.cpp

41 lines

Diff 323452

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

//===- LoopVectorize.cpp - A Loop Vectorizer ------------------------------===//		//===- LoopVectorize.cpp - A Loop Vectorizer ------------------------------===//
		Lint: Lint Inline Actions clang-format not found in user's PATH; not linting file. Lint: Lint: clang-format not found in user's PATH; not linting file.
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
▲ Show 20 Lines • Show All 7,793 Lines • ▼ Show 20 Lines	void LoopVectorizationPlanner::executePlan(InnerLoopVectorizer &ILV,
ILV.fixVectorizedLoop(State);		ILV.fixVectorizedLoop(State);

ILV.printDebugTracesAtEnd();		ILV.printDebugTracesAtEnd();
}		}

void LoopVectorizationPlanner::collectTriviallyDeadInstructions(		void LoopVectorizationPlanner::collectTriviallyDeadInstructions(
SmallPtrSetImpl<Instruction *> &DeadInstructions) {		SmallPtrSetImpl<Instruction *> &DeadInstructions) {

// We create new control-flow for the vectorized loop, so the original exit		// We create new control-flow for the vectorized loop, so the original exit
		fhahnUnsubmitted Not Done Reply Inline Actions I think it would make sense to flip things here, as in have the operator for `raw_ostream` use the non-dot style (because that's what most useful in debuggers too) and then have a `Plan::printWithDot()` or soemthing for the DOT logic. fhahn: I think it would make sense to flip things here, as in have the operator for `raw_ostream` use…
// conditions will be dead after vectorization if it's only used by the		// conditions will be dead after vectorization if it's only used by the
// terminator		// terminator
SmallVector<BasicBlock*> ExitingBlocks;		SmallVector<BasicBlock*> ExitingBlocks;
		mehdi_aminiUnsubmitted Not Done Reply Inline Actions I think this ought to be guarded by `#if !defined(NDEBUG) \|\| defined(LLVM_ENABLE_DUMP)` as well mehdi_amini: I think this ought to be guarded by `#if !defined(NDEBUG) \|\| defined(LLVM_ENABLE_DUMP)` as well
OrigLoop->getExitingBlocks(ExitingBlocks);		OrigLoop->getExitingBlocks(ExitingBlocks);
for (auto *BB : ExitingBlocks) {		for (auto *BB : ExitingBlocks) {
auto *Cmp = dyn_cast<Instruction>(BB->getTerminator()->getOperand(0));		auto *Cmp = dyn_cast<Instruction>(BB->getTerminator()->getOperand(0));
if (!Cmp \|\| !Cmp->hasOneUse())		if (!Cmp \|\| !Cmp->hasOneUse())
continue;		continue;

// TODO: we should introduce a getUniqueExitingBlocks on Loop		// TODO: we should introduce a getUniqueExitingBlocks on Loop
if (!DeadInstructions.insert(Cmp).second)		if (!DeadInstructions.insert(Cmp).second)
▲ Show 20 Lines • Show All 1,168 Lines • ▼ Show 20 Lines

Value *LoopVectorizationPlanner::VPCallbackILV::getOrCreateScalarValue(		Value *LoopVectorizationPlanner::VPCallbackILV::getOrCreateScalarValue(
Value *V, const VPIteration &Instance) {		Value *V, const VPIteration &Instance) {
return ILV.getOrCreateScalarValue(V, Instance);		return ILV.getOrCreateScalarValue(V, Instance);
}		}

void VPInterleaveRecipe::print(raw_ostream &O, const Twine &Indent,		void VPInterleaveRecipe::print(raw_ostream &O, const Twine &Indent,
VPSlotTracker &SlotTracker) const {		VPSlotTracker &SlotTracker) const {
O << "\"INTERLEAVE-GROUP with factor " << IG->getFactor() << " at ";		O << Indent << "INTERLEAVE-GROUP with factor " << IG->getFactor() << " at ";
IG->getInsertPos()->printAsOperand(O, false);		IG->getInsertPos()->printAsOperand(O, false);
O << ", ";		O << ", ";
getAddr()->printAsOperand(O, SlotTracker);		getAddr()->printAsOperand(O, SlotTracker);
VPValue *Mask = getMask();		VPValue *Mask = getMask();
if (Mask) {		if (Mask) {
O << ", ";		O << ", ";
Mask->printAsOperand(O, SlotTracker);		Mask->printAsOperand(O, SlotTracker);
}		}
for (unsigned i = 0; i < IG->getFactor(); ++i)		for (unsigned i = 0; i < IG->getFactor(); ++i)
if (Instruction *I = IG->getMember(i))		if (Instruction *I = IG->getMember(i))
O << "\\l\" +\n" << Indent << "\" " << VPlanIngredient(I) << " " << i;		O << "\n" << Indent << " " << VPlanIngredient(I) << " " << i;
}		}

void VPWidenCallRecipe::execute(VPTransformState &State) {		void VPWidenCallRecipe::execute(VPTransformState &State) {
State.ILV->widenCallInstruction(*cast<CallInst>(getUnderlyingInstr()), this,		State.ILV->widenCallInstruction(*cast<CallInst>(getUnderlyingInstr()), this,
*this, State);		*this, State);
}		}

void VPWidenSelectRecipe::execute(VPTransformState &State) {		void VPWidenSelectRecipe::execute(VPTransformState &State) {
▲ Show 20 Lines • Show All 832 Lines • Show Last 20 Lines

llvm/lib/Transforms/Vectorize/VPlan.h

//===- VPlan.h - Represent A Vectorizer Plan --------------------- C++ --===//		//===- VPlan.h - Represent A Vectorizer Plan --------------------- C++ --===//
		Lint: Lint Inline Actions clang-format not found in user's PATH; not linting file. Lint: Lint: clang-format not found in user's PATH; not linting file.
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
▲ Show 20 Lines • Show All 621 Lines • ▼ Show 20 Lines	public:

/// Delete all blocks reachable from a given VPBlockBase, inclusive.		/// Delete all blocks reachable from a given VPBlockBase, inclusive.
static void deleteCFG(VPBlockBase *Entry);		static void deleteCFG(VPBlockBase *Entry);

void printAsOperand(raw_ostream &OS, bool PrintType) const {		void printAsOperand(raw_ostream &OS, bool PrintType) const {
OS << getName();		OS << getName();
}		}

void print(raw_ostream &OS) const {
// TODO: Only printing VPBB name for now since we only have dot printing
// support for VPInstructions/Recipes.
printAsOperand(OS, false);
}

/// Return true if it is legal to hoist instructions into this block.		/// Return true if it is legal to hoist instructions into this block.
bool isLegalToHoistInto() {		bool isLegalToHoistInto() {
// There are currently no constraints that prevent an instruction to be		// There are currently no constraints that prevent an instruction to be
// hoisted into a VPBlockBase.		// hoisted into a VPBlockBase.
return true;		return true;
}		}

/// Replace all operands of VPUsers in the block with \p NewValue and also		/// Replace all operands of VPUsers in the block with \p NewValue and also
/// replaces all uses of VPValues defined in the block with NewValue.		/// replaces all uses of VPValues defined in the block with NewValue.
virtual void dropAllReferences(VPValue *NewValue) = 0;		virtual void dropAllReferences(VPValue *NewValue) = 0;

		virtual void print(raw_ostream &O, const Twine &Indent,
		VPSlotTracker &SlotTracker) const = 0;
		void print(raw_ostream &O) const {
		VPSlotTracker SlotTracker(getPlan());
		print(O, "", SlotTracker);
		}
		void dump() const { print(dbgs()); }
};		};

/// VPRecipeBase is a base class modeling a sequence of one or more output IR		/// VPRecipeBase is a base class modeling a sequence of one or more output IR
/// instructions. VPRecipeBase owns the the VPValues it defines through VPDef		/// instructions. VPRecipeBase owns the the VPValues it defines through VPDef
/// and is responsible for deleting its defined values. Single-value		/// and is responsible for deleting its defined values. Single-value
/// VPRecipeBases that also inherit from VPValue must make sure to inherit from		/// VPRecipeBases that also inherit from VPValue must make sure to inherit from
/// VPRecipeBase before VPValue.		/// VPRecipeBase before VPValue.
class VPRecipeBase : public ilist_node_with_parent<VPRecipeBase, VPBasicBlock>,		class VPRecipeBase : public ilist_node_with_parent<VPRecipeBase, VPBasicBlock>,
▲ Show 20 Lines • Show All 616 Lines • ▼ Show 20 Lines	public:

/// Generate the extraction of the appropriate bit from the block mask and the		/// Generate the extraction of the appropriate bit from the block mask and the
/// conditional branch.		/// conditional branch.
void execute(VPTransformState &State) override;		void execute(VPTransformState &State) override;

/// Print the recipe.		/// Print the recipe.
void print(raw_ostream &O, const Twine &Indent,		void print(raw_ostream &O, const Twine &Indent,
VPSlotTracker &SlotTracker) const override {		VPSlotTracker &SlotTracker) const override {
O << " +\n" << Indent << "\"BRANCH-ON-MASK ";		O << Indent << "BRANCH-ON-MASK ";
if (VPValue *Mask = getMask())		if (VPValue *Mask = getMask())
Mask->printAsOperand(O, SlotTracker);		Mask->printAsOperand(O, SlotTracker);
else		else
O << " All-One";		O << " All-One";
O << "\\l\"";
}		}

/// Return the mask used by this recipe. Note that a full mask is represented		/// Return the mask used by this recipe. Note that a full mask is represented
/// by a nullptr.		/// by a nullptr.
VPValue *getMask() const {		VPValue *getMask() const {
assert(getNumOperands() <= 1 && "should have either 0 or 1 operands");		assert(getNumOperands() <= 1 && "should have either 0 or 1 operands");
// Mask is optional.		// Mask is optional.
return getNumOperands() == 1 ? getOperand(0) : nullptr;		return getNumOperands() == 1 ? getOperand(0) : nullptr;
▲ Show 20 Lines • Show All 196 Lines • ▼ Show 20 Lines	public:
/// this VPBasicBlock, thereby "executing" the VPlan.		/// this VPBasicBlock, thereby "executing" the VPlan.
void execute(struct VPTransformState *State) override;		void execute(struct VPTransformState *State) override;

/// Return the position of the first non-phi node recipe in the block.		/// Return the position of the first non-phi node recipe in the block.
iterator getFirstNonPhi();		iterator getFirstNonPhi();

void dropAllReferences(VPValue *NewValue) override;		void dropAllReferences(VPValue *NewValue) override;

		void print(raw_ostream &O, const Twine &Indent,
		VPSlotTracker &SlotTracker) const override;
		using VPBlockBase::print; // Get the print(raw_stream &O) version.

private:		private:
/// Create an IR BasicBlock to hold the output instructions generated by this		/// Create an IR BasicBlock to hold the output instructions generated by this
/// VPBasicBlock, and return it. Update the CFGState accordingly.		/// VPBasicBlock, and return it. Update the CFGState accordingly.
BasicBlock *createEmptyBasicBlock(VPTransformState::CFGState &CFG);		BasicBlock *createEmptyBasicBlock(VPTransformState::CFGState &CFG);
};		};

/// VPRegionBlock represents a collection of VPBasicBlocks and VPRegionBlocks		/// VPRegionBlock represents a collection of VPBasicBlocks and VPRegionBlocks
/// which form a Single-Entry-Single-Exit subgraph of the output IR CFG.		/// which form a Single-Entry-Single-Exit subgraph of the output IR CFG.
▲ Show 20 Lines • Show All 75 Lines • ▼ Show 20 Lines	public:
/// instances of output IR corresponding to its VPBlockBases.		/// instances of output IR corresponding to its VPBlockBases.
bool isReplicator() const { return IsReplicator; }		bool isReplicator() const { return IsReplicator; }

/// The method which generates the output IR instructions that correspond to		/// The method which generates the output IR instructions that correspond to
/// this VPRegionBlock, thereby "executing" the VPlan.		/// this VPRegionBlock, thereby "executing" the VPlan.
void execute(struct VPTransformState *State) override;		void execute(struct VPTransformState *State) override;

void dropAllReferences(VPValue *NewValue) override;		void dropAllReferences(VPValue *NewValue) override;

		void print(raw_ostream &O, const Twine &Indent,
		VPSlotTracker &SlotTracker) const override;
		using VPBlockBase::print; // Get the print(raw_stream &O) version.
};		};

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// GraphTraits specializations for VPlan Hierarchical Control-Flow Graphs //		// GraphTraits specializations for VPlan Hierarchical Control-Flow Graphs //
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

// The following set of template specializations implement GraphTraits to treat		// The following set of template specializations implement GraphTraits to treat
// any VPBlockBase as a node in a graph of VPBlockBases. It's important to note		// any VPBlockBase as a node in a graph of VPBlockBases. It's important to note
▲ Show 20 Lines • Show All 236 Lines • ▼ Show 20 Lines	public:
}		}

void removeVPValueFor(Value *V) { Value2VPValue.erase(V); }		void removeVPValueFor(Value *V) { Value2VPValue.erase(V); }

/// Return the VPLoopInfo analysis for this VPlan.		/// Return the VPLoopInfo analysis for this VPlan.
VPLoopInfo &getVPLoopInfo() { return VPLInfo; }		VPLoopInfo &getVPLoopInfo() { return VPLInfo; }
const VPLoopInfo &getVPLoopInfo() const { return VPLInfo; }		const VPLoopInfo &getVPLoopInfo() const { return VPLInfo; }

		void print(raw_ostream &O) const;
		fhahnUnsubmitted Not Done Reply Inline Actions nit: comment, something like `/// Print the plan to \p O.`. Some for other instances. fhahn: nit: comment, something like `/// Print the plan to \p O.`. Some for other instances.

/// Dump the plan to stderr (for debugging).		/// Dump the plan to stderr (for debugging).
void dump() const;		void dump() const;

/// Returns a range mapping the values the range \p Operands to their		/// Returns a range mapping the values the range \p Operands to their
/// corresponding VPValues.		/// corresponding VPValues.
iterator_range<mapped_iterator<Use , std::function<VPValue (Value *)>>>		iterator_range<mapped_iterator<Use , std::function<VPValue (Value *)>>>
mapToVPValues(User::op_range Operands) {		mapToVPValues(User::op_range Operands) {
std::function<VPValue (Value )> Fn = [this](Value *Op) {		std::function<VPValue (Value )> Fn = [this](Value *Op) {
▲ Show 20 Lines • Show All 320 Lines • Show Last 20 Lines

llvm/lib/Transforms/Vectorize/VPlan.cpp

//===- VPlan.cpp - Vectorizer Plan ----------------------------------------===//		//===- VPlan.cpp - Vectorizer Plan ----------------------------------------===//
		Lint: Lint Inline Actions clang-format not found in user's PATH; not linting file. Lint: Lint: clang-format not found in user's PATH; not linting file.
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
///		///
Show All 34 Lines
#include <cassert>		#include <cassert>
#include <iterator>		#include <iterator>
#include <string>		#include <string>
#include <vector>		#include <vector>

using namespace llvm;		using namespace llvm;
extern cl::opt<bool> EnableVPlanNativePath;		extern cl::opt<bool> EnableVPlanNativePath;

		#if !defined(NDEBUG) \|\| defined(LLVM_ENABLE_DUMP)
		cl::opt<bool>
		PrintExecutedVPlan("print-executed-vplan", cl::init(false), cl::Hidden,
		fhahnUnsubmitted Not Done Reply Inline Actions I think having an option to just print the plans make sense, but as a first step, should we start with an option to just toggle between dot printing and regular printing for when using `-debug`? fhahn: I think having an option to just print the plans make sense, but as a first step, should we…
		a.elovikovAuthorUnsubmitted Done Reply Inline Actions I preferred dedicated option for several reasons: Toggling is harder to implement, especially when caring about consistent interfaces. For example, I think that `VPBasicBlock`\|`VPValue`\|`VPDef`::`print` should be dedicated for text prints only (and `VPBasicBlock` can't even be printed as a digraph before this patch), so the only logical place for triggering would be `operator<<(ostream&)`. On the other hand, there is no guarantees that none of the LLVM_DEBUG prints call the print method directly (and not via stream operator). I personally don't like using -debug/-debug-only for LIT testing purposes. Having a more limited/specific output is, IMO, preferable. I'm hoping that option(s) to print VPlan at a given place in the pipeline would be convenient tools for writing LIT tests. As such, I think having this option from the very first plain dump patch makes sense. If going through the toggling approach, do you expect existing LIT tests (non-unittest) to continue using digraph output or be switched to plain dumps in the first patch? If the former, what would be your suggestion to test the plain dumps? a.elovikov: I preferred dedicated option for several reasons: 1) Toggling is harder to implement…
		cl::desc("Dump the VPlan that is being executed to "
		"stdout. For LIT testing purposes."));
		#endif

#define DEBUG_TYPE "vplan"		#define DEBUG_TYPE "vplan"

raw_ostream &llvm::operator<<(raw_ostream &OS, const VPValue &V) {		raw_ostream &llvm::operator<<(raw_ostream &OS, const VPValue &V) {
const VPInstruction *Instr = dyn_cast<VPInstruction>(&V);		const VPInstruction *Instr = dyn_cast<VPInstruction>(&V);
VPSlotTracker SlotTracker(		VPSlotTracker SlotTracker(
(Instr && Instr->getParent()) ? Instr->getParent()->getPlan() : nullptr);		(Instr && Instr->getParent()) ? Instr->getParent()->getPlan() : nullptr);
V.print(OS, SlotTracker);		V.print(OS, SlotTracker);
return OS;		return OS;
▲ Show 20 Lines • Show All 300 Lines • ▼ Show 20 Lines	for (auto *Def : R.definedValues())
Def->replaceAllUsesWith(NewValue);		Def->replaceAllUsesWith(NewValue);

if (auto *User = R.toVPUser())		if (auto *User = R.toVPUser())
for (unsigned I = 0, E = User->getNumOperands(); I != E; I++)		for (unsigned I = 0, E = User->getNumOperands(); I != E; I++)
User->setOperand(I, NewValue);		User->setOperand(I, NewValue);
}		}
}		}


		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - Lint: Pre-merge checks: clang-format: please reformat the code ``` - ```
		void VPBasicBlock::print(raw_ostream &O, const Twine &Indent,
		VPSlotTracker &SlotTracker) const {
		O << Indent << getName() << ":\n";
		if (const VPValue *Pred = getPredicate()) {
		O << Indent << "BlockPredicate:";
		Pred->printAsOperand(O, SlotTracker);
		if (auto *PredInst = dyn_cast<VPInstruction>(Pred))
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: 'auto PredInst' can be declared as 'const auto PredInst' [llvm-qualified-auto] not useful Lint: Pre-merge checks: clang-tidy: warning: 'auto PredInst' can be declared as 'const auto PredInst' [llvm-qualified…
		O << " (" << PredInst->getParent()->getName() << ")";
		O << '\n';
		}

		auto RecipeIndent = Indent + " ";
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: twine variables are prone to use-after-free bugs [llvm-twine-local] not useful Lint: Pre-merge checks: clang-tidy: warning: twine variables are prone to use-after-free bugs [llvm-twine-local]…
		for (const VPRecipeBase &Recipe : *this) {
		Recipe.print(O, RecipeIndent, SlotTracker);
		O << '\n';
		}

		O << Indent << "Successor(s):";
		for (auto *Succ : getSuccessors())
		fhahnUnsubmitted Not Done Reply Inline Actions I think it would be good to separate the block names by a comma or something like that. You could either use `join_items` or `ListSeparator` from `StringExtras.h` fhahn: I think it would be good to separate the block names by a comma or something like that. You…
		a.elovikovAuthorUnsubmitted Done Reply Inline Actions `ListSeparator` is great! Wasn't aware of it before. a.elovikov: `ListSeparator` is great! Wasn't aware of it before.
		O << Indent << " " << Succ->getName();
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - O << Indent << " " << Succ->getName(); + O << Indent << " " << Succ->getName(); Lint: Pre-merge checks: clang-format: please reformat the code ``` - O << Indent << " " << Succ->getName(); + O…
		O << '\n';

		if (const VPValue *CBV = getCondBit()) {
		O << Indent << "CondBit: ";
		CBV->printAsOperand(O, SlotTracker);
		if (auto *CBI = dyn_cast<VPInstruction>(CBV))
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: 'auto CBI' can be declared as 'const auto CBI' [llvm-qualified-auto] not useful Lint: Pre-merge checks: clang-tidy: warning: 'auto CBI' can be declared as 'const auto CBI' [llvm-qualified-auto]…
		O << " (" << CBI->getParent()->getName() << ")";
		O << '\n';
		}
		}

void VPRegionBlock::dropAllReferences(VPValue *NewValue) {		void VPRegionBlock::dropAllReferences(VPValue *NewValue) {
for (VPBlockBase *Block : depth_first(Entry))		for (VPBlockBase *Block : depth_first(Entry))
// Drop all references in VPBasicBlocks and replace all uses with		// Drop all references in VPBasicBlocks and replace all uses with
// DummyValue.		// DummyValue.
Block->dropAllReferences(NewValue);		Block->dropAllReferences(NewValue);
}		}

void VPRegionBlock::execute(VPTransformState *State) {		void VPRegionBlock::execute(VPTransformState *State) {
Show All 40 Lines	for (unsigned Lane = 0, VF = State->VF.getKnownMinValue(); Lane < VF;
}		}
}		}
}		}

// Exit replicating mode.		// Exit replicating mode.
State->Instance.reset();		State->Instance.reset();
}		}

		void VPRegionBlock::print(raw_ostream &O, const Twine &Indent,
		VPSlotTracker &SlotTracker) const {
		O << Indent << (isReplicator() ? "<xVFxUF> " : "<x1> ") << getName() << ": {";
		auto NewIndent = Indent + " ";
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: twine variables are prone to use-after-free bugs [llvm-twine-local] not useful Lint: Pre-merge checks: clang-tidy: warning: twine variables are prone to use-after-free bugs [llvm-twine-local]…
		for (auto *BlockBase : depth_first(Entry)) {
		O << '\n';
		BlockBase->print(O, NewIndent, SlotTracker);
		}
		O << Indent << "}\n";
		}

void VPRecipeBase::insertBefore(VPRecipeBase *InsertPos) {		void VPRecipeBase::insertBefore(VPRecipeBase *InsertPos) {
assert(!Parent && "Recipe already in some VPBasicBlock");		assert(!Parent && "Recipe already in some VPBasicBlock");
assert(InsertPos->getParent() &&		assert(InsertPos->getParent() &&
"Insertion position not in any VPBasicBlock");		"Insertion position not in any VPBasicBlock");
Parent = InsertPos->getParent();		Parent = InsertPos->getParent();
Parent->getRecipeList().insert(InsertPos->getIterator(), this);		Parent->getRecipeList().insert(InsertPos->getIterator(), this);
}		}

▲ Show 20 Lines • Show All 90 Lines • ▼ Show 20 Lines

void VPInstruction::dump() const {		void VPInstruction::dump() const {
VPSlotTracker SlotTracker(getParent()->getPlan());		VPSlotTracker SlotTracker(getParent()->getPlan());
print(dbgs(), "", SlotTracker);		print(dbgs(), "", SlotTracker);
}		}

void VPInstruction::print(raw_ostream &O, const Twine &Indent,		void VPInstruction::print(raw_ostream &O, const Twine &Indent,
VPSlotTracker &SlotTracker) const {		VPSlotTracker &SlotTracker) const {
O << "EMIT ";		O << Indent << "EMIT ";

if (hasResult()) {		if (hasResult()) {
printAsOperand(O, SlotTracker);		printAsOperand(O, SlotTracker);
O << " = ";		O << " = ";
}		}

switch (getOpcode()) {		switch (getOpcode()) {
case VPInstruction::Not:		case VPInstruction::Not:
Show All 21 Lines	for (const VPValue *Operand : operands()) {
Operand->printAsOperand(O, SlotTracker);		Operand->printAsOperand(O, SlotTracker);
}		}
}		}

/// Generate the code inside the body of the vectorized loop. Assumes a single		/// Generate the code inside the body of the vectorized loop. Assumes a single
/// LoopVectorBody basic-block was created for this. Introduce additional		/// LoopVectorBody basic-block was created for this. Introduce additional
/// basic-blocks as needed, and fill them all.		/// basic-blocks as needed, and fill them all.
void VPlan::execute(VPTransformState *State) {		void VPlan::execute(VPTransformState *State) {
		#if !defined(NDEBUG) \|\| defined(LLVM_ENABLE_DUMP)
		if (PrintExecutedVPlan) {
		print(outs());
		outs().flush();
		}
		LLVM_DEBUG(dbgs() << "VPlan being executed:\n"; dump());
		#endif

// -1. Check if the backedge taken count is needed, and if so build it.		// -1. Check if the backedge taken count is needed, and if so build it.
if (BackedgeTakenCount && BackedgeTakenCount->getNumUsers()) {		if (BackedgeTakenCount && BackedgeTakenCount->getNumUsers()) {
Value *TC = State->TripCount;		Value *TC = State->TripCount;
IRBuilder<> Builder(State->CFG.PrevBB->getTerminator());		IRBuilder<> Builder(State->CFG.PrevBB->getTerminator());
auto *TCMO = Builder.CreateSub(TC, ConstantInt::get(TC->getType(), 1),		auto *TCMO = Builder.CreateSub(TC, ConstantInt::get(TC->getType(), 1),
"trip.count.minus.1");		"trip.count.minus.1");
auto VF = State->VF;		auto VF = State->VF;
Value *VTCMO =		Value *VTCMO =
▲ Show 20 Lines • Show All 70 Lines • ▼ Show 20 Lines	#endif
// We do not attempt to preserve DT for outer loop vectorization currently.		// We do not attempt to preserve DT for outer loop vectorization currently.
if (!EnableVPlanNativePath)		if (!EnableVPlanNativePath)
updateDominatorTree(State->DT, VectorPreHeaderBB, VectorLatchBB,		updateDominatorTree(State->DT, VectorPreHeaderBB, VectorLatchBB,
L->getExitBlock());		L->getExitBlock());
}		}

#if !defined(NDEBUG) \|\| defined(LLVM_ENABLE_DUMP)		#if !defined(NDEBUG) \|\| defined(LLVM_ENABLE_DUMP)
LLVM_DUMP_METHOD		LLVM_DUMP_METHOD
void VPlan::dump() const { dbgs() << *this << '\n'; }		void VPlan::print(raw_ostream &O) const {
		VPSlotTracker SlotTracker(this);

		O << "VPlan {";
		for (const VPBlockBase *Block : depth_first(getEntry())) {
		O << '\n';
		Block->print(O, "", SlotTracker);
		}
		O << "}\n";
		}

		LLVM_DUMP_METHOD
		void VPlan::dump() const {
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code -void VPlan::dump() const { - print(dbgs()); -} +void VPlan::dump() const { print(dbgs()); } Lint: Pre-merge checks: clang-format: please reformat the code ``` -void VPlan::dump() const { - print(dbgs()); -}…
		print(dbgs());
		}
#endif		#endif

void VPlan::updateDominatorTree(DominatorTree DT, BasicBlock LoopPreHeaderBB,		void VPlan::updateDominatorTree(DominatorTree DT, BasicBlock LoopPreHeaderBB,
BasicBlock *LoopLatchBB,		BasicBlock *LoopLatchBB,
BasicBlock *LoopExitBB) {		BasicBlock *LoopExitBB) {
BasicBlock *LoopHeaderBB = LoopPreHeaderBB->getSingleSuccessor();		BasicBlock *LoopHeaderBB = LoopPreHeaderBB->getSingleSuccessor();
assert(LoopHeaderBB && "Loop preheader does not have a single successor.");		assert(LoopHeaderBB && "Loop preheader does not have a single successor.");
// The vector body may be more than a single basic-block by this point.		// The vector body may be more than a single basic-block by this point.
▲ Show 20 Lines • Show All 104 Lines • ▼ Show 20 Lines	else if (Successors.size() == 2) {
for (auto *Successor : Successors)		for (auto *Successor : Successors)
drawEdge(Block, Successor, false, Twine(SuccessorNumber++));		drawEdge(Block, Successor, false, Twine(SuccessorNumber++));
}		}
}		}

void VPlanPrinter::dumpBasicBlock(const VPBasicBlock *BasicBlock) {		void VPlanPrinter::dumpBasicBlock(const VPBasicBlock *BasicBlock) {
OS << Indent << getUID(BasicBlock) << " [label =\n";		OS << Indent << getUID(BasicBlock) << " [label =\n";
bumpIndent(1);		bumpIndent(1);
OS << Indent << "\"" << DOT::EscapeString(BasicBlock->getName()) << ":\\n\"";		std::string Str;
bumpIndent(1);		raw_string_ostream SS(Str);
		// Use no indentation as we need to wrap the lines into quotes ourselves.
		BasicBlock->print(SS, "", SlotTracker);
		fhahnUnsubmitted Not Done Reply Inline Actions Could you add a comment explaining that we first print the block and then split up/reconstruct the output with the .dot syntax> fhahn: Could you add a comment explaining that we first print the block and then split up/reconstruct…
		a.elovikovAuthorUnsubmitted Done Reply Inline Actions Sure, will update in the next patch set. a.elovikov: Sure, will update in the next patch set.
		SmallVector<StringRef, 0> Lines;
		StringRef(Str).rtrim('\n').split(Lines, "\n");

		auto EmitLine = [&](StringRef Line, StringRef Suffix) {
		OS << Indent << '"' << DOT::EscapeString(Line.str()) << "\\l\"" << Suffix;
		};

		// Don't need the "+" after the last line as well.
		for (auto Line : make_range(Lines.begin(), Lines.end() - 1))
		EmitLine(Line, " +\n");
		EmitLine(Lines.back(), "\n");

// Dump the block predicate.		bumpIndent(-1);
const VPValue *Pred = BasicBlock->getPredicate();		OS << Indent << "]\n";
if (Pred) {
OS << " +\n" << Indent << " \"BlockPredicate: \"";
if (const VPInstruction *PredI = dyn_cast<VPInstruction>(Pred)) {
PredI->printAsOperand(OS, SlotTracker);
OS << " (" << DOT::EscapeString(PredI->getParent()->getName())
<< ")\\l\"";
} else
Pred->printAsOperand(OS, SlotTracker);
}

for (const VPRecipeBase &Recipe : *BasicBlock) {
OS << " +\n" << Indent << "\"";
Recipe.print(OS, Indent, SlotTracker);
OS << "\\l\"";
}

// Dump the condition bit.
const VPValue *CBV = BasicBlock->getCondBit();
if (CBV) {
OS << " +\n" << Indent << " \"CondBit: ";
if (const VPInstruction *CBI = dyn_cast<VPInstruction>(CBV)) {
CBI->printAsOperand(OS, SlotTracker);
OS << " (" << DOT::EscapeString(CBI->getParent()->getName()) << ")\\l\"";
} else {
CBV->printAsOperand(OS, SlotTracker);
OS << "\"";
}
}

bumpIndent(-2);
OS << "\n" << Indent << "]\n";
dumpEdges(BasicBlock);		dumpEdges(BasicBlock);
}		}

void VPlanPrinter::dumpRegion(const VPRegionBlock *Region) {		void VPlanPrinter::dumpRegion(const VPRegionBlock *Region) {
OS << Indent << "subgraph " << getUID(Region) << " {\n";		OS << Indent << "subgraph " << getUID(Region) << " {\n";
bumpIndent(1);		bumpIndent(1);
OS << Indent << "fontname=Courier\n"		OS << Indent << "fontname=Courier\n"
<< Indent << "label=\""		<< Indent << "label=\""
Show All 26 Lines	void VPlanPrinter::printAsIngredient(raw_ostream &O, const Value *V) {
} else // !Inst		} else // !Inst
V->printAsOperand(RSO, false);		V->printAsOperand(RSO, false);
RSO.flush();		RSO.flush();
O << DOT::EscapeString(IngredientString);		O << DOT::EscapeString(IngredientString);
}		}

void VPWidenCallRecipe::print(raw_ostream &O, const Twine &Indent,		void VPWidenCallRecipe::print(raw_ostream &O, const Twine &Indent,
VPSlotTracker &SlotTracker) const {		VPSlotTracker &SlotTracker) const {
O << "WIDEN-CALL ";		O << Indent << "WIDEN-CALL ";
		fhahnUnsubmitted Not Done Reply Inline Actions Could you split off the changes not directly related to printing in non-DOT mode (like respecting `Indent` in the individual print() implementations)? fhahn: Could you split off the changes not directly related to printing in non-DOT mode (like…

auto *CI = cast<CallInst>(getUnderlyingInstr());		auto *CI = cast<CallInst>(getUnderlyingInstr());
if (CI->getType()->isVoidTy())		if (CI->getType()->isVoidTy())
O << "void ";		O << "void ";
else {		else {
printAsOperand(O, SlotTracker);		printAsOperand(O, SlotTracker);
O << " = ";		O << " = ";
}		}

O << "call @" << CI->getCalledFunction()->getName() << "(";		O << "call @" << CI->getCalledFunction()->getName() << "(";
printOperands(O, SlotTracker);		printOperands(O, SlotTracker);
O << ")";		O << ")";
}		}

void VPWidenSelectRecipe::print(raw_ostream &O, const Twine &Indent,		void VPWidenSelectRecipe::print(raw_ostream &O, const Twine &Indent,
VPSlotTracker &SlotTracker) const {		VPSlotTracker &SlotTracker) const {
O << "WIDEN-SELECT ";		O << Indent << "WIDEN-SELECT ";
printAsOperand(O, SlotTracker);		printAsOperand(O, SlotTracker);
O << " = select ";		O << " = select ";
getOperand(0)->printAsOperand(O, SlotTracker);		getOperand(0)->printAsOperand(O, SlotTracker);
O << ", ";		O << ", ";
getOperand(1)->printAsOperand(O, SlotTracker);		getOperand(1)->printAsOperand(O, SlotTracker);
O << ", ";		O << ", ";
getOperand(2)->printAsOperand(O, SlotTracker);		getOperand(2)->printAsOperand(O, SlotTracker);
O << (InvariantCond ? " (condition is loop invariant)" : "");		O << (InvariantCond ? " (condition is loop invariant)" : "");
}		}

void VPWidenRecipe::print(raw_ostream &O, const Twine &Indent,		void VPWidenRecipe::print(raw_ostream &O, const Twine &Indent,
VPSlotTracker &SlotTracker) const {		VPSlotTracker &SlotTracker) const {
O << "WIDEN ";		O << Indent << "WIDEN ";
printAsOperand(O, SlotTracker);		printAsOperand(O, SlotTracker);
O << " = " << getUnderlyingInstr()->getOpcodeName() << " ";		O << " = " << getUnderlyingInstr()->getOpcodeName() << " ";
printOperands(O, SlotTracker);		printOperands(O, SlotTracker);
}		}

void VPWidenIntOrFpInductionRecipe::print(raw_ostream &O, const Twine &Indent,		void VPWidenIntOrFpInductionRecipe::print(raw_ostream &O, const Twine &Indent,
VPSlotTracker &SlotTracker) const {		VPSlotTracker &SlotTracker) const {
O << "WIDEN-INDUCTION";		O << Indent << "WIDEN-INDUCTION";
if (getTruncInst()) {		if (getTruncInst()) {
O << "\\l\"";		O << "\\l\"";
O << " +\n" << Indent << "\" " << VPlanIngredient(IV) << "\\l\"";		O << " +\n" << Indent << "\" " << VPlanIngredient(IV) << "\\l\"";
O << " +\n" << Indent << "\" ";		O << " +\n" << Indent << "\" ";
getVPValue(0)->printAsOperand(O, SlotTracker);		getVPValue(0)->printAsOperand(O, SlotTracker);
} else		} else
O << " " << VPlanIngredient(IV);		O << " " << VPlanIngredient(IV);
}		}

void VPWidenGEPRecipe::print(raw_ostream &O, const Twine &Indent,		void VPWidenGEPRecipe::print(raw_ostream &O, const Twine &Indent,
VPSlotTracker &SlotTracker) const {		VPSlotTracker &SlotTracker) const {
O << "WIDEN-GEP ";		O << Indent << "WIDEN-GEP ";
O << (IsPtrLoopInvariant ? "Inv" : "Var");		O << (IsPtrLoopInvariant ? "Inv" : "Var");
size_t IndicesNumber = IsIndexLoopInvariant.size();		size_t IndicesNumber = IsIndexLoopInvariant.size();
for (size_t I = 0; I < IndicesNumber; ++I)		for (size_t I = 0; I < IndicesNumber; ++I)
O << "[" << (IsIndexLoopInvariant[I] ? "Inv" : "Var") << "]";		O << "[" << (IsIndexLoopInvariant[I] ? "Inv" : "Var") << "]";

O << " ";		O << " ";
printAsOperand(O, SlotTracker);		printAsOperand(O, SlotTracker);
O << " = getelementptr ";		O << " = getelementptr ";
printOperands(O, SlotTracker);		printOperands(O, SlotTracker);
}		}

void VPWidenPHIRecipe::print(raw_ostream &O, const Twine &Indent,		void VPWidenPHIRecipe::print(raw_ostream &O, const Twine &Indent,
VPSlotTracker &SlotTracker) const {		VPSlotTracker &SlotTracker) const {
O << "WIDEN-PHI " << VPlanIngredient(Phi);		O << Indent << "WIDEN-PHI " << VPlanIngredient(Phi);
}		}

void VPBlendRecipe::print(raw_ostream &O, const Twine &Indent,		void VPBlendRecipe::print(raw_ostream &O, const Twine &Indent,
VPSlotTracker &SlotTracker) const {		VPSlotTracker &SlotTracker) const {
O << "BLEND ";		O << Indent << "BLEND ";
Phi->printAsOperand(O, false);		Phi->printAsOperand(O, false);
O << " =";		O << " =";
if (getNumIncomingValues() == 1) {		if (getNumIncomingValues() == 1) {
// Not a User of any mask: not really blending, this is a		// Not a User of any mask: not really blending, this is a
// single-predecessor phi.		// single-predecessor phi.
O << " ";		O << " ";
getIncomingValue(0)->printAsOperand(O, SlotTracker);		getIncomingValue(0)->printAsOperand(O, SlotTracker);
} else {		} else {
for (unsigned I = 0, E = getNumIncomingValues(); I < E; ++I) {		for (unsigned I = 0, E = getNumIncomingValues(); I < E; ++I) {
O << " ";		O << " ";
getIncomingValue(I)->printAsOperand(O, SlotTracker);		getIncomingValue(I)->printAsOperand(O, SlotTracker);
O << "/";		O << "/";
getMask(I)->printAsOperand(O, SlotTracker);		getMask(I)->printAsOperand(O, SlotTracker);
}		}
}		}
}		}

void VPReductionRecipe::print(raw_ostream &O, const Twine &Indent,		void VPReductionRecipe::print(raw_ostream &O, const Twine &Indent,
VPSlotTracker &SlotTracker) const {		VPSlotTracker &SlotTracker) const {
O << "REDUCE ";		O << Indent << "REDUCE ";
printAsOperand(O, SlotTracker);		printAsOperand(O, SlotTracker);
O << " = ";		O << " = ";
getChainOp()->printAsOperand(O, SlotTracker);		getChainOp()->printAsOperand(O, SlotTracker);
O << " + reduce." << Instruction::getOpcodeName(RdxDesc->getOpcode())		O << " + reduce." << Instruction::getOpcodeName(RdxDesc->getOpcode())
<< " (";		<< " (";
getVecOp()->printAsOperand(O, SlotTracker);		getVecOp()->printAsOperand(O, SlotTracker);
if (getCondOp()) {		if (getCondOp()) {
O << ", ";		O << ", ";
getCondOp()->printAsOperand(O, SlotTracker);		getCondOp()->printAsOperand(O, SlotTracker);
}		}
O << ")";		O << ")";
}		}

void VPReplicateRecipe::print(raw_ostream &O, const Twine &Indent,		void VPReplicateRecipe::print(raw_ostream &O, const Twine &Indent,
VPSlotTracker &SlotTracker) const {		VPSlotTracker &SlotTracker) const {
O << (IsUniform ? "CLONE " : "REPLICATE ");		O << Indent << (IsUniform ? "CLONE " : "REPLICATE ");

if (!getUnderlyingInstr()->getType()->isVoidTy()) {		if (!getUnderlyingInstr()->getType()->isVoidTy()) {
printAsOperand(O, SlotTracker);		printAsOperand(O, SlotTracker);
O << " = ";		O << " = ";
}		}
O << Instruction::getOpcodeName(getUnderlyingInstr()->getOpcode()) << " ";		O << Instruction::getOpcodeName(getUnderlyingInstr()->getOpcode()) << " ";
printOperands(O, SlotTracker);		printOperands(O, SlotTracker);

if (AlsoPack)		if (AlsoPack)
O << " (S->V)";		O << " (S->V)";
}		}

void VPPredInstPHIRecipe::print(raw_ostream &O, const Twine &Indent,		void VPPredInstPHIRecipe::print(raw_ostream &O, const Twine &Indent,
VPSlotTracker &SlotTracker) const {		VPSlotTracker &SlotTracker) const {
O << "PHI-PREDICATED-INSTRUCTION ";		O << Indent << "PHI-PREDICATED-INSTRUCTION ";
printOperands(O, SlotTracker);		printOperands(O, SlotTracker);
}		}

void VPWidenMemoryInstructionRecipe::print(raw_ostream &O, const Twine &Indent,		void VPWidenMemoryInstructionRecipe::print(raw_ostream &O, const Twine &Indent,
VPSlotTracker &SlotTracker) const {		VPSlotTracker &SlotTracker) const {
O << "WIDEN ";		O << Indent << "WIDEN ";

if (!isStore()) {		if (!isStore()) {
getVPValue()->printAsOperand(O, SlotTracker);		getVPValue()->printAsOperand(O, SlotTracker);
O << " = ";		O << " = ";
}		}
O << Instruction::getOpcodeName(Ingredient.getOpcode()) << " ";		O << Instruction::getOpcodeName(Ingredient.getOpcode()) << " ";

printOperands(O, SlotTracker);		printOperands(O, SlotTracker);
Show All 21 Lines	for (unsigned Part = 0, UF = State.UF; Part < UF; ++Part) {
// Add the consecutive indices to the vector value.		// Add the consecutive indices to the vector value.
Value *CanonicalVectorIV = Builder.CreateAdd(VStart, VStep, "vec.iv");		Value *CanonicalVectorIV = Builder.CreateAdd(VStart, VStep, "vec.iv");
State.set(getVPValue(), CanonicalVectorIV, Part);		State.set(getVPValue(), CanonicalVectorIV, Part);
}		}
}		}

void VPWidenCanonicalIVRecipe::print(raw_ostream &O, const Twine &Indent,		void VPWidenCanonicalIVRecipe::print(raw_ostream &O, const Twine &Indent,
VPSlotTracker &SlotTracker) const {		VPSlotTracker &SlotTracker) const {
O << "EMIT ";		O << Indent << "EMIT ";
getVPValue()->printAsOperand(O, SlotTracker);		getVPValue()->printAsOperand(O, SlotTracker);
O << " = WIDEN-CANONICAL-INDUCTION";		O << " = WIDEN-CANONICAL-INDUCTION";
}		}

template void DomTreeBuilder::Calculate<VPDominatorTree>(VPDominatorTree &DT);		template void DomTreeBuilder::Calculate<VPDominatorTree>(VPDominatorTree &DT);

void VPValue::replaceAllUsesWith(VPValue *New) {		void VPValue::replaceAllUsesWith(VPValue *New) {
for (unsigned J = 0; J < getNumUsers();) {		for (unsigned J = 0; J < getNumUsers();) {
▲ Show 20 Lines • Show All 117 Lines • Show Last 20 Lines

llvm/test/Transforms/LoopVectorize/icmp-uniforms.ll

; REQUIRES: asserts		; REQUIRES: asserts
; RUN: opt < %s -loop-vectorize -force-vector-width=4 -force-vector-interleave=1 -instcombine -debug-only=loop-vectorize -disable-output -print-after=instcombine 2>&1 -enable-new-pm=0 \| FileCheck %s		; RUN: opt < %s -loop-vectorize -force-vector-width=4 -force-vector-interleave=1 -instcombine -print-executed-vplan -disable-output -print-after=instcombine -enable-new-pm=0 2>&1 \| FileCheck %s
; RUN: opt < %s -passes=loop-vectorize,instcombine -force-vector-width=4 -force-vector-interleave=1 -debug-only=loop-vectorize -disable-output -print-after=instcombine 2>&1 \| FileCheck %s		; RUN: opt < %s -passes=loop-vectorize,instcombine -force-vector-width=4 -force-vector-interleave=1 -print-executed-vplan -disable-output -print-after=instcombine 2>&1 \| FileCheck %s

target datalayout = "e-m:e-i64:64-i128:128-n32:64-S128"		target datalayout = "e-m:e-i64:64-i128:128-n32:64-S128"

; CHECK-LABEL: more_than_one_use		; CHECK-LABEL: more_than_one_use
;		;
; PR30627. Check that a compare instruction with more than one use is not		; PR30627. Check that a compare instruction with more than one use is not
; recognized as uniform and is vectorized.		; recognized as uniform and is vectorized.
;		;
; CHECK-NOT: Found uniform instruction: %cond = icmp slt i64 %i.next, %n
; CHECK: vector.body		; CHECK: vector.body
; CHECK: %[[I:.+]] = add nuw nsw <4 x i64> %vec.ind, <i64 1, i64 1, i64 1, i64 1>		; CHECK: %[[I:.+]] = add nuw nsw <4 x i64> %vec.ind, <i64 1, i64 1, i64 1, i64 1>
; CHECK: icmp slt <4 x i64> %[[I]], %broadcast.splat		; CHECK: icmp slt <4 x i64> %[[I]], %broadcast.splat
; CHECK: br i1 {{.*}}, label %middle.block, label %vector.body		; CHECK: br i1 {{.*}}, label %middle.block, label %vector.body
;		;
define i32 @more_than_one_use(i32* %a, i64 %n) {		define i32 @more_than_one_use(i32* %a, i64 %n) {
entry:		entry:
br label %for.body		br label %for.body
Show All 10 Lines	for.body:
br i1 %cond, label %for.body, label %for.end		br i1 %cond, label %for.body, label %for.end

for.end:		for.end:
%tmp4 = phi i32 [ %tmp3, %for.body ]		%tmp4 = phi i32 [ %tmp3, %for.body ]
ret i32 %tmp4		ret i32 %tmp4
}		}

; Check for crash exposed by D76992.		; Check for crash exposed by D76992.
; CHECK: N0 [label =		; CHECK: VPlan {
; CHECK-NEXT: "loop:\n" +		; CHECK-NEXT: loop:
; CHECK-NEXT: "WIDEN-INDUCTION %iv = phi 0, %iv.next\l" +		; CHECK-NEXT: WIDEN-INDUCTION %iv = phi %bc.resume.val, %iv.next
		fhahnUnsubmitted Not Done Reply Inline Actions Did the value here change because the plan gets printed later? fhahn: Did the value here change because the plan gets printed later?
		a.elovikovAuthorUnsubmitted Done Reply Inline Actions I didn't study in details, but I'd expect it to be so. Is that something expected for you, or do you want me to study that change in details (my knowledge of the VPlan pipeline is still limited)? a.elovikov: I didn't study in details, but I'd expect it to be so. Is that something expected for you, or…
		a.elovikovAuthorUnsubmitted Done Reply Inline Actions Yes, it gets changed in the middle of InnerLoopVectorizer::createVectorizedLoopSkeleton/InnerLoopVectorizer::createInductionResumeValues. We seem to be using original LLVM IR value through `VPlanIngredient` when printing `VPWidenIntOrFpInductionRecipe`, so modifying original loop when creating the skeleton changes printing. a.elovikov: Yes, it gets changed in the middle of InnerLoopVectorizer…
; CHECK-NEXT: "WIDEN ir<%cond0> = icmp ir<%iv>, ir<13>\l" +		; CHECK-NEXT: WIDEN ir<%cond0> = icmp ir<%iv>, ir<13>
; CHECK-NEXT: "WIDEN-SELECT ir<%s> = select ir<%cond0>, ir<10>, ir<20>\l"		; CHECK-NEXT: WIDEN-SELECT ir<%s> = select ir<%cond0>, ir<10>, ir<20>
; CHECK-NEXT: ]		; CHECK-NEXT: Successor(s):
		; CHECK-NEXT: }
define void @test() {		define void @test() {
entry:		entry:
br label %loop		br label %loop

loop: ; preds = %loop, %entry		loop: ; preds = %loop, %entry
%iv = phi i64 [ 0, %entry ], [ %iv.next, %loop ]		%iv = phi i64 [ 0, %entry ], [ %iv.next, %loop ]
%cond0 = icmp ult i64 %iv, 13		%cond0 = icmp ult i64 %iv, 13
%s = select i1 %cond0, i32 10, i32 20		%s = select i1 %cond0, i32 10, i32 20
%iv.next = add nuw nsw i64 %iv, 1		%iv.next = add nuw nsw i64 %iv, 1
%exitcond = icmp eq i64 %iv.next, 14		%exitcond = icmp eq i64 %iv.next, 14
br i1 %exitcond, label %exit, label %loop		br i1 %exitcond, label %exit, label %loop

exit: ; preds = %loop		exit: ; preds = %loop
ret void		ret void
}		}

llvm/test/Transforms/LoopVectorize/vplan-printing.ll

	; REQUIRES: asserts			; REQUIRES: asserts

	; RUN: opt -loop-vectorize -debug-only=loop-vectorize -force-vector-interleave=1 -force-vector-width=4 -prefer-inloop-reductions -disable-output %s 2>&1 \| FileCheck %s			; RUN: opt -loop-vectorize -print-executed-vplan -force-vector-interleave=1 -force-vector-width=4 -prefer-inloop-reductions -disable-output < %s \| FileCheck %s
	fhahnUnsubmitted Not Done Reply Inline Actions I think we should have at least a LIT test that also checks `-vplan-print-in-dot-format=true`. Perhaps this file would a good candidate to do so? fhahn: I think we should have at least a LIT test that also checks `-vplan-print-in-dot-format=true`.

	target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"			target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"

	; Tests for printing VPlans.			; Tests for printing VPlans.

	define void @print_call_and_memory(i64 %n, float* noalias %y, float* noalias %x) nounwind uwtable {			define void @print_call_and_memory(i64 %n, float* noalias %y, float* noalias %x) nounwind uwtable {
	; CHECK: N0 [label =			; CHECK: VPlan {
	; CHECK-NEXT: "for.body:\n" +			; CHECK-NEXT: for.body:
	; CHECK-NEXT: "WIDEN-INDUCTION %iv = phi %iv.next, 0\l" +			; CHECK-NEXT: WIDEN-INDUCTION %iv = phi %iv.next, %bc.resume.val
	; CHECK-NEXT: "CLONE ir<%arrayidx> = getelementptr ir<%y>, ir<%iv>\l" +			; CHECK-NEXT: CLONE ir<%arrayidx> = getelementptr ir<%y>, ir<%iv>
	; CHECK-NEXT: "WIDEN ir<%lv> = load ir<%arrayidx>\l" +			; CHECK-NEXT: WIDEN ir<%lv> = load ir<%arrayidx>
	; CHECK-NEXT: "WIDEN-CALL ir<%call> = call @llvm.sqrt.f32(ir<%lv>)\l" +			; CHECK-NEXT: WIDEN-CALL ir<%call> = call @llvm.sqrt.f32(ir<%lv>)
	; CHECK-NEXT: "CLONE ir<%arrayidx2> = getelementptr ir<%x>, ir<%iv>\l" +			; CHECK-NEXT: CLONE ir<%arrayidx2> = getelementptr ir<%x>, ir<%iv>
	; CHECK-NEXT: "WIDEN store ir<%arrayidx2>, ir<%call>\l"			; CHECK-NEXT: WIDEN store ir<%arrayidx2>, ir<%call>
	; CHECK-NEXT: ]			; CHECK-NEXT: Successor(s):
				; CHECK-NEXT: }
				;
	entry:			entry:
	%cmp6 = icmp sgt i64 %n, 0			%cmp6 = icmp sgt i64 %n, 0
	br i1 %cmp6, label %for.body, label %for.end			br i1 %cmp6, label %for.body, label %for.end

	for.body: ; preds = %entry, %for.body			for.body: ; preds = %entry, %for.body
	%iv = phi i64 [ %iv.next, %for.body ], [ 0, %entry ]			%iv = phi i64 [ %iv.next, %for.body ], [ 0, %entry ]
	%arrayidx = getelementptr inbounds float, float* %y, i64 %iv			%arrayidx = getelementptr inbounds float, float* %y, i64 %iv
	%lv = load float, float* %arrayidx, align 4			%lv = load float, float* %arrayidx, align 4
	%call = tail call float @llvm.sqrt.f32(float %lv) nounwind readnone			%call = tail call float @llvm.sqrt.f32(float %lv) nounwind readnone
	%arrayidx2 = getelementptr inbounds float, float* %x, i64 %iv			%arrayidx2 = getelementptr inbounds float, float* %x, i64 %iv
	store float %call, float* %arrayidx2, align 4			store float %call, float* %arrayidx2, align 4
	%iv.next = add i64 %iv, 1			%iv.next = add i64 %iv, 1
	%exitcond = icmp eq i64 %iv.next, %n			%exitcond = icmp eq i64 %iv.next, %n
	br i1 %exitcond, label %for.end, label %for.body			br i1 %exitcond, label %for.end, label %for.body

	for.end: ; preds = %for.body, %entry			for.end: ; preds = %for.body, %entry
	ret void			ret void
	}			}

	define void @print_widen_gep_and_select(i64 %n, float* noalias %y, float* noalias %x, float* %z) nounwind uwtable {			define void @print_widen_gep_and_select(i64 %n, float* noalias %y, float* noalias %x, float* %z) nounwind uwtable {
	; CHECK: N0 [label =			; CHECK: VPlan {
	; CHECK-NEXT: "for.body:\n" +			; CHECK-NEXT: for.body:
	; CHECK-NEXT: "WIDEN-INDUCTION %iv = phi %iv.next, 0\l" +			; CHECK-NEXT: WIDEN-INDUCTION %iv = phi %iv.next, %bc.resume.val
	; CHECK-NEXT: "WIDEN-GEP Inv[Var] ir<%arrayidx> = getelementptr ir<%y>, ir<%iv>\l" +			; CHECK-NEXT: WIDEN-GEP Inv[Var] ir<%arrayidx> = getelementptr ir<%y>, ir<%iv>
	; CHECK-NEXT: "WIDEN ir<%lv> = load ir<%arrayidx>\l" +			; CHECK-NEXT: WIDEN ir<%lv> = load ir<%arrayidx>
	; CHECK-NEXT: "WIDEN ir<%cmp> = icmp ir<%arrayidx>, ir<%z>\l" +			; CHECK-NEXT: WIDEN ir<%cmp> = icmp ir<%arrayidx>, ir<%z>
	; CHECK-NEXT: "WIDEN-SELECT ir<%sel> = select ir<%cmp>, ir<1.000000e+01>, ir<2.000000e+01>\l" +			; CHECK-NEXT: WIDEN-SELECT ir<%sel> = select ir<%cmp>, ir<1.000000e+01>, ir<2.000000e+01>
	; CHECK-NEXT: "WIDEN ir<%add> = fadd ir<%lv>, ir<%sel>\l" +			; CHECK-NEXT: WIDEN ir<%add> = fadd ir<%lv>, ir<%sel>
	; CHECK-NEXT: "CLONE ir<%arrayidx2> = getelementptr ir<%x>, ir<%iv>\l" +			; CHECK-NEXT: CLONE ir<%arrayidx2> = getelementptr ir<%x>, ir<%iv>
	; CHECK-NEXT: "WIDEN store ir<%arrayidx2>, ir<%add>\l"			; CHECK-NEXT: WIDEN store ir<%arrayidx2>, ir<%add>
	; CHECK-NEXT: ]			; CHECK-NEXT: Successor(s):
				; CHECK-NEXT: }
				;
	entry:			entry:
	%cmp6 = icmp sgt i64 %n, 0			%cmp6 = icmp sgt i64 %n, 0
	br i1 %cmp6, label %for.body, label %for.end			br i1 %cmp6, label %for.body, label %for.end

	for.body: ; preds = %entry, %for.body			for.body: ; preds = %entry, %for.body
	%iv = phi i64 [ %iv.next, %for.body ], [ 0, %entry ]			%iv = phi i64 [ %iv.next, %for.body ], [ 0, %entry ]
	%arrayidx = getelementptr inbounds float, float* %y, i64 %iv			%arrayidx = getelementptr inbounds float, float* %y, i64 %iv
	%lv = load float, float* %arrayidx, align 4			%lv = load float, float* %arrayidx, align 4
	%cmp = icmp eq float* %arrayidx, %z			%cmp = icmp eq float* %arrayidx, %z
	%sel = select i1 %cmp, float 10.0, float 20.0			%sel = select i1 %cmp, float 10.0, float 20.0
	%add = fadd float %lv, %sel			%add = fadd float %lv, %sel
	%arrayidx2 = getelementptr inbounds float, float* %x, i64 %iv			%arrayidx2 = getelementptr inbounds float, float* %x, i64 %iv
	store float %add, float* %arrayidx2, align 4			store float %add, float* %arrayidx2, align 4
	%iv.next = add i64 %iv, 1			%iv.next = add i64 %iv, 1
	%exitcond = icmp eq i64 %iv.next, %n			%exitcond = icmp eq i64 %iv.next, %n
	br i1 %exitcond, label %for.end, label %for.body			br i1 %exitcond, label %for.end, label %for.body

	for.end: ; preds = %for.body, %entry			for.end: ; preds = %for.body, %entry
	ret void			ret void
	}			}

	define float @print_reduction(i64 %n, float* noalias %y) {			define float @print_reduction(i64 %n, float* noalias %y) {
	; CHECK: N0 [label =			; CHECK: VPlan {
	; CHECK-NEXT: "for.body:\n" +			; CHECK-NEXT: for.body:
	; CHECK-NEXT: "WIDEN-INDUCTION %iv = phi %iv.next, 0\l" +			; CHECK-NEXT: WIDEN-INDUCTION %iv = phi %iv.next, %bc.resume.val
	; CHECK-NEXT: "WIDEN-PHI %red = phi %red.next, 0.000000e+00\l" +			; CHECK-NEXT: WIDEN-PHI %red = phi %red.next, 0.000000e+00
	; CHECK-NEXT: "CLONE ir<%arrayidx> = getelementptr ir<%y>, ir<%iv>\l" +			; CHECK-NEXT: CLONE ir<%arrayidx> = getelementptr ir<%y>, ir<%iv>
	; CHECK-NEXT: "WIDEN ir<%lv> = load ir<%arrayidx>\l" +			; CHECK-NEXT: WIDEN ir<%lv> = load ir<%arrayidx>
	; CHECK-NEXT: "REDUCE ir<%red.next> = ir<%red> + reduce.fadd (ir<%lv>)\l"			; CHECK-NEXT: REDUCE ir<%red.next> = ir<%red> + reduce.fadd (ir<%lv>)
	; CHECK-NEXT: ]			; CHECK-NEXT: Successor(s):
				; CHECK-NEXT: }
				;
	entry:			entry:
	br label %for.body			br label %for.body

	for.body: ; preds = %entry, %for.body			for.body: ; preds = %entry, %for.body
	%iv = phi i64 [ %iv.next, %for.body ], [ 0, %entry ]			%iv = phi i64 [ %iv.next, %for.body ], [ 0, %entry ]
	%red = phi float [ %red.next, %for.body ], [ 0.0, %entry ]			%red = phi float [ %red.next, %for.body ], [ 0.0, %entry ]
	%arrayidx = getelementptr inbounds float, float* %y, i64 %iv			%arrayidx = getelementptr inbounds float, float* %y, i64 %iv
	%lv = load float, float* %arrayidx, align 4			%lv = load float, float* %arrayidx, align 4
	%red.next = fadd fast float %lv, %red			%red.next = fadd fast float %lv, %red
	%iv.next = add i64 %iv, 1			%iv.next = add i64 %iv, 1
	%exitcond = icmp eq i64 %iv.next, %n			%exitcond = icmp eq i64 %iv.next, %n
	br i1 %exitcond, label %for.end, label %for.body			br i1 %exitcond, label %for.end, label %for.body

	for.end: ; preds = %for.body, %entry			for.end: ; preds = %for.body, %entry
	ret float %red.next			ret float %red.next
	}			}

	define void @print_replicate_predicated_phi(i64 %n, i64* %x) {			define void @print_replicate_predicated_phi(i64 %n, i64* %x) {
	; CHECK: N0 [label =			; CHECK: VPlan {
	; CHECK-NEXT: "for.body:\n" +			; CHECK-NEXT: for.body:
	; CHECK-NEXT: "WIDEN-INDUCTION %i = phi 0, %i.next\l" +			; CHECK-NEXT: WIDEN-INDUCTION %i = phi %bc.resume.val, %i.next
	; CHECK-NEXT: "WIDEN ir<%cmp> = icmp ir<%i>, ir<5>\l"			; CHECK-NEXT: WIDEN ir<%cmp> = icmp ir<%i>, ir<5>
	; CHECK-NEXT: ]			; CHECK-NEXT: Successor(s): if.then
	;			; CHECK-EMPTY:
	; CHECK: N2 [label =			; CHECK-NEXT: if.then:
	; CHECK-NEXT: "pred.udiv.entry:\n" +			; CHECK-NEXT: Successor(s): pred.udiv
	; CHECK-NEXT: +			; CHECK-EMPTY:
	; CHECK-NEXT: "BRANCH-ON-MASK ir<%cmp>\l"\l			; CHECK-NEXT: <xVFxUF> pred.udiv: {
	; CHECK-NEXT: "CondBit: ir<%cmp>"			; CHECK-NEXT: pred.udiv.entry:
	; CHECK-NEXT: ]			; CHECK-NEXT: BRANCH-ON-MASK ir<%cmp>
	;			; CHECK-NEXT: Successor(s): pred.udiv.if pred.udiv.continue
	; CHECK: N4 [label =			; CHECK-NEXT: CondBit: ir<%cmp>
	; CHECK-NEXT: "pred.udiv.if:\n" +			; CHECK-EMPTY:
	; CHECK-NEXT: "REPLICATE ir<%tmp4> = udiv ir<%n>, ir<%i> (S->V)\l"			; CHECK-NEXT: pred.udiv.if:
	; CHECK-NEXT: ]			; CHECK-NEXT: REPLICATE ir<%tmp4> = udiv ir<%n>, ir<%i> (S->V)
	;			; CHECK-NEXT: Successor(s): pred.udiv.continue
	; CHECK: N5 [label =			; CHECK-EMPTY:
	; CHECK-NEXT: "pred.udiv.continue:\n" +			; CHECK-NEXT: pred.udiv.continue:
	; CHECK-NEXT: "PHI-PREDICATED-INSTRUCTION ir<%tmp4>\l"			; CHECK-NEXT: PHI-PREDICATED-INSTRUCTION ir<%tmp4>
	; CHECK-NEXT: ]			; CHECK-NEXT: Successor(s):
	;			; CHECK-NEXT: }
	; CHECK: N7 [label =			; CHECK-EMPTY:
	; CHECK-NEXT: "for.inc:\n" +			; CHECK-NEXT: if.then.0:
	; CHECK-NEXT: "EMIT vp<%4> = not ir<%cmp>\l" +			; CHECK-NEXT: Successor(s): for.inc
	; CHECK-NEXT: "BLEND %d = ir<0>/vp<%4> ir<%tmp4>/ir<%cmp>\l" +			; CHECK-EMPTY:
	; CHECK-NEXT: "CLONE ir<%idx> = getelementptr ir<%x>, ir<%i>\l" +			; CHECK-NEXT: for.inc:
	; CHECK-NEXT: "WIDEN store ir<%idx>, ir<%d>\l"			; CHECK-NEXT: EMIT vp<%4> = not ir<%cmp>
	; CHECK-NEXT: ]			; CHECK-NEXT: BLEND %d = ir<0>/vp<%4> ir<%tmp4>/ir<%cmp>
				; CHECK-NEXT: CLONE ir<%idx> = getelementptr ir<%x>, ir<%i>
				; CHECK-NEXT: WIDEN store ir<%idx>, ir<%d>
				; CHECK-NEXT: Successor(s):
				; CHECK-NEXT: }
	;			;
	entry:			entry:
	br label %for.body			br label %for.body

	for.body: ; preds = %for.inc, %entry			for.body: ; preds = %for.inc, %entry
	%i = phi i64 [ 0, %entry ], [ %i.next, %for.inc ]			%i = phi i64 [ 0, %entry ], [ %i.next, %for.inc ]
	%cmp = icmp ult i64 %i, 5			%cmp = icmp ult i64 %i, 5
	br i1 %cmp, label %if.then, label %for.inc			br i1 %cmp, label %if.then, label %for.inc
	Show All 18 Lines

llvm/unittests/Transforms/Vectorize/VPlanHCFGTest.cpp

	//===- llvm/unittest/Transforms/Vectorize/VPlanHCFGTest.cpp ---------------===//			//===- llvm/unittest/Transforms/Vectorize/VPlanHCFGTest.cpp ---------------===//
				Lint: Lint Inline Actions clang-format not found in user's PATH; not linting file. Lint: Lint: clang-format not found in user's PATH; not linting file.
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	▲ Show 20 Lines • Show All 89 Lines • ▼ Show 20 Lines
	graph [labelloc=t, fontsize=30; label="Vectorization Plan"]			graph [labelloc=t, fontsize=30; label="Vectorization Plan"]
	node [shape=rect, fontname=Courier, fontsize=30]			node [shape=rect, fontname=Courier, fontsize=30]
	edge [fontname=Courier, fontsize=30]			edge [fontname=Courier, fontsize=30]
	compound=true			compound=true
	subgraph cluster_N0 {			subgraph cluster_N0 {
	fontname=Courier			fontname=Courier
	label="\<x1\> TopRegion"			label="\<x1\> TopRegion"
	N1 [label =			N1 [label =
	"entry:\n"			"entry:\l" +
				"Successor(s): for.body\l"
	]			]
	N1 -> N2 [ label=""]			N1 -> N2 [ label=""]
	N2 [label =			N2 [label =
	"for.body:\n" +			"for.body:\l" +
	"EMIT ir<%indvars.iv> = phi ir<0> ir<%indvars.iv.next>\l" +			" EMIT ir\<%indvars.iv\> = phi ir\<0\> ir\<%indvars.iv.next\>\l" +
	"EMIT ir<%arr.idx> = getelementptr ir<%A> ir<%indvars.iv>\l" +			" EMIT ir\<%arr.idx\> = getelementptr ir\<%A\> ir\<%indvars.iv\>\l" +
	"EMIT ir<%l1> = load ir<%arr.idx>\l" +			" EMIT ir\<%l1\> = load ir\<%arr.idx\>\l" +
	"EMIT ir<%res> = add ir<%l1> ir<10>\l" +			" EMIT ir\<%res\> = add ir\<%l1\> ir\<10\>\l" +
	"EMIT store ir<%res> ir<%arr.idx>\l" +			" EMIT store ir\<%res\> ir\<%arr.idx\>\l" +
	"EMIT ir<%indvars.iv.next> = add ir<%indvars.iv> ir<1>\l" +			" EMIT ir\<%indvars.iv.next\> = add ir\<%indvars.iv\> ir\<1\>\l" +
	"EMIT ir<%exitcond> = icmp ir<%indvars.iv.next> ir<%N>\l" +			" EMIT ir\<%exitcond\> = icmp ir\<%indvars.iv.next\> ir\<%N\>\l" +
	"CondBit: ir<%exitcond> (for.body)\l"			"Successor(s): for.body for.end\l" +
				"CondBit: ir\<%exitcond\> (for.body)\l"
	]			]
	N2 -> N2 [ label="T"]			N2 -> N2 [ label="T"]
	N2 -> N3 [ label="F"]			N2 -> N3 [ label="F"]
	N3 [label =			N3 [label =
	"for.end:\n" +			"for.end:\l" +
	"EMIT ret\l"			" EMIT ret\l" +
				"Successor(s):\l"
	]			]
	}			}
	}			}
	)";			)";
	EXPECT_EQ(ExpectedStr, FullDump);			EXPECT_EQ(ExpectedStr, FullDump);

	LoopVectorizationLegality::InductionList Inductions;			LoopVectorizationLegality::InductionList Inductions;
	SmallPtrSet<Instruction *, 1> DeadInstructions;			SmallPtrSet<Instruction *, 1> DeadInstructions;
	▲ Show 20 Lines • Show All 56 Lines • Show Last 20 Lines

llvm/unittests/Transforms/Vectorize/VPlanTest.cpp

//===- llvm/unittests/Transforms/Vectorize/VPlanTest.cpp - VPlan tests ----===//		//===- llvm/unittests/Transforms/Vectorize/VPlanTest.cpp - VPlan tests ----===//
		Lint: Lint Inline Actions clang-format not found in user's PATH; not linting file. Lint: Lint: clang-format not found in user's PATH; not linting file.
//		//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
▲ Show 20 Lines • Show All 319 Lines • ▼ Show 20 Lines	TEST(VPBasicBlockTest, print) {
VPInstruction *I1 = new VPInstruction(Instruction::Add, {});		VPInstruction *I1 = new VPInstruction(Instruction::Add, {});
VPInstruction *I2 = new VPInstruction(Instruction::Sub, {I1});		VPInstruction *I2 = new VPInstruction(Instruction::Sub, {I1});
VPInstruction *I3 = new VPInstruction(Instruction::Br, {I1, I2});		VPInstruction *I3 = new VPInstruction(Instruction::Br, {I1, I2});

VPBasicBlock *VPBB1 = new VPBasicBlock();		VPBasicBlock *VPBB1 = new VPBasicBlock();
VPBB1->appendRecipe(I1);		VPBB1->appendRecipe(I1);
VPBB1->appendRecipe(I2);		VPBB1->appendRecipe(I2);
VPBB1->appendRecipe(I3);		VPBB1->appendRecipe(I3);
		VPBB1->setName("bb1");

VPInstruction *I4 = new VPInstruction(Instruction::Mul, {I2, I1});		VPInstruction *I4 = new VPInstruction(Instruction::Mul, {I2, I1});
VPInstruction *I5 = new VPInstruction(Instruction::Ret, {I4});		VPInstruction *I5 = new VPInstruction(Instruction::Ret, {I4});
VPBasicBlock *VPBB2 = new VPBasicBlock();		VPBasicBlock *VPBB2 = new VPBasicBlock();
VPBB2->appendRecipe(I4);		VPBB2->appendRecipe(I4);
VPBB2->appendRecipe(I5);		VPBB2->appendRecipe(I5);
		VPBB2->setName("bb2");

VPBlockUtils::connectBlocks(VPBB1, VPBB2);		VPBlockUtils::connectBlocks(VPBB1, VPBB2);

// Check printing an instruction without associated VPlan.		// Check printing an instruction without associated VPlan.
{		{
std::string I3Dump;		std::string I3Dump;
raw_string_ostream OS(I3Dump);		raw_string_ostream OS(I3Dump);
VPSlotTracker SlotTracker;		VPSlotTracker SlotTracker;
I3->print(OS, "", SlotTracker);		I3->print(OS, "", SlotTracker);
OS.flush();		OS.flush();
EXPECT_EQ("EMIT br <badref> <badref>", I3Dump);		EXPECT_EQ("EMIT br <badref> <badref>", I3Dump);
}		}

VPlan Plan;		VPlan Plan;
Plan.setEntry(VPBB1);		Plan.setEntry(VPBB1);
std::string FullDump;		std::string FullDump;
raw_string_ostream(FullDump) << Plan;		raw_string_ostream(FullDump) << Plan;

const char *ExpectedStr = R"(digraph VPlan {		const char *ExpectedStr = R"(digraph VPlan {
graph [labelloc=t, fontsize=30; label="Vectorization Plan"]		graph [labelloc=t, fontsize=30; label="Vectorization Plan"]
node [shape=rect, fontname=Courier, fontsize=30]		node [shape=rect, fontname=Courier, fontsize=30]
edge [fontname=Courier, fontsize=30]		edge [fontname=Courier, fontsize=30]
compound=true		compound=true
N0 [label =		N0 [label =
":\n" +		"bb1:\l" +
"EMIT vp<%0> = add\l" +		" EMIT vp\<%0\> = add\l" +
"EMIT vp<%1> = sub vp<%0>\l" +		" EMIT vp\<%1\> = sub vp\<%0\>\l" +
"EMIT br vp<%0> vp<%1>\l"		" EMIT br vp\<%0\> vp\<%1\>\l" +
		"Successor(s): bb2\l"
]		]
N0 -> N1 [ label=""]		N0 -> N1 [ label=""]
N1 [label =		N1 [label =
":\n" +		"bb2:\l" +
"EMIT vp<%3> = mul vp<%1> vp<%0>\l" +		" EMIT vp\<%3\> = mul vp\<%1\> vp\<%0\>\l" +
"EMIT ret vp<%3>\l"		" EMIT ret vp\<%3\>\l" +
		"Successor(s):\l"
]		]
}		}
)";		)";
EXPECT_EQ(ExpectedStr, FullDump);		EXPECT_EQ(ExpectedStr, FullDump);

		const char *ExpectedBlock1Str = R"(bb1:
		EMIT vp<%0> = add
		EMIT vp<%1> = sub vp<%0>
		EMIT br vp<%0> vp<%1>
		Successor(s): bb2
		)";
		std::string Block1Dump;
		raw_string_ostream OS1(Block1Dump);
		VPBB1->print(OS1);
		EXPECT_EQ(ExpectedBlock1Str, Block1Dump);


		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - Lint: Pre-merge checks: clang-format: please reformat the code ``` - ```
		// Ensure that numbering is good when dumping the second block in isolation.
		const char *ExpectedBlock2Str = R"(bb2:
		EMIT vp<%3> = mul vp<%1> vp<%0>
		EMIT ret vp<%3>
		Successor(s):
		)";
		std::string Block2Dump;
		raw_string_ostream OS2(Block2Dump);
		VPBB2->print(OS2);
		EXPECT_EQ(ExpectedBlock2Str, Block2Dump);

{		{
std::string I3Dump;		std::string I3Dump;
raw_string_ostream OS(I3Dump);		raw_string_ostream OS(I3Dump);
VPSlotTracker SlotTracker(&Plan);		VPSlotTracker SlotTracker(&Plan);
I3->print(OS, "", SlotTracker);		I3->print(OS, "", SlotTracker);
OS.flush();		OS.flush();
EXPECT_EQ("EMIT br vp<%0> vp<%1>", I3Dump);		EXPECT_EQ("EMIT br vp<%0> vp<%1>", I3Dump);
}		}
▲ Show 20 Lines • Show All 322 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[VPlan] Add plain text (not DOT's digraph) dumpsClosedPublic

Details

Diff Detail

Unit TestsFailed

Event Timeline

Revision Contents

Diff 323452

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp

llvm/lib/Transforms/Vectorize/VPlan.h

llvm/lib/Transforms/Vectorize/VPlan.cpp

llvm/test/Transforms/LoopVectorize/icmp-uniforms.ll

llvm/test/Transforms/LoopVectorize/vplan-printing.ll

llvm/unittests/Transforms/Vectorize/VPlanHCFGTest.cpp

llvm/unittests/Transforms/Vectorize/VPlanTest.cpp

[VPlan] Add plain text (not DOT's digraph) dumps
ClosedPublic