This is an archive of the discontinued LLVM Phabricator instance.

Differential D88380

[VPlan] Extend VPValue to also model sub- & 'virtual' values.
Abandoned (Public)

Authored by fhahn on Sep 27 2020, 9:16 AM.

Details

Reviewers

Ayal
rengolin
gilr

Summary

This patch extends VPValue to model 3 different kinds of VPValues:

1. Concrete VPValues
   Concrete VPValues are either live-ins coming from IR or instructions/recipes in VPlan which produce a single value. They are the most common kind.
2. Sub VPValues
   Sub-VPValues are result values from instructions/recipes in VPlan that produce multiple values. They contain a reference to the producing 'virtual' VPValue.
3. Virtual VPValues
   Virtual VPValues are used to model instructions/recipes that either produce multiple sub-values or no values at all. A virtual VPValue does not refer to a concrete value, which means it cannot be used like concrete or sub-values. For example, they cannot be used as operands. They can be used to traverse the def-use chains upwards. They also provide convenient access to all users of all sub-values of the producer.

Most existing recipes will be concrete VPValues (e.g. VPInstruction,
VPWidenRecipe & so on). Sub-VPValues can be used to model multiple result
values for VPInterleaveRecipe. VPInterleaveRecipe itself is a 'virtual'
VPValue, which allows for convenient traversal of the def-use chains.
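
To make the three kinds more concrete, here is a minimal standalone C++ sketch. It is not the code from this patch: `ValueKind`, `SketchValue`, `addSubValue`, `addUse` and `allUsers` are hypothetical names chosen for illustration, and ownership and SubclassIDs are deliberately left out.

```cpp
#include <cassert>
#include <vector>

// Hypothetical, simplified model of the three kinds described in the summary.
enum class ValueKind { Concrete, Sub, Virtual };

struct SketchValue {
  ValueKind Kind;
  SketchValue *Producer = nullptr;      // Sub only: the producing virtual value.
  std::vector<SketchValue *> SubValues; // Virtual only: the results it produces.
  std::vector<SketchValue *> Users;     // Values using this one as an operand.

  explicit SketchValue(ValueKind K) : Kind(K) {}

  // A virtual value does not stand for a concrete result, so it cannot be
  // used as an operand.
  bool canBeOperand() const { return Kind != ValueKind::Virtual; }
};

// Register Sub as one of the results produced by Virtual.
inline void addSubValue(SketchValue &Virtual, SketchValue &Sub) {
  assert(Virtual.Kind == ValueKind::Virtual && Sub.Kind == ValueKind::Sub);
  Sub.Producer = &Virtual;
  Virtual.SubValues.push_back(&Sub);
}

// Record that User takes Operand as an operand.
inline void addUse(SketchValue &Operand, SketchValue &User) {
  assert(Operand.canBeOperand() && "virtual values cannot be operands");
  Operand.Users.push_back(&User);
}

// Collect the users of a value together with the users of all its sub-values.
inline std::vector<SketchValue *> allUsers(const SketchValue &V) {
  std::vector<SketchValue *> Result = V.Users;
  for (const SketchValue *Sub : V.SubValues)
    Result.insert(Result.end(), Sub->Users.begin(), Sub->Users.end());
  return Result;
}
```

In this sketch an interleaved load would be one `Virtual` value with one `Sub` value per group member; walking upwards goes through `Producer`, and `allUsers` gathers the users of every member in one call.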

The main advantage of handling everything in a VPValue over introducing
a new sub-class for sub-values (D87752) is that it slightly simplifies
things further down the road, by turning VPRecipeBase itself directly
into a VPValue, which removes the duplicated VPRecipeBaseID.

Following on this patch, we could make VPRecipeBase inherit from VPValue (D88379)
and VPUser (D88378), which should allow full traversal. Note that for the last 2 patches,
we should probably migrate the remaining recipes to manage operands using VPUser and
turn them into VPValues individually.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

fhahn created this revision. Sep 27 2020, 9:16 AM

Herald added a project: Restricted Project. Sep 27 2020, 9:16 AM

Herald added subscribers: psnobl, rogfer01, bollu, hiraditya.

fhahn requested review of this revision. Sep 27 2020, 9:16 AM

Herald added a subscriber: vkmr. Sep 27 2020, 9:16 AM

Harbormaster completed remote builds in B73089: Diff 294555. Sep 27 2020, 9:17 AM

fhahn edited the summary of this revision. Sep 27 2020, 9:28 AM

fhahn mentioned this in D84679: [VPlan] Disconnect VPValue and VPUser. Sep 27 2020, 9:33 AM

dmgreen mentioned this in D84684: [VPlan] Use VPValue def for VPInterleaveRecipe. Oct 2 2020, 9:09 AM

rebase

Harbormaster completed remote builds in B73997: Diff 296181. Oct 5 2020, 7:40 AM

fhahn added a child revision: D84684: [VPlan] Use VPValue def for VPInterleaveRecipe. Oct 5 2020, 8:41 AM

dmgreen added a subscriber: dmgreen. Oct 6 2020, 7:01 AM

dmgreen added inline comments.

llvm/lib/Transforms/Vectorize/VPlanValue.h
55	"no values at all" could be concrete VPValues, just with void return type.
116	When would VPVirtualValueSC be used?
117	Would a VPMultiValueSC ever be used, or would it always be a VPInterleaveSC? (Or whatever other recipe type it became)
144–145	The defining recipe would be another user of the value? That sounds like it would complicate the number of uses. When would it be useful to store this in both places?
203	Could this function just be checking for a new type of SubclassID?

fhahn added inline comments. Oct 6 2020, 8:09 AM

llvm/lib/Transforms/Vectorize/VPlanValue.h
55	If we have recipes that exclusively have void return types, 'virtual' values could indeed be used. The problem with the current recipes unfortunately is that for example `VPWidenMemoryInstructionRecipe` combines both load & store instructions and they share the same SubclassID. Same for others, like `VPWidenCall`. Not sure if we would create plain VPValues with void return type in practice.
116	This is only used for testing. Alternatively I could just keep VPInterleaveSC in this patch and use that instead.
117	This is left over from a previous version. I'll remove it.
122	This should be part of D84684
144–145	"The defining recipe would be another user of the value?" As is, yes. The main advantage is that this would allow clients to traverse the def-use chains without necessarily needing to account for 'virtual' values directly. A cleaner alternative would be to account for that when the client asks for the list of users and combine the users of all sub-values for virtual values on demand.

Streamline handling of underlying value/defining VPValue, use union & checking the SubclassID instead of using PointerSumType. Should be much simpler now.

Harbormaster completed remote builds in B74155: Diff 296489. Oct 6 2020, 9:49 AM

I went ahead and also implemented an alternative approach that is along the lines of the first iteration, using VPMultiDef. Instead of extending VPValue it introduces a new VPDef class that mirrors VPUser but for defined values. This means VPValue and other parts end up being simpler, at the cost that VPValues cannot be dyn_cast to recipes directly. Instead, you have to get the VPDef for a VPValue first and cast that to a recipe. Overall this approach seems a bit simpler than making VPValue more complicated as in this series.

The interesting patches are D90558, D90564, D90562 and D90565 (which changes VPInstruction to use VPDef and gives an idea of what changes are required to operate on VPDef).
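
As a rough illustration of the VPDef idea described above, the following standalone sketch shows a def object that owns a vector of defined values, with each value pointing back at its def. All names here (`SketchDef`, `SketchValue`, `SketchInterleaveRecipe`, `getDefiningInterleave`) are hypothetical and do not come from D90558, and plain `dynamic_cast` stands in for LLVM's `dyn_cast`.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <vector>

struct SketchDef;

// A value only knows which def (if any) produced it.
struct SketchValue {
  SketchDef *Def = nullptr; // nullptr models a live-in with no defining recipe.
  SketchDef *getDef() const { return Def; }
};

// Mirrors a user's operand list, but for results: zero, one or more values.
struct SketchDef {
  std::vector<std::unique_ptr<SketchValue>> Defined;

  virtual ~SketchDef() = default;

  SketchValue *addDefinedValue() {
    Defined.push_back(std::make_unique<SketchValue>());
    Defined.back()->Def = this;
    return Defined.back().get();
  }

  std::size_t getNumDefinedValues() const { return Defined.size(); }

  SketchValue *getDefinedValue(std::size_t I) const {
    assert(I < Defined.size() && "defined value index out of range");
    return Defined[I].get();
  }
};

// A multi-def recipe: one defined value per member of the group.
struct SketchInterleaveRecipe : SketchDef {
  explicit SketchInterleaveRecipe(std::size_t Factor) {
    for (std::size_t I = 0; I < Factor; ++I)
      addDefinedValue();
  }
};

// Two-step walk upwards: value -> def -> concrete recipe type.
// Returns nullptr for live-ins and for values defined by other recipe kinds.
inline SketchInterleaveRecipe *getDefiningInterleave(const SketchValue &V) {
  return dynamic_cast<SketchInterleaveRecipe *>(V.getDef());
}
```

The cost mentioned in the comment shows up in `getDefiningInterleave`: a value is never cast to a recipe directly, the def has to be fetched first.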

Hello Florian. Sorry for the delay. I believe that the design of something is the most important part to get right - and the hardest part to change. I've been trying to take a look at these and the other new patches but I'm not sure I understand yet, likely because I have not had to deal with the complexities it is trying to address. I think my head-canon is simpler than yours, perhaps too simple! It mostly just thinks of things the same as IR instructions, which I like because it's familiar and dependable.

As far as I understand, correct me where I'm wrong:

Most instructions produce a single value. These are simple enough to deal with, we use a VPValue.
Most instructions also consume values, for which we use VPUser.
The complexity comes from interleaving groups which might need to deal with multiple values.
Interleaving stores do not produce multiple values, but do have multiple operands (?)
Interleaving loads do need to produce multiple values, in some way.
- They can either use something like this built into the VPValue class (where I kept getting stuck on having two different type systems in VPValue).
- They can have the Def/MultiDef from D90558 et al. (which looks cleaner than the first version I was trying to look at last week, but still involves a lot of little nodes).
- They could hold a vector of "Members" that are VPUsers and join those together with operands. The complexity gets moved into the VPInterleaveRecipe class and nothing is needed in VPValue/etc.

As you might have guessed, I still quite like the last option. But that might well just be because there are some things about it that I haven't had to deal with. I like how it mirrors the llvm-ir though; to me it seems simple and familiar.

Some other random questions:

Do you know how Store Interleaving recipes should be handled?
Do you think that vplan nodes will eventually need types? (From looking at some things I think the answer is probably yes).
Should we split VPInterleaveRecipe (and maybe VPWidenMemoryInstructionRecipe) into different load and store recipes?

In D88380#2381997, @dmgreen wrote:

Hello Florian. Sorry for the delay. I believe that the design of something is the most important part to get right - and the hardest part to change. I've been trying to take a look at these and the other new patches but I'm not sure I understand yet, likely because I have not had to deal with the complexities it is trying to address. I think my head-canon is simpler than yours, perhaps too simple! It mostly just thinks of things the same as IR instructions, which I like because it's familiar and dependable.

Thanks for taking a look. I realize those are a lot of changes. I think the points below are spot on, I tried to expand on a few.

As far as I understand, correct me where I'm wrong:

Most instructions produce a single value. These are simple enough to deal with, we use a VPValue.

Most instructions also consume values, for which we use VPUser.

The complexity comes from interleaving groups which might need to deal with multiple values.

Yes, interleave groups are the only example in the current code. But there very likely will be additional ones in the future. Another example that Ayal mentioned some time ago is modeling sincos, for which various vector libraries like Accelerate (https://developer.apple.com/documentation/accelerate/vforce/3241297-sincos) or Intel's SVML provide tuned implementations that return separate vectors with the sin and cos results.
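
For readers unfamiliar with the sincos shape, here is a small standalone sketch of an operation that takes one input vector and yields two result vectors. `sinCos` is a hypothetical helper written for this note; it is not the Accelerate or SVML API and makes no attempt at being a tuned implementation.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical helper: a single call that defines two results, the sines
// (first) and the cosines (second) of the inputs.
inline std::pair<std::vector<float>, std::vector<float>>
sinCos(const std::vector<float> &X) {
  std::vector<float> Sin(X.size()), Cos(X.size());
  for (std::size_t I = 0; I < X.size(); ++I) {
    Sin[I] = std::sin(X[I]);
    Cos[I] = std::cos(X[I]);
  }
  assert(Sin.size() == Cos.size());
  return {std::move(Sin), std::move(Cos)};
}
```

Each of the two returned vectors can have its own users, which is the property a multi-def recipe has to model.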

One benefit of modeling the fact that we can have recipes that define multiple values is that VPlan-based analyses/transformations need to, and can, account for this scenario in general. This would hopefully ensure the code is written in a way that safely handles future additions of 'multi-defs'. If we just special-case the interleave case, that likely will not hold, and introducing new multi-defs in the future will be more painful.

Also there's a trend towards more specialized vector instruction sets and I expect more specialized candidates for multi-defs to pop up there as well. Unfortunately some of those are not public, but I think modeling the multi-def case in general should ensure VPlan can be used for such specialized cases downstream without too much pain in the future.

Interleaving stores do not produce multiple values, but do have multiple operands (?)

Yes, some recipes like VPWidenMemoryInstructionRecipe, VPWidenCallRecipe or VPInterleaveRecipe may not actually define a VPValue, depending on the contents. That makes turning them into VPValues a little awkward, so those are likely candidates to break down further in the future (see response further below as well).

Interleaving loads do need to produce multiple values, in some way.

They can either use something like this built into the VPValue class (which I kept getting stuck on having two different type systems in VPValue).

They can have the Def/MultiDef from D90558 et al. (which looks cleaner than the first version I was trying to look at last week, but still involves a lot of little nodes).

They could hold a vector of "Members" that are VPUsers and join those together with operands. The complexity gets moved into the VPInterleaveRecipe class and nothing is needed in VPValue/etc

IIUC this is the same idea as the VPDef approach (D90558), where VPDef adds such a vector, but in a general fashion so we do not need to special case VPInterleaveRecipe. I think this vector has to hold VPValues in some form. I am not entirely sure how the VPUser fits into this vector, as I think the whole interleave group shares a single address as operand.

Note that D90558 & co should be simpler than the originally proposed MultiDef, which was spread out over multiple classes. The latest patches essentially just add the VPDef class, which just contains a vector of 'defined' VPValues. In the end, all recipes should inherit from it and VPDef can be folded directly into VPRecipeBase. So it should boil down to adding a single vector to VPRecipeBase.

It also requires a change to VPValue to allow getting the VPDef (effectively the recipe) that defines the value. I think there's no way around that, even if we just change VPInterleaveRecipe to keep a vector of VPValues it defines. To walk upwards the def-use chains, we need to be able to get the recipe/VPDef that defines a VPValue. In the single def cases, we can just cast the VPValue directly to the corresponding recipe, but that's not possible for VPValues defined by a 'multi-def'. Once we have to do that, I think we might as well add a general way to express that a recipe can define multiple values.

As you might have guessed, I still quite like the last option. But that might well just be because there are some things about it that I haven't had to deal with. I like how it mirrors the llvm-ir though; to me it seems simple and familiar.

Some other random questions:

Do you know how Store Interleaving recipes should be handled?

(See below)

Do you think that vplan nodes will eventually need types? (From looking at some things I think the answer is probably yes).

Once we synthesize more VPInstructions or recipes that are not directly tied to the original IR, I think we will need a way to attach types to nodes for cost-modeling and code-generation. Currently we get away without types, because we mostly rely on the types of the underlying IR. Once all of code-gen is updated to work on VPValues I think we can start to think/work on this transition.

Should we split VPInterleaveRecipe (and maybe VPWidenMemoryInstructionRecipe) into different load and store recipes?

At the moment I think the current break-down mirrors the existing code-generation functions in LV (e.g. vectorizeMemoryInstruction, vectorizeInterleaveGroup). I think it might make sense to break down/split up some of those recipes further in the future, as we move towards migrating the code-gen code, VP cost-modeling & transforms.

Something like this is what I meant, to try and make it more concrete (this is just meant as pseudo code):

/// VPInterleaveRecipe is a recipe for transforming an interleave group of loads
/// or stores into one wide load/store and shuffles.
class VPInterleaveRecipe : public VPRecipeBase {
  const InterleaveGroup<Instruction> *IG;
  // One extract node per group member; each one is a user of this recipe.
  SmallVector<std::unique_ptr<VPUser>, 4> Members;

public:
  VPInterleaveRecipe(const InterleaveGroup<Instruction> *IG, VPValue *Addr,
                     VPValue *Mask)
      : VPRecipeBase(VPInterleaveSC, nullptr, {Addr}), IG(IG) {
    if (Mask)
      addOperand(Mask);
    // FIXME: Only loads actually need this, but stores need something better for
    // detecting Mask operands.
    for (unsigned i = 0; i < IG->getFactor(); i++)
      Members.emplace_back(
          new VPUser(VPValue::VPExtractSC, IG->getMember(i), {this}));
  }
  ~VPInterleaveRecipe() override = default;

  /// Return the extract node that stands for member Idx of the group.
  VPValue *getExtract(int Idx) const {
    return Members[Idx].get();
  }
};

The Members are the Values that the other recipes are connected to. They also become uses of the VPInterleaveRecipe (hence they are VPUsers, which glues together a def-use graph). Everything else is pretty standard, VPRecipeBase inherits from VPUser which inherits from VPValue.
I feel like either you are making modifications that depend upon def-uses, in which case you are altering VPInterleaveRecipe directly and you probably know how to deal with its uses, or you are doing something like a replaceAllUsesWith on one of those "Members", which should then just work. There might be other things I'm not thinking about though, and I haven't tried to implement anything particularly on top of this scheme (other than the vmulh code, which did not have to deal with interleaving groups specifically, other than they might be a leaf node).
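
The claim that replaceAllUsesWith on a "Member" just works can be tried out on a toy graph. The sketch below is standalone and hypothetical: `SketchNode` and `SketchInterleave` are stand-ins for a combined value/user node and for the recipe with extract members, not the VPlan classes.

```cpp
#include <algorithm>
#include <cassert>
#include <memory>
#include <vector>

// Every node is both a value and a user, as in LLVM IR.
struct SketchNode {
  std::vector<SketchNode *> Operands;
  std::vector<SketchNode *> Users;

  virtual ~SketchNode() = default;

  void addOperand(SketchNode *Op) {
    Operands.push_back(Op);
    Op->Users.push_back(this);
  }

  // Redirect every user of this node to New.
  void replaceAllUsesWith(SketchNode *New) {
    assert(New != this && "cannot replace a node with itself");
    for (SketchNode *U : Users) {
      std::replace(U->Operands.begin(), U->Operands.end(), this, New);
      New->Users.push_back(U);
    }
    Users.clear();
  }
};

// The recipe owns one extract node per member; each extract uses the recipe.
struct SketchInterleave : SketchNode {
  std::vector<std::unique_ptr<SketchNode>> Members;

  SketchInterleave(SketchNode *Addr, unsigned Factor) {
    addOperand(Addr);
    for (unsigned I = 0; I < Factor; ++I) {
      Members.push_back(std::make_unique<SketchNode>());
      Members.back()->addOperand(this);
    }
  }

  SketchNode *getExtract(unsigned Idx) const {
    assert(Idx < Members.size() && "member index out of range");
    return Members[Idx].get();
  }
};
```

In this model the recipe's own `Users` list contains only its extract nodes, so code that wants the real consumers of the group has to look one level further, through each member.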

I think that method would be simpler and involve fewer things like creating new VPValues in the constructors of VPRecipes and storing the extra vectors for the defs. It also mirrors the llvm-ir, which like I said I'm a fan of because of its familiarity. Take a think about it. Maybe try implementing something like the vmulh code on top of the VPDef code. (Or.. maybe something that actually does something with multi-node recipes).

If you think it's still the best way to go, I'll happily review those patches. So long as we have thought through the options.

In D88380#2383596, @dmgreen wrote:

... snip ..

The Members are the Values that the other recipes are connected to. They also become uses of the VPInterleaveRecipe (hence they are VPUsers, which glues together a def-use graph). Everything else is pretty standard, VPRecipeBase inherits from VPUser which inherits from VPValue.
I feel like either you are making modifications that depend upon def-uses, in which case you are altering VPInterleaveRecipe directly and you probably know how to deal with its uses, or you are doing something like a replaceAllUsesWith on one of those "Members", which should then just work. There might be other things I'm not thinking about though, and I haven't tried to implement anything particularly on top of this scheme (other than the vmulh code, which did not have to deal with interleaving groups specifically, other than they might be a leaf node).

Ah yes, I missed that this was the proposal with the VPInstruction extract nodes!

I think that method would be simpler and involve fewer things like creating new VPValues in the constructors of VPRecipes and storing the extra vectors for the defs. It also mirrors the llvm-ir, which like I said I'm a fan of because of its familiarity.

Creating the VPValues in all the constructors for the recipes in earlier versions of the patches was indeed suboptimal & ugly. In the latest version of the patch, things have improved though, I think. Now all recipes except VPInterleaveRecipe keep inheriting from VPValue and the registration happens as part of the constructor.

Take a think about it. Maybe try implementing something like the vmulh code on top of the VPDef code. (Or.. maybe something that actually does something with multi-node recipes).

I spent some time porting the VMULH patch (D88152) on top of the VPDef system: D91198 (this excludes the cost-model & VPInstruction::execute changes, but they should not matter).

The only part that needed slight adjustments was the recursivelyDeleteUnusedRecipes implementation, which mostly boils down to iterating over all defined values when checking if the value is dead. I think code dealing only with single-def recipes should work unchanged.

void VPRecipeBase::recursivelyDeleteUnusedRecipes() {
  if (all_of(defined_values(), [](VPValue *Def) {
        return Def->getNumUsers() == 0;
      }) /* && isSafeToRemove()*/) {
    for (auto *Op : operands()) {
      Op->removeUser(*this);
      if (VPRecipeBase *R = dyn_cast<VPRecipeBase>(Op->getDef()))
        R->recursivelyDeleteUnusedRecipes();
    }
    eraseFromParent();
  }
}
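
The same dead-code check can be exercised outside of LLVM with a small standalone model. This is only a sketch under simplifying assumptions: `SketchRecipe` and `SketchValue` are hypothetical, users are tracked as a plain count, erasing is modeled with an `Erased` flag, and, like the snippet above with `isSafeToRemove()` commented out, a recipe that defines no values counts as dead.

```cpp
#include <algorithm>
#include <cassert>
#include <memory>
#include <vector>

struct SketchRecipe;

struct SketchValue {
  SketchRecipe *Def = nullptr; // Defining recipe; nullptr for live-ins.
  unsigned NumUsers = 0;
};

struct SketchRecipe {
  std::vector<std::unique_ptr<SketchValue>> Defs; // Zero, one or more results.
  std::vector<SketchValue *> Operands;
  bool Erased = false; // Stands in for eraseFromParent().

  SketchValue *addDef() {
    Defs.push_back(std::make_unique<SketchValue>());
    Defs.back()->Def = this;
    return Defs.back().get();
  }

  void addOperand(SketchValue *V) {
    Operands.push_back(V);
    ++V->NumUsers;
  }

  // A multi-def recipe is dead only if every value it defines is unused.
  bool isDead() const {
    return std::all_of(Defs.begin(), Defs.end(),
                       [](const std::unique_ptr<SketchValue> &D) {
                         return D->NumUsers == 0;
                       });
  }

  void recursivelyDeleteUnused() {
    if (Erased || !isDead())
      return;
    Erased = true;
    for (SketchValue *Op : Operands) {
      assert(Op->NumUsers > 0 && "operand must have this recipe as a user");
      --Op->NumUsers;
      if (Op->Def)
        Op->Def->recursivelyDeleteUnused();
    }
    Operands.clear();
  }
};
```

With a two-result load feeding two separate recipes, deleting one consumer leaves the load alive; it is erased only once the second consumer is gone as well.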

If you think it's still the best way to go, I'll happily review those patches. So long as we have thought through the options.

I think the ergonomics when dealing with single-def recipes is the same with all options.

The main advantage I see of modeling multi-defs is that we
(1) can avoid artificial/dummy VPValues; conceptually I think making VPInterleaveRecipe inherit from VPValue is not the right fit, it doesn't produce a value (granted, currently this is broken by stores and loads being handled in the same recipe, but that's easy to change);
(2) have a uniform way to define & handle multi-defs.

IMO it is quite hard to gauge up-front if the extra complexity is really warranted for a cleaner modeling, but given that the overhead seems minor (to me) I think it would be preferable to go with the cleaner and uniform modeling. It would probably also be good to hear @Ayal's or @gilr's thoughts on this.

Some things that might be slightly harder when using extra VPInstructions to extract from a multi-def are moving or predicating multi-def recipes.

Extending the recipe abstraction to support def/use relations is an important next step forward for VPlan, thanks for pushing this momentum forward!

Supporting multiple VPValues defined by a single recipe is relevant for VPInterleaveRecipe and to promote idioms such as sincos, add.with.overflow, addsub and potentially more. These are important to recognize and model during vectorization due to their associated atomic costs(*), and to facilitate more advanced "Loop Mixed" vectorization opportunities, where interleave groups form leaves and roots of "Loop aware" SLP trees (CGO'16).
While this multi-valued concept admittedly diverges from LLVM-IR's Instruction-is-a-User-is-a-single-Value, it is consistent with functions returning multiple values in MLIR. Furthermore, VPlan aims to model a region of code with live-ins and live-outs - typically a single loop-nest within a function. A VPValue that is generated by a recipe inside the region and used outside the region, could be represented by feeding a VPUser similar to in-region users; such a live-out VPUser however is out-of-scope, does not correspond to any recipe, and is therefore not a VPValue. Analogously, a VPUser of a recipe inside the region may have an operand from outside the region, which could be represented by a VPValue similar to in-region values; such a live-in VPValue however is out-of-scope, does not correspond to any recipe, and is therefore not a VPUser. The concept of VPDef to hold the zero, one or more VPValues defined by a recipe is analogous to VPUser which holds the operands of a recipe, as proposed by D90558. If/as all live-ins are single values, i.e., all multi-value VPDefs correspond to recipes, recipe and VPDef could be merged into one.

(*) Explicit extract VPInstructions could also be used following a wide VPInstruction load, once introduced into VPlan; but doing so effectively breaks the interleaved-load idiom along with its associated atomic cost.

OK sounds great. I did not know that MLIR could represent multiple values, that's good to see.

I still honestly like a more "llvm" Value/User model more, but perhaps as I read the patches I will learn to like Defs too.

In D88380#2393587, @dmgreen wrote:

OK sounds great. I did not know that MLIR could represent multiple values, that's good to see.

I still honestly like a more "llvm" Value/User model more, but perhaps as I read the patches I will learn to like Defs too.

Related references: https://mlir.llvm.org/docs/LangRef/#operations and the last "Traversing the def-use chains" section at the bottom of https://mlir.llvm.org/docs/Tutorials/UnderstandingTheIRStructure/ - hopefully this looks intuitive enough.

Perhaps VPDef should be renamed VPResults to clarify this correspondence, although other entities also have distinct names: VPUser/Operands and VPRecipeBase/Operation.

dcaballe added a subscriber: dcaballe. Nov 13 2020, 2:39 PM

Just looking around! Glad to see VPlan moving forward! :)

llvm/lib/Transforms/Vectorize/VPlanValue.h
52	I'm trying to understand what 'Sub' refers to here. Not sure I can easily relate it to 'result'. Is this kind of VPValue intended to be used for something else in the future? Would "Result VPValue" or similar make it clearer?
59	So a Virtual VPValue is basically what an Operation is in MLIR, right?
196	This, indeed, looks pretty much like MLIR API :)

Thanks for all the feedback. I'd propose to continue the discussion at the VPDef proposal (D90558). I'll abandon this one now, to avoid any confusion.

In D88380#2395190, @dcaballe wrote:

Just looking around! Glad to see VPlan moving forward! :)

Thanks for taking a look! We moved on to a slightly different implementation of the same ideas in D90558.

llvm/lib/Transforms/Vectorize/VPlanValue.h
196	Glad things align there :) We moved on to a slightly different implementation which represents the defined values of an operation through a dedicated VPDef class, without the need to complicate the VPValue hierarchy.

Revision Contents

Path                                                Size
llvm/docs/Proposals/VectorizationPlan.rst           16 lines
llvm/lib/Transforms/Vectorize/VPlanValue.h          94 lines
llvm/unittests/Transforms/Vectorize/VPlanTest.cpp   49 lines

Diff 296489

llvm/docs/Proposals/VectorizationPlan.rst

:VPRecipeBase: (earlier lines not shown)
may specify how its ingredients are to be transformed to produce the output IR		may specify how its ingredients are to be transformed to produce the output IR
instructions; e.g., cloned once, replicated multiple times or widened		instructions; e.g., cloned once, replicated multiple times or widened
according to selected VF.		according to selected VF.

:VPValue:		:VPValue:
The base of VPlan's def-use relations class hierarchy. When instantiated, it		The base of VPlan's def-use relations class hierarchy. When instantiated, it
models a constant or a live-in Value in VPlan. It has users, which are of type		models a constant or a live-in Value in VPlan. It has users, which are of type
VPUser, but no operands.		VPUser, but no operands.
		There are 3 different kinds of VPValues:
		1. Concrete VPValues
		Concrete VPValues are either live-ins coming from IR or instructions/
		recipes in VPlan which produce a single value. They are the most common
		kind.
		2. Sub VPValues
		Sub-VPValues are result values from instructions/recipes in VPlan that
		produce multiple values. They contain a reference to the producing
		'virtual' VPValue.
		3. Virtual VPValues
		Virtual VPValues are used to model instructions/recipes that either produce
		multiple subvalues or no values at all. A virtual VPValue does not refer to
		a concrete value, which means it cannot be used like concrete or subvalues.
		For example, they cannot be used as operands. They can be used to traverse
		the def-use chains upwards. They also provide convenient access to all
		users of all sub-values of the producer.

:VPUser:		:VPUser:
A VPUser represents an entity that uses a number of VPValues as operands.		A VPUser represents an entity that uses a number of VPValues as operands.
VPUser is similar in some aspects to LLVM's User class.		VPUser is similar in some aspects to LLVM's User class.

:VPInstruction:		:VPInstruction:
A VPInstruction is both a VPRecipe and a VPUser. It models a single		A VPInstruction is both a VPRecipe and a VPUser. It models a single
VPlan-level instruction to be generated if the VPlan is executed, including		VPlan-level instruction to be generated if the VPlan is executed, including

llvm/lib/Transforms/Vectorize/VPlanValue.h

/// These are documented in docs/VectorizationPlan.rst.		/// These are documented in docs/VectorizationPlan.rst.
///		///
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef LLVM_TRANSFORMS_VECTORIZE_VPLAN_VALUE_H		#ifndef LLVM_TRANSFORMS_VECTORIZE_VPLAN_VALUE_H
#define LLVM_TRANSFORMS_VECTORIZE_VPLAN_VALUE_H		#define LLVM_TRANSFORMS_VECTORIZE_VPLAN_VALUE_H

#include "llvm/ADT/DenseMap.h"		#include "llvm/ADT/DenseMap.h"
		#include "llvm/ADT/PointerSumType.h"
#include "llvm/ADT/SmallVector.h"		#include "llvm/ADT/SmallVector.h"
#include "llvm/ADT/iterator_range.h"		#include "llvm/ADT/iterator_range.h"

namespace llvm {		namespace llvm {

// Forward declarations.		// Forward declarations.
class raw_ostream;		class raw_ostream;
class Value;		class Value;
class VPSlotTracker;		class VPSlotTracker;
class VPUser;		class VPUser;
class VPRecipeBase;		class VPRecipeBase;

		class VPValue;

// This is the base class of the VPlan Def/Use graph, used for modeling the data		// This is the base class of the VPlan Def/Use graph, used for modeling the data
// flow into, within and out of the VPlan. VPValues can stand for live-ins		// flow into, within and out of the VPlan. VPValues can stand for live-ins
// coming from the input IR, instructions which VPlan will generate if executed		// coming from the input IR, instructions which VPlan will generate if executed
// and live-outs which the VPlan will need to fix accordingly.		// and live-outs which the VPlan will need to fix accordingly.
		//
		// There are 3 different kinds of VPValues:
		// 1. Concrete VPValues
		// Concrete VPValues are either live-ins coming from IR or instructions/
		// recipes in VPlan which produce a single value. They are the most common
		// kind.
		// 2. Sub VPValues
		// Sub-VPValues are result values from instructions/recipes in VPlan that
		// produce multiple values. They contain a reference to the producing
		// 'virtual' VPValue.
		// 3. Virtual VPValues
		// Virtual VPValues are used to model instructions/recipes that either produce
		// multiple subvalues or no values at all. A virtual VPValue does not refer to
		// a concrete value, which means it cannot be used like concrete or subvalues.
		// For example, they cannot be used as operands. They can be used to traverse
		// the def-use chains upwards. They also provide convenient access to all
		// users of all sub-values of the producer.
class VPValue {		class VPValue {
friend class VPBuilder;		friend class VPBuilder;
friend struct VPlanTransforms;		friend struct VPlanTransforms;
friend class VPBasicBlock;		friend class VPBasicBlock;
friend class VPInterleavedAccessInfo;		friend class VPInterleavedAccessInfo;
friend class VPSlotTracker;		friend class VPSlotTracker;
friend class VPRecipeBase;		friend class VPRecipeBase;

const unsigned char SubclassID; ///< Subclass identifier (for isa/dyn_cast).		const unsigned char SubclassID; ///< Subclass identifier (for isa/dyn_cast).

SmallVector<VPUser *, 1> Users;		SmallVector<VPUser *, 1> Users;

protected:		protected:
// Hold the underlying Value, if any, attached to this VPValue.		/// Hold the underlying Value, if any, attached to this VPValue for concrete
Value *UnderlyingVal;		/// VPValues or a pointer to the producing/defining VPValue.
		union {
		Value *UnderlyingValue;
		VPValue *DefiningValue;
		};

VPValue(const unsigned char SC, Value *UV = nullptr)		VPValue(const unsigned char SC, Value *UV = nullptr)
: SubclassID(SC), UnderlyingVal(UV) {}		: SubclassID(SC), UnderlyingValue(UV) {}

// DESIGN PRINCIPLE: Access to the underlying IR must be strictly limited to		// DESIGN PRINCIPLE: Access to the underlying IR must be strictly limited to
// the front-end and back-end of VPlan so that the middle-end is as		// the front-end and back-end of VPlan so that the middle-end is as
// independent as possible of the underlying IR. We grant access to the		// independent as possible of the underlying IR. We grant access to the
// underlying IR using friendship. In that way, we should be able to use VPlan		// underlying IR using friendship. In that way, we should be able to use VPlan
// for multiple underlying IRs (Polly?) by providing a new VPlan front-end,		// for multiple underlying IRs (Polly?) by providing a new VPlan front-end,
// back-end and analysis information for the new IR.		// back-end and analysis information for the new IR.

/// Return the underlying Value attached to this VPValue.		/// Return the underlying Value attached to this VPValue.
Value *getUnderlyingValue() { return UnderlyingVal; }		Value *getUnderlyingValue() {
const Value *getUnderlyingValue() const { return UnderlyingVal; }		assert(isConcrete() &&
		"can only get the underlying value of a concrete VPValue");
		return UnderlyingValue;
		}
		const Value *getUnderlyingValue() const {
		assert(isConcrete() &&
		"can only get the underlying value of a concrete VPValue");
		return UnderlyingValue;
		}

// Set \p Val as the underlying Value of this VPValue.		// Set \p Val as the underlying Value of this VPValue.
void setUnderlyingValue(Value *Val) {		void setUnderlyingValue(Value *Val) {
assert(!UnderlyingVal && "Underlying Value is already set.");		assert(isConcrete() && !getUnderlyingValue() &&
UnderlyingVal = Val;		"Underlying Value is already set.");
		UnderlyingValue = Val;
}		}

public:		public:
/// An enumeration for keeping track of the concrete subclass of VPValue that		/// An enumeration for keeping track of the concrete subclass of VPValue that
/// are actually instantiated. Values of this enumeration are kept in the		/// are actually instantiated. Values of this enumeration are kept in the
/// SubclassID field of the VPValue objects. They are used for concrete		/// SubclassID field of the VPValue objects. They are used for concrete
/// type identification.		/// type identification.
enum {		enum {
VPValueSC,		VPValueSC,
		VPVSubValueSC,
		dmgreen: When would VPVirtualValueSC be used?
		fhahn (author): This is only used for testing. Alternatively, I could just keep VPInterleaveSC in this patch and use that instead.
VPInstructionSC,		VPInstructionSC,
		dmgreen: Would a VPMultiValueSC ever be used, or would it always be a VPVInterleaveSC? (Or whatever other recipe type it became)
		fhahn (author): This is left over from a previous version. I'll remove it.
VPMemoryInstructionSC,		VPMemoryInstructionSC,
VPVWidenCallSC,		VPVWidenCallSC,
VPVWidenSelectSC,		VPVWidenSelectSC,
VPVWidenGEPSC		VPVWidenGEPSC,
		VPVInterleaveSC
		fhahn (author): This should be part of D84684.
};		};

VPValue(Value *UV = nullptr) : VPValue(VPValueSC, UV) {}		VPValue(Value *UV = nullptr) : VPValue(VPValueSC, UV) {}
		VPValue(VPValue *Base) : SubclassID(VPVSubValueSC), DefiningValue(Base) {}
VPValue(const VPValue &) = delete;		VPValue(const VPValue &) = delete;
VPValue &operator=(const VPValue &) = delete;		VPValue &operator=(const VPValue &) = delete;

/// \return an ID for the concrete type of this object.		/// \return an ID for the concrete type of this object.
/// This is used to implement the classof checks. This should not be used		/// This is used to implement the classof checks. This should not be used
/// for any other purpose, as the values may change as LLVM evolves.		/// for any other purpose, as the values may change as LLVM evolves.
unsigned getVPValueID() const { return SubclassID; }		unsigned getVPValueID() const { return SubclassID; }

void printAsOperand(raw_ostream &OS, VPSlotTracker &Tracker) const;		void printAsOperand(raw_ostream &OS, VPSlotTracker &Tracker) const;
void print(raw_ostream &OS, VPSlotTracker &Tracker) const;		void print(raw_ostream &OS, VPSlotTracker &Tracker) const;

/// Dump the value to stderr (for debugging).		/// Dump the value to stderr (for debugging).
void dump() const;		void dump() const;

unsigned getNumUsers() const { return Users.size(); }		unsigned getNumUsers() const { return Users.size(); }
void addUser(VPUser &User) { Users.push_back(&User); }		void addUser(VPUser &User) {
		Users.push_back(&User);
		if (isSubValue())
		getDefiningValue()->Users.push_back(&User);
		dmgreen: The defining recipe would be another user of the value? That sounds like it would complicate the number of uses. When would it be useful to store this in both places?
		fhahn (author): "The defining recipe would be another user of the value?" As is, yes. The main advantage is that this would allow clients to traverse the def-use chains without necessarily needing to account for 'virtual' values directly. A cleaner alternative would be to account for that when the client asks for the list of users, and to combine the users of all sub-values for virtual values on demand.
		}

/// Remove a single \p User from the list of users.		/// Remove a single \p User from the list of users.
void removeUser(VPUser &User) {		void removeUser(VPUser &User) {
bool Found = false;		bool Found = false;
// The same user can be added multiple times, e.g. because the same VPValue		// The same user can be added multiple times, e.g. because the same VPValue
// is used twice by the same VPUser. Remove a single one.		// is used twice by the same VPUser. Remove a single one.
erase_if(Users, [&User, &Found](VPUser *Other) {		erase_if(Users, [&User, &Found](VPUser *Other) {
if (Found)		if (Found)
return false;		return false;
if (Other == &User) {		if (Other == &User) {
Found = true;		Found = true;
return true;		return true;
}		}
return false;		return false;
});		});

		// For sub-values, also update the users of the defining value.
		if (isSubValue())
		getDefiningValue()->removeUser(User);
}		}

typedef SmallVectorImpl<VPUser *>::iterator user_iterator;		typedef SmallVectorImpl<VPUser *>::iterator user_iterator;
typedef SmallVectorImpl<VPUser *>::const_iterator const_user_iterator;		typedef SmallVectorImpl<VPUser *>::const_iterator const_user_iterator;
typedef iterator_range<user_iterator> user_range;		typedef iterator_range<user_iterator> user_range;
typedef iterator_range<const_user_iterator> const_user_range;		typedef iterator_range<const_user_iterator> const_user_range;

user_iterator user_begin() { return Users.begin(); }		user_iterator user_begin() { return Users.begin(); }
[13 lines hidden]	bool hasMoreThanOneUniqueUser() {
// Check if all users match the first user.		// Check if all users match the first user.
auto Current = std::next(user_begin());		auto Current = std::next(user_begin());
while (Current != user_end() && user_begin() == Current)		while (Current != user_end() && user_begin() == Current)
Current++;		Current++;
return Current != user_end();		return Current != user_end();
}		}

void replaceAllUsesWith(VPValue *New);		void replaceAllUsesWith(VPValue *New);

		VPValue *getDefiningValue() {
		dcaballe: This, indeed, looks pretty much like MLIR API :)
		fhahn (author): Glad things align there :) We moved on to a slightly different implementation which represents the defined values of an operation through a dedicated VPDef class, without the need to complicate the VPValue hierarchy.
		assert(isSubValue() && "can only get defining value of sub-value");
		return DefiningValue;
		}
		VPValue const *getDefiningValue() const {
		assert(isSubValue() && "can only get defining value of sub-value");
		return DefiningValue;
		}
		dmgreen: Could this function just be checking for a new type of SubclassID?

		bool isConcrete() const { return !isSubValue() && !isVirtual(); }

		bool isSubValue() const { return getVPValueID() == VPVSubValueSC; }

		bool isVirtual() const { return getVPValueID() == VPVInterleaveSC; }
};		};
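
Note (illustrative sketch, not part of the patch): the three kinds of values described in the header comment, and the way `addUser` mirrors each user of a sub-value onto its producer, can be modelled in a few lines of standalone C++. The names `SketchValue`, `IRValue`, `User` and `VirtualTag` are hypothetical stand-ins chosen for this sketch; they are not LLVM classes, and the sketch uses an explicit kind tag instead of the patch's `SubclassID`.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-ins for llvm::Value and VPUser; not the LLVM classes.
struct IRValue {};
struct User {};

class SketchValue {
public:
  enum class Kind { Concrete, Sub, Virtual };
  struct VirtualTag {};

  // Concrete value wrapping an (optional) IR value.
  explicit SketchValue(IRValue *UV) : K(Kind::Concrete), Underlying(UV) {}
  // Sub-value: one of several results produced by Def.
  explicit SketchValue(SketchValue *Def) : K(Kind::Sub), Defining(Def) {}
  // Virtual value: a producer of several results, or of none.
  explicit SketchValue(VirtualTag) : K(Kind::Virtual), Underlying(nullptr) {}

  SketchValue(const SketchValue &) = delete;
  SketchValue &operator=(const SketchValue &) = delete;

  bool isConcrete() const { return K == Kind::Concrete; }
  bool isSubValue() const { return K == Kind::Sub; }
  bool isVirtual() const { return K == Kind::Virtual; }

  IRValue *getUnderlyingValue() const {
    assert(isConcrete() && "only concrete values wrap an IR value");
    return Underlying;
  }
  SketchValue *getDefiningValue() const {
    assert(isSubValue() && "only sub-values have a defining value");
    return Defining;
  }

  // Record U on this value and, for sub-values, on the producer as well.
  void addUser(User &U) {
    Users.push_back(&U);
    if (isSubValue())
      Defining->Users.push_back(&U);
  }
  std::size_t getNumUsers() const { return Users.size(); }

private:
  Kind K;
  // Only one pointer is meaningful at a time, selected by K.
  union {
    IRValue *Underlying;   // concrete values (null for virtual values)
    SketchValue *Defining; // sub-values
  };
  std::vector<User *> Users;
};
```

Because every user of a sub-value is also pushed onto the producer, a user that consumes two results of the same producer appears twice in the producer's list, which is the duplication dmgreen asks about in the inline comment on `addUser`.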

		// Check size assertions made in the PointerLikeTypeTraits specialization for
		// VPValue *.
		static_assert(detail::ConstantLog2<alignof(VPValue)>::value >= 2,
		"alignment of VPValue is smaller than assumed above");

typedef DenseMap<Value *, VPValue *> Value2VPValueTy;		typedef DenseMap<Value *, VPValue *> Value2VPValueTy;
typedef DenseMap<VPValue *, Value *> VPValue2ValueTy;		typedef DenseMap<VPValue *, Value *> VPValue2ValueTy;

raw_ostream &operator<<(raw_ostream &OS, const VPValue &V);		raw_ostream &operator<<(raw_ostream &OS, const VPValue &V);
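
Note (illustrative sketch, not part of the patch): the user-list bookkeeping in `VPValue` relies on two small operations, removing exactly one occurrence of a user and checking whether the list names more than one distinct user. The standalone version below uses `std::vector` and a hypothetical `User` type instead of `SmallVector` and `VPUser`. One deliberate difference from the loop shown in the diff: the sketch compares the stored user pointers rather than the iterators themselves, which appears to be what the comment "Check if all users match the first user" intends. The sketch also returns `false` for an empty list; whether the patch handles that case is not visible in the lines shown here.

```cpp
#include <algorithm>
#include <iterator>
#include <vector>

// Hypothetical stand-in for VPUser.
struct User {};

// Erase only the first occurrence of U; further duplicates stay in place,
// since the same user may legitimately be registered more than once.
inline void removeOneUser(std::vector<User *> &Users, User &U) {
  auto It = std::find(Users.begin(), Users.end(), &U);
  if (It != Users.end())
    Users.erase(It);
}

// Return true if the list names at least two distinct users.
inline bool hasMoreThanOneUniqueUser(const std::vector<User *> &Users) {
  if (Users.empty())
    return false;
  User *First = Users.front();
  return std::any_of(std::next(Users.begin()), Users.end(),
                     [First](const User *Other) { return Other != First; });
}
```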

/// This class augments VPValue with operands which provide the inverse def-use		/// This class augments VPValue with operands which provide the inverse def-use
/// edges from VPValue's users to their defs.		/// edges from VPValue's users to their defs.
class VPUser {		class VPUser {
SmallVector<VPValue *, 2> Operands;		SmallVector<VPValue *, 2> Operands;

public:		public:
VPUser() {}		VPUser() {}
VPUser(ArrayRef<VPValue *> Operands) {		VPUser(ArrayRef<VPValue *> Operands) {
for (VPValue *Operand : Operands)		for (VPValue *Operand : Operands) {
		assert(!Operand->isVirtual() &&
		"cannot use a virtual VPValue as operand");
addOperand(Operand);		addOperand(Operand);
}		}
		}

VPUser(std::initializer_list<VPValue *> Operands)		VPUser(std::initializer_list<VPValue *> Operands)
: VPUser(ArrayRef<VPValue *>(Operands)) {}		: VPUser(ArrayRef<VPValue *>(Operands)) {}
template <typename IterT> VPUser(iterator_range<IterT> Operands) {		template <typename IterT> VPUser(iterator_range<IterT> Operands) {
for (VPValue *Operand : Operands)		for (VPValue *Operand : Operands) {
		assert(!Operand->isVirtual() &&
		"cannot use a virtual VPValue as operand");
addOperand(Operand);		addOperand(Operand);
}		}
		}

VPUser(const VPUser &) = delete;		VPUser(const VPUser &) = delete;
VPUser &operator=(const VPUser &) = delete;		VPUser &operator=(const VPUser &) = delete;
virtual ~VPUser() {		virtual ~VPUser() {
for (VPValue *Op : operands())		for (VPValue *Op : operands())
Op->removeUser(*this);		Op->removeUser(*this);
}		}

void addOperand(VPValue *Operand) {		void addOperand(VPValue *Operand) {
		assert(!Operand->isVirtual() && "cannot use a virtual VPValue as operand");
Operands.push_back(Operand);		Operands.push_back(Operand);
Operand->addUser(*this);		Operand->addUser(*this);
}		}

unsigned getNumOperands() const { return Operands.size(); }		unsigned getNumOperands() const { return Operands.size(); }
inline VPValue *getOperand(unsigned N) const {		inline VPValue *getOperand(unsigned N) const {
assert(N < Operands.size() && "Operand index out of bounds");		assert(N < Operands.size() && "Operand index out of bounds");
return Operands[N];		return Operands[N];
}		}

void setOperand(unsigned I, VPValue *New) {		void setOperand(unsigned I, VPValue *New) {
		assert(!New->isVirtual() && "cannot use a virtual VPValue as operand");
Operands[I]->removeUser(*this);		Operands[I]->removeUser(*this);
Operands[I] = New;		Operands[I] = New;
New->addUser(*this);		New->addUser(*this);
}		}

typedef SmallVectorImpl<VPValue *>::iterator operand_iterator;		typedef SmallVectorImpl<VPValue *>::iterator operand_iterator;
typedef SmallVectorImpl<VPValue *>::const_iterator const_operand_iterator;		typedef SmallVectorImpl<VPValue *>::const_iterator const_operand_iterator;
typedef iterator_range<operand_iterator> operand_range;		typedef iterator_range<operand_iterator> operand_range;
[49 lines hidden]

llvm/unittests/Transforms/Vectorize/VPlanTest.cpp

[515 lines hidden]	TEST(VPRecipeTest, CastVPWidenMemoryInstructionRecipeToVPUser) {
VPWidenMemoryInstructionRecipe Recipe(*Load, &Addr, &Mask);		VPWidenMemoryInstructionRecipe Recipe(*Load, &Addr, &Mask);
EXPECT_TRUE(isa<VPUser>(&Recipe));		EXPECT_TRUE(isa<VPUser>(&Recipe));
VPRecipeBase *BaseR = &Recipe;		VPRecipeBase *BaseR = &Recipe;
EXPECT_TRUE(isa<VPUser>(BaseR));		EXPECT_TRUE(isa<VPUser>(BaseR));
EXPECT_EQ(&Recipe, BaseR->toVPUser());		EXPECT_EQ(&Recipe, BaseR->toVPUser());
delete Load;		delete Load;
}		}

		struct VPTestMultiValueDef : public VPUser, public VPValue {
		SmallVector<VPValue *, 4> Defs;

		VPTestMultiValueDef() : VPValue(VPValue::VPVInterleaveSC) {}
		};

		TEST(VPMultiValueTest, traverseUseLists) {
		// Check that the def-use chains of a multi-def can be traversed in both
		// directions.

		// Create a multi-value def which defines 2 values.
		VPTestMultiValueDef MultiDef;
		MultiDef.Defs.push_back(new VPValue(&MultiDef));
		MultiDef.Defs.push_back(new VPValue(&MultiDef));

		VPInstruction I1(1, {MultiDef.Defs[0], MultiDef.Defs[1]});
		VPInstruction I2(2, {MultiDef.Defs[0]});
		VPInstruction I3(3, {MultiDef.Defs[1]});

		// Check that we can get all users of all definitions in the multi-value.
		SmallVector<VPUser *, 4> MultiDefUsers(MultiDef.user_begin(),
		MultiDef.user_end());
		// Note: Currently users may contain duplicates!
		EXPECT_EQ(4u, MultiDefUsers.size());
		EXPECT_EQ(&I1, MultiDefUsers[0]);
		EXPECT_EQ(&I1, MultiDefUsers[1]);
		EXPECT_EQ(&I2, MultiDefUsers[2]);
		EXPECT_EQ(&I3, MultiDefUsers[3]);

		SmallVector<VPUser *, 4> MultiDefV0Users(MultiDef.Defs[0]->user_begin(),
		MultiDef.Defs[0]->user_end());
		EXPECT_EQ(2u, MultiDefV0Users.size());
		EXPECT_EQ(&I1, MultiDefV0Users[0]);
		EXPECT_EQ(&I2, MultiDefV0Users[1]);

		SmallVector<VPUser *, 4> MultiDefV1Users(MultiDef.Defs[1]->user_begin(),
		MultiDef.Defs[1]->user_end());
		EXPECT_EQ(2u, MultiDefV1Users.size());
		EXPECT_EQ(&I1, MultiDefV1Users[0]);
		EXPECT_EQ(&I3, MultiDefV1Users[1]);

		// Now check that we can get the right defining value for each multi-value
		// handed out.
		EXPECT_EQ(&MultiDef, I1.getOperand(0)->getDefiningValue());
		EXPECT_EQ(&MultiDef, I1.getOperand(1)->getDefiningValue());
		EXPECT_EQ(&MultiDef, I2.getOperand(0)->getDefiningValue());
		EXPECT_EQ(&MultiDef, I3.getOperand(0)->getDefiningValue());
		}

} // namespace		} // namespace
} // namespace llvm		} // namespace llvm
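
Note (illustrative sketch, not part of the patch): in the inline discussion on `addUser`, fhahn mentions a cleaner alternative in which the producer does not store a second copy of each user but combines the users of all of its sub-values when asked. A minimal standalone version of that idea might look as follows. `MultiDef`, `ResultValue`, `addResult` and `allUsers` are hypothetical names for this sketch; this is not the `VPDef` class mentioned in the review, and the choice to de-duplicate the combined list is an assumption made here.

```cpp
#include <algorithm>
#include <memory>
#include <vector>

// Hypothetical stand-in for VPUser.
struct User {};
struct MultiDef;

// One result of a multi-result definition; it tracks only its own users.
struct ResultValue {
  MultiDef *Def;
  std::vector<User *> Users;

  explicit ResultValue(MultiDef *D) : Def(D) {}
  void addUser(User &U) { Users.push_back(&U); }
};

// A definition producing several results; it stores no user list of its own.
struct MultiDef {
  std::vector<std::unique_ptr<ResultValue>> Results;

  ResultValue *addResult() {
    Results.push_back(std::make_unique<ResultValue>(this));
    return Results.back().get();
  }

  // Combine the users of all results on demand, keeping first-seen order
  // and dropping duplicates.
  std::vector<User *> allUsers() const {
    std::vector<User *> All;
    for (const auto &R : Results)
      for (User *U : R->Users)
        if (std::find(All.begin(), All.end(), U) == All.end())
          All.push_back(U);
    return All;
  }
};
```

With the same use pattern as `traverseUseLists` above (one user consuming both results, plus one user per result), `allUsers()` in this sketch yields three entries instead of the four that the test expects from the stored list, and `removeUser` no longer needs to update two lists.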