This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/Transforms/Utils/
-
llvm/
-
Transforms/
-
Utils/
-
LoopUtils.h
-
lib/Transforms/Utils/
-
Transforms/
-
Utils/
4/11
LoopUtils.cpp

Differential D149731

[IR] New function llvm::createMinMaxSelectCmpOp for creating min/max operation in select-cmp form
Needs ReviewPublic

Authored by Mel-Chen on May 3 2023, 1:05 AM.

Download Raw Diff

Details

Reviewers

nikic
fhahn
RKSimon
ABataev
vdmitrie

Summary

This patch preserves the ability to represent min max operations in select-cmp form. This provides flexibility in choosing between intrinsic or select-cmp form for generating min max operations, depending on the optimization requirements.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

Mel-Chen created this revision.May 3 2023, 1:05 AM

Herald added a project: Restricted Project. · View Herald TranscriptMay 3 2023, 1:05 AM

Herald added subscribers: hoy, hiraditya. · View Herald Transcript

Mel-Chen requested review of this revision.May 3 2023, 1:05 AM

Herald added a project: Restricted Project. · View Herald TranscriptMay 3 2023, 1:05 AM

Herald added a subscriber: llvm-commits. · View Herald Transcript

xbolva00 added a reviewer: nikic.May 3 2023, 1:07 AM

Herald added a subscriber: StephenFan. · View Herald TranscriptMay 3 2023, 1:07 AM

xbolva00 added a subscriber: xbolva00.May 3 2023, 1:08 AM

xbolva00 added inline comments.

llvm/lib/Transforms/Utils/LoopUtils.cpp
943	We have intrinsics for min and max. Do not emit cmp select form..
952	Why we dont use intrinsic in all cases?

xbolva00 added a reviewer: fhahn.May 3 2023, 1:09 AM

Mel-Chen mentioned this in D143465: [LoopVectorize] Vectorize the reduction pattern of integer min/max with index..May 3 2023, 1:14 AM

Mel-Chen added a child revision: D143465: [LoopVectorize] Vectorize the reduction pattern of integer min/max with index..May 3 2023, 1:15 AM

Mel-Chen added reviewers: RKSimon, ABataev, vdmitrie.May 3 2023, 1:19 AM

Harbormaster completed remote builds in B229628: Diff 518998.May 3 2023, 2:09 AM

Mel-Chen added inline comments.May 3 2023, 2:17 AM

llvm/lib/Transforms/Utils/LoopUtils.cpp
943	I am aware that we can use intrinsic to represent min max operations, but this patch aims to preserve the ability to express min max operations in select-cmp form. This is necessary for the vectorization feature I am currently developing. It is important to emphasize that this patch does not enforce the use of select-cmp form for min max operations, but rather provides an additional option.
952	Refer to D148221. My understanding is that for fp min max there are still some FMF issues to be handled. @RKSimon

nikic added inline comments.May 3 2023, 2:20 AM

llvm/lib/Transforms/Utils/LoopUtils.cpp
943	Why does your patch require the non-canonical select-cmp form? Please explain this in the patch description.

xbolva00 added inline comments.May 3 2023, 2:25 AM

llvm/lib/Transforms/Utils/LoopUtils.cpp
943	This is necessary for the vectorization feature I am currently developing. Can you please share so we can check it and possibly suggest some tips?

Update commit log.

Mel-Chen edited the summary of this revision. (Show Details)May 3 2023, 3:36 AM

Mel-Chen added inline comments.May 3 2023, 4:03 AM

llvm/lib/Transforms/Utils/LoopUtils.cpp

943

@nikic @xbolva00 Sure. Here is the WIP patch: D143465
In short, we are developing a new reduction pattern, min max with index, for the vectorizer. In the function fixReduction process for interleaving, we need to generate the following IR in middle.block:

middle.block:                                     ; preds = %vector.body
  ;; Start to fix minmax reduction
  %rdx.minmax.cmp = icmp sgt i64 %12, %13
  %rdx.minmax.select = select i1 %rdx.minmax.cmp, i64 %12, i64 %13
  %rdx.minmax.cmp8 = icmp sgt i64 %rdx.minmax.select, %14
  %rdx.minmax.select9 = select i1 %rdx.minmax.cmp8, i64 %rdx.minmax.select, i64 %14
  %rdx.minmax.cmp10 = icmp sgt i64 %rdx.minmax.select9, %15
  %rdx.minmax.select11 = select i1 %rdx.minmax.cmp10, i64 %rdx.minmax.select9, i64 %15
   ;; Start to fix index reduction
  %rdx.select = select i1 %rdx.minmax.cmp, i64 %20, i64 %21
  %rdx.select12 = select i1 %rdx.minmax.cmp8, i64 %rdx.select, i64 %22
  %rdx.select13 = select i1 %rdx.minmax.cmp10, i64 %rdx.select12, i64 %23

For the handling of index reduction, we require the cmp part of the min max operation. This is the reason why I need this patch.

Harbormaster completed remote builds in B229650: Diff 519026.May 3 2023, 4:30 AM

RKSimon added inline comments.May 4 2023, 5:31 AM

llvm/lib/Transforms/Utils/LoopUtils.cpp
952	maxnum/minnum reductions are sensitive to ordering if the values might be nan

fhahn added inline comments.May 30 2023, 8:40 AM

llvm/lib/Transforms/Utils/LoopUtils.cpp

943

To clarify, is the sequence below could be expressed using intrinsics,

%rdx.minmax.cmp = icmp sgt i64 %12, %13
%rdx.minmax.select = select i1 %rdx.minmax.cmp, i64 %12, i64 %13
%rdx.minmax.cmp8 = icmp sgt i64 %rdx.minmax.select, %14
%rdx.minmax.select9 = select i1 %rdx.minmax.cmp8, i64 %rdx.minmax.select, i64 %14
%rdx.minmax.cmp10 = icmp sgt i64 %rdx.minmax.select9, %15
%rdx.minmax.select11 = select i1 %rdx.minmax.cmp10, i64 %rdx.minmax.select9, i64 %15

but you would like to re-use the compares from above for the selcets below:

 ;; Start to fix index reduction
%rdx.select = select i1 %rdx.minmax.cmp, i64 %20, i64 %21
%rdx.select12 = select i1 %rdx.minmax.cmp8, i64 %rdx.select, i64 %22
%rdx.select13 = select i1 %rdx.minmax.cmp10, i64 %rdx.select12, i64 %23

If that's the case it would probably be better to just keep the min/max select generation code local to the code that generates the 2nd chain of selects, instead of exposing this as general API

943

but you would like to re-use the compares from above for the selcets below:

@Mel-Chen is the comment above an accurate summary?

Mel-Chen added inline comments.Jun 5 2023, 2:05 AM

llvm/lib/Transforms/Utils/LoopUtils.cpp
943	@fhahn Exactly, that's correct. I also believe there are better ways to address it, such as changing to generate the `CmpInst` during fixing index reduction, or generating an additional `CmpInst` during fixing min max reduction. Once I fix the issue you described, this patch will be abandoned. However, until it is fixed, we will temporarily rely on this patch.

Revision Contents

Path

Size

llvm/

include/

llvm/

Transforms/

Utils/

LoopUtils.h

7 lines

lib/

Transforms/

Utils/

LoopUtils.cpp

13 lines

Diff 518998

llvm/include/llvm/Transforms/Utils/LoopUtils.h

	Show First 20 Lines • Show All 358 Lines • ▼ Show 20 Lines
	/// pattern we are trying to match. In this pattern we are only ever selecting			/// pattern we are trying to match. In this pattern we are only ever selecting
	/// between two values: 1) an initial PHI start value, and 2) a loop invariant			/// between two values: 1) an initial PHI start value, and 2) a loop invariant
	/// value. This function uses \p LoopExitInst to determine 2), which we then use			/// value. This function uses \p LoopExitInst to determine 2), which we then use
	/// to select between \p Left and \p Right. Any lane value in \p Left that			/// to select between \p Left and \p Right. Any lane value in \p Left that
	/// matches 2) will be merged into \p Right.			/// matches 2) will be merged into \p Right.
	Value createSelectCmpOp(IRBuilderBase &Builder, Value StartVal, RecurKind RK,			Value createSelectCmpOp(IRBuilderBase &Builder, Value StartVal, RecurKind RK,
	Value Left, Value Right);			Value Left, Value Right);

				/// Returns a Min/Max operation in select-cmp form corresponding to
				/// MinMaxRecurrenceKind.
				/// Select(Cmp(strict min max predicate, Left, Right), Left, Right)
				/// The Builder's fast-math-flags must be set to propagate the expected values.
				Value *createMinMaxSelectCmpOp(IRBuilderBase &Builder, RecurKind RK,
				Value Left, Value Right);

	/// Returns a Min/Max operation corresponding to MinMaxRecurrenceKind.			/// Returns a Min/Max operation corresponding to MinMaxRecurrenceKind.
	/// The Builder's fast-math-flags must be set to propagate the expected values.			/// The Builder's fast-math-flags must be set to propagate the expected values.
	Value createMinMaxOp(IRBuilderBase &Builder, RecurKind RK, Value Left,			Value createMinMaxOp(IRBuilderBase &Builder, RecurKind RK, Value Left,
	Value *Right);			Value *Right);

	/// Generates an ordered vector reduction using extracts to reduce the value.			/// Generates an ordered vector reduction using extracts to reduce the value.
	Value getOrderedReduction(IRBuilderBase &Builder, Value Acc, Value *Src,			Value getOrderedReduction(IRBuilderBase &Builder, Value Acc, Value *Src,
	unsigned Op, RecurKind MinMaxKind = RecurKind::None);			unsigned Op, RecurKind MinMaxKind = RecurKind::None);
	▲ Show 20 Lines • Show All 187 Lines • Show Last 20 Lines

llvm/lib/Transforms/Utils/LoopUtils.cpp

Show First 20 Lines • Show All 934 Lines • ▼ Show 20 Lines	Value llvm::createSelectCmpOp(IRBuilderBase &Builder, Value StartVal,
RecurKind RK, Value Left, Value Right) {		RecurKind RK, Value Left, Value Right) {
if (auto VTy = dyn_cast<VectorType>(Left->getType()))		if (auto VTy = dyn_cast<VectorType>(Left->getType()))
StartVal = Builder.CreateVectorSplat(VTy->getElementCount(), StartVal);		StartVal = Builder.CreateVectorSplat(VTy->getElementCount(), StartVal);
Value *Cmp =		Value *Cmp =
Builder.CreateCmp(CmpInst::ICMP_NE, Left, StartVal, "rdx.select.cmp");		Builder.CreateCmp(CmpInst::ICMP_NE, Left, StartVal, "rdx.select.cmp");
return Builder.CreateSelect(Cmp, Left, Right, "rdx.select");		return Builder.CreateSelect(Cmp, Left, Right, "rdx.select");
}		}

		Value *llvm::createMinMaxSelectCmpOp(IRBuilderBase &Builder, RecurKind RK,
		xbolva00Unsubmitted Not Done Reply Inline Actions We have intrinsics for min and max. Do not emit cmp select form.. xbolva00: We have intrinsics for min and max. Do not emit cmp select form..
		Mel-ChenAuthorUnsubmitted Done Reply Inline Actions I am aware that we can use intrinsic to represent min max operations, but this patch aims to preserve the ability to express min max operations in select-cmp form. This is necessary for the vectorization feature I am currently developing. It is important to emphasize that this patch does not enforce the use of select-cmp form for min max operations, but rather provides an additional option. Mel-Chen: I am aware that we can use intrinsic to represent min max operations, but this patch aims to…
		nikicUnsubmitted Not Done Reply Inline Actions Why does your patch require the non-canonical select-cmp form? Please explain this in the patch description. nikic: Why does your patch require the non-canonical select-cmp form? Please explain this in the patch…
		xbolva00Unsubmitted Not Done Reply Inline Actions This is necessary for the vectorization feature I am currently developing. Can you please share so we can check it and possibly suggest some tips? xbolva00: >> This is necessary for the vectorization feature I am currently developing. Can you please…
		Mel-ChenAuthorUnsubmitted Done Reply Inline Actions @nikic @xbolva00 Sure. Here is the WIP patch: D143465 In short, we are developing a new reduction pattern, min max with index, for the vectorizer. In the function fixReduction process for interleaving, we need to generate the following IR in middle.block: middle.block: ; preds = %vector.body ;; Start to fix minmax reduction %rdx.minmax.cmp = icmp sgt i64 %12, %13 %rdx.minmax.select = select i1 %rdx.minmax.cmp, i64 %12, i64 %13 %rdx.minmax.cmp8 = icmp sgt i64 %rdx.minmax.select, %14 %rdx.minmax.select9 = select i1 %rdx.minmax.cmp8, i64 %rdx.minmax.select, i64 %14 %rdx.minmax.cmp10 = icmp sgt i64 %rdx.minmax.select9, %15 %rdx.minmax.select11 = select i1 %rdx.minmax.cmp10, i64 %rdx.minmax.select9, i64 %15 ;; Start to fix index reduction %rdx.select = select i1 %rdx.minmax.cmp, i64 %20, i64 %21 %rdx.select12 = select i1 %rdx.minmax.cmp8, i64 %rdx.select, i64 %22 %rdx.select13 = select i1 %rdx.minmax.cmp10, i64 %rdx.select12, i64 %23 For the handling of index reduction, we require the cmp part of the min max operation. This is the reason why I need this patch. Mel-Chen: @nikic @xbolva00 Sure. Here is the WIP patch: [[ https://reviews.llvm.org/D143465 \| D143465 ]]…
		fhahnUnsubmitted Not Done Reply Inline Actions To clarify, is the sequence below could be expressed using intrinsics, %rdx.minmax.cmp = icmp sgt i64 %12, %13 %rdx.minmax.select = select i1 %rdx.minmax.cmp, i64 %12, i64 %13 %rdx.minmax.cmp8 = icmp sgt i64 %rdx.minmax.select, %14 %rdx.minmax.select9 = select i1 %rdx.minmax.cmp8, i64 %rdx.minmax.select, i64 %14 %rdx.minmax.cmp10 = icmp sgt i64 %rdx.minmax.select9, %15 %rdx.minmax.select11 = select i1 %rdx.minmax.cmp10, i64 %rdx.minmax.select9, i64 %15 but you would like to re-use the compares from above for the selcets below: ;; Start to fix index reduction %rdx.select = select i1 %rdx.minmax.cmp, i64 %20, i64 %21 %rdx.select12 = select i1 %rdx.minmax.cmp8, i64 %rdx.select, i64 %22 %rdx.select13 = select i1 %rdx.minmax.cmp10, i64 %rdx.select12, i64 %23 If that's the case it would probably be better to just keep the min/max select generation code local to the code that generates the 2nd chain of selects, instead of exposing this as general API fhahn: To clarify, is the sequence below could be expressed using intrinsics, ``` %rdx.minmax.cmp =…
		fhahnUnsubmitted Not Done Reply Inline Actions but you would like to re-use the compares from above for the selcets below: @Mel-Chen is the comment above an accurate summary? fhahn: > but you would like to re-use the compares from above for the selcets below: @Mel-Chen is the…
		Mel-ChenAuthorUnsubmitted Done Reply Inline Actions @fhahn Exactly, that's correct. I also believe there are better ways to address it, such as changing to generate the `CmpInst` during fixing index reduction, or generating an additional `CmpInst` during fixing min max reduction. Once I fix the issue you described, this patch will be abandoned. However, until it is fixed, we will temporarily rely on this patch. Mel-Chen: @fhahn Exactly, that's correct. I also believe there are better ways to address it, such as…
		Value Left, Value Right) {
		CmpInst::Predicate Pred = getMinMaxReductionPredicate(RK);
		Value *Cmp = Builder.CreateCmp(Pred, Left, Right, "rdx.minmax.cmp");
		Value *Select = Builder.CreateSelect(Cmp, Left, Right, "rdx.minmax.select");
		return Select;
		}

Value llvm::createMinMaxOp(IRBuilderBase &Builder, RecurKind RK, Value Left,		Value llvm::createMinMaxOp(IRBuilderBase &Builder, RecurKind RK, Value Left,
Value *Right) {		Value *Right) {
Type *Ty = Left->getType();		Type *Ty = Left->getType();
if (Ty->isIntOrIntVectorTy()) {		if (Ty->isIntOrIntVectorTy()) {
// TODO: Add float minnum/maxnum support when FMF nnan is set.		// TODO: Add float minnum/maxnum support when FMF nnan is set.
Intrinsic::ID Id = getMinMaxReductionIntrinsicOp(RK);		Intrinsic::ID Id = getMinMaxReductionIntrinsicOp(RK);
return Builder.CreateIntrinsic(Ty, Id, {Left, Right}, nullptr,		return Builder.CreateIntrinsic(Ty, Id, {Left, Right}, nullptr,
"rdx.minmax");		"rdx.minmax");
}		}
CmpInst::Predicate Pred = getMinMaxReductionPredicate(RK);		return createMinMaxSelectCmpOp(Builder, RK, Left, Right);
xbolva00Unsubmitted Not Done Reply Inline Actions Why we dont use intrinsic in all cases? xbolva00: Why we dont use intrinsic in all cases?
Mel-ChenAuthorUnsubmitted Done Reply Inline Actions Refer to D148221. My understanding is that for fp min max there are still some FMF issues to be handled. @RKSimon Mel-Chen: Refer to [[ https://reviews.llvm.org/D148221 \| D148221 ]]. My understanding is that for fp min…
RKSimonUnsubmitted Not Done Reply Inline Actions maxnum/minnum reductions are sensitive to ordering if the values might be nan RKSimon: maxnum/minnum reductions are sensitive to ordering if the values might be nan
Value *Cmp = Builder.CreateCmp(Pred, Left, Right, "rdx.minmax.cmp");
Value *Select = Builder.CreateSelect(Cmp, Left, Right, "rdx.minmax.select");
return Select;
}		}

// Helper to generate an ordered reduction.		// Helper to generate an ordered reduction.
Value llvm::getOrderedReduction(IRBuilderBase &Builder, Value Acc, Value *Src,		Value llvm::getOrderedReduction(IRBuilderBase &Builder, Value Acc, Value *Src,
unsigned Op, RecurKind RdxKind) {		unsigned Op, RecurKind RdxKind) {
unsigned VF = cast<FixedVectorType>(Src->getType())->getNumElements();		unsigned VF = cast<FixedVectorType>(Src->getType())->getNumElements();

// Extract and apply reduction ops in ascending order:		// Extract and apply reduction ops in ascending order:
▲ Show 20 Lines • Show All 967 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[IR] New function llvm::createMinMaxSelectCmpOp for creating min/max operation in select-cmp formNeeds ReviewPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 518998

llvm/include/llvm/Transforms/Utils/LoopUtils.h

llvm/lib/Transforms/Utils/LoopUtils.cpp

[IR] New function llvm::createMinMaxSelectCmpOp for creating min/max operation in select-cmp form
Needs ReviewPublic