This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/Target/RISCV/
-
Target/
-
RISCV/
-
RISCVTargetTransformInfo.h
-
test/Transforms/SLPVectorizer/RISCV/
-
Transforms/
-
SLPVectorizer/
-
RISCV/
-
lit.local.cfg
-
rvv-min-vector-size.ll

Differential D116534

[RISCV] Set getMinVectorRegisterBitWidth to 16 if enable fixed length vector code gen for RVV
ClosedPublic

Authored by kito-cheng on Jan 3 2022, 5:09 AM.

Download Raw Diff

Details

Reviewers

craig.topper
frasercrmck

Commits

rGf142c45f1e49: [RISCV] Set getMinVectorRegisterBitWidth to 16 if enable fixed length vector…

Summary

getMinVectorRegisterBitWidth means what vector types is supported in
this target, and actually RISC-V support all fixed length vector types with
vector length less than getMinRVVVectorSizeInBits, so set it to 16,
means 2 x i8, that is minimal fixed length vector size in theory.

That also fixed one issue, some testcase migth become non-vectorizable
when -riscv-v-vector-bits-min set to larger value, because the vector size is
smaller than -riscv-v-vector-bits-min.

For example, following code can vectorize by SLP with
-riscv-v-vector-bits-min=128 or -riscv-v-vector-bits-min=256, but
can't vectorize -riscv-v-vector-bits-min=512 or larger:

void foo(double *da) {
  da[0] = 0;
  da[1] = 1;
  da[2] = 2;
  da[3] = 3;
}

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

kito-cheng created this revision.Jan 3 2022, 5:09 AM

Herald added subscribers: VincentWu, luke957, achieveartificialintelligence and 26 others. · View Herald TranscriptJan 3 2022, 5:09 AM

kito-cheng requested review of this revision.Jan 3 2022, 5:09 AM

Herald added a project: Restricted Project. · View Herald TranscriptJan 3 2022, 5:09 AM

Herald added subscribers: llvm-commits, MaskRay. · View Herald Transcript

kito-cheng added reviewers: craig.topper, frasercrmck.Jan 3 2022, 5:11 AM

kito-cheng edited the summary of this revision. (Show Details)

Harbormaster completed remote builds in B141315: Diff 397046.Jan 3 2022, 5:54 AM

liaolucy added a subscriber: liaolucy.Jan 3 2022, 11:17 PM

I agree it shouldn't be based on -riscv-v-vector-bits-min. But 16 feels maybe too low. What do other targets use?

I think only VE and Aarch64 is meaningful for RISC-V as reference since we are the only 3 targets having scaleable vector support, so I only take a look on those two targets:

VE: NO VLS code gen support, getMinVectorRegisterBitWidth always return 0.
AArch64: Return 64 as default, and set 128 for many core with this comment // FIXME: remove this to enable 64-bit SLP if performance looks good.[1]

My thought: This hook is describing capability of target, so I would prefer describe what we really can support, which is 2 x i8, I know there is concern about it having benefit or not, but I think that should be cost model stuffs, we could describe that on cost model in following patches, for example 2 x i8 and (ST->getMinRVVVectorSizeInBits()) / 8 x i8 having same cost, so SLP and loop vectorization will using larger type if possible.

[1] https://github.com/llvm/llvm-project/blob/main/llvm/lib/Target/AArch64/AArch64Subtarget.cpp#L138

LGTM

This revision is now accepted and ready to land.Jan 7 2022, 5:57 PM

This revision was landed with ongoing or failed builds.Jan 7 2022, 7:16 PM

Closed by commit rGf142c45f1e49: [RISCV] Set getMinVectorRegisterBitWidth to 16 if enable fixed length vector… (authored by kito-cheng). · Explain Why

This revision was automatically updated to reflect the committed changes.

Kito Cheng <kito.cheng@sifive.com> added a commit: rGf142c45f1e49: [RISCV] Set getMinVectorRegisterBitWidth to 16 if enable fixed length vector….

Revision Contents

Path

Size

llvm/

lib/

Target/

RISCV/

RISCVTargetTransformInfo.h

2 lines

test/

Transforms/

SLPVectorizer/

RISCV/

lit.local.cfg

2 lines

rvv-min-vector-size.ll

68 lines

Diff 398293

llvm/lib/Target/RISCV/RISCVTargetTransformInfo.h

Show First 20 Lines • Show All 75 Lines • ▼ Show 20 Lines	public:
void getUnrollingPreferences(Loop *L, ScalarEvolution &SE,		void getUnrollingPreferences(Loop *L, ScalarEvolution &SE,
TTI::UnrollingPreferences &UP,		TTI::UnrollingPreferences &UP,
OptimizationRemarkEmitter *ORE);		OptimizationRemarkEmitter *ORE);

void getPeelingPreferences(Loop *L, ScalarEvolution &SE,		void getPeelingPreferences(Loop *L, ScalarEvolution &SE,
TTI::PeelingPreferences &PP);		TTI::PeelingPreferences &PP);

unsigned getMinVectorRegisterBitWidth() const {		unsigned getMinVectorRegisterBitWidth() const {
return ST->hasVInstructions() ? ST->getMinRVVVectorSizeInBits() : 0;		return ST->useRVVForFixedLengthVectors() ? 16 : 0;
}		}

InstructionCost getGatherScatterOpCost(unsigned Opcode, Type *DataTy,		InstructionCost getGatherScatterOpCost(unsigned Opcode, Type *DataTy,
const Value *Ptr, bool VariableMask,		const Value *Ptr, bool VariableMask,
Align Alignment,		Align Alignment,
TTI::TargetCostKind CostKind,		TTI::TargetCostKind CostKind,
const Instruction *I);		const Instruction *I);

▲ Show 20 Lines • Show All 104 Lines • Show Last 20 Lines

llvm/test/Transforms/SLPVectorizer/RISCV/lit.local.cfg

This file was added.

				if not 'RISCV' in config.root.targets:
				config.unsupported = True

llvm/test/Transforms/SLPVectorizer/RISCV/rvv-min-vector-size.ll

This file was added.

				; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
				; RUN: opt < %s -slp-vectorizer -mtriple=riscv64 -mattr=+experimental-v \
				; RUN: -riscv-v-vector-bits-min=128 -S \| FileCheck %s --check-prefixes=CHECK,CHECK-128
				; RUN: opt < %s -slp-vectorizer -mtriple=riscv64 -mattr=+experimental-v \
				; RUN: -riscv-v-vector-bits-min=256 -S \| FileCheck %s --check-prefixes=CHECK,CHECK-256
				; RUN: opt < %s -slp-vectorizer -mtriple=riscv64 -mattr=+experimental-v \
				; RUN: -riscv-v-vector-bits-min=512 -S \| FileCheck %s --check-prefixes=CHECK,CHECK-512

				target datalayout = "e-m:e-p:64:64-i64:64-i128:128-n64-S128"
				target triple = "riscv64"

				define void @foo(i64* nocapture writeonly %da) {
				; CHECK-128-LABEL: @foo(
				; CHECK-128-NEXT: entry:
				; CHECK-128-NEXT: [[ARRAYIDX1:%.]] = getelementptr inbounds i64, i64 [[DA:%.*]], i64 1
				; CHECK-128-NEXT: [[TMP0:%.]] = bitcast i64 [[DA]] to <2 x i64>*
				; CHECK-128-NEXT: store <2 x i64> <i64 0, i64 1>, <2 x i64>* [[TMP0]], align 8
				; CHECK-128-NEXT: [[ARRAYIDX2:%.]] = getelementptr inbounds i64, i64 [[DA]], i64 2
				; CHECK-128-NEXT: [[ARRAYIDX3:%.]] = getelementptr inbounds i64, i64 [[DA]], i64 3
				; CHECK-128-NEXT: [[TMP1:%.]] = bitcast i64 [[ARRAYIDX2]] to <2 x i64>*
				; CHECK-128-NEXT: store <2 x i64> <i64 2, i64 3>, <2 x i64>* [[TMP1]], align 8
				; CHECK-128-NEXT: ret void
				;
				; CHECK-256-LABEL: @foo(
				; CHECK-256-NEXT: entry:
				; CHECK-256-NEXT: [[ARRAYIDX1:%.]] = getelementptr inbounds i64, i64 [[DA:%.*]], i64 1
				; CHECK-256-NEXT: [[ARRAYIDX2:%.]] = getelementptr inbounds i64, i64 [[DA]], i64 2
				; CHECK-256-NEXT: [[ARRAYIDX3:%.]] = getelementptr inbounds i64, i64 [[DA]], i64 3
				; CHECK-256-NEXT: [[TMP0:%.]] = bitcast i64 [[DA]] to <4 x i64>*
				; CHECK-256-NEXT: store <4 x i64> <i64 0, i64 1, i64 2, i64 3>, <4 x i64>* [[TMP0]], align 8
				; CHECK-256-NEXT: ret void
				;
				; CHECK-512-LABEL: @foo(
				; CHECK-512-NEXT: entry:
				; CHECK-512-NEXT: [[ARRAYIDX1:%.]] = getelementptr inbounds i64, i64 [[DA:%.*]], i64 1
				; CHECK-512-NEXT: [[ARRAYIDX2:%.]] = getelementptr inbounds i64, i64 [[DA]], i64 2
				; CHECK-512-NEXT: [[ARRAYIDX3:%.]] = getelementptr inbounds i64, i64 [[DA]], i64 3
				; CHECK-512-NEXT: [[TMP0:%.]] = bitcast i64 [[DA]] to <4 x i64>*
				; CHECK-512-NEXT: store <4 x i64> <i64 0, i64 1, i64 2, i64 3>, <4 x i64>* [[TMP0]], align 8
				; CHECK-512-NEXT: ret void
				;
				entry:
				store i64 0, i64* %da, align 8
				%arrayidx1 = getelementptr inbounds i64, i64* %da, i64 1
				store i64 1, i64* %arrayidx1, align 8
				%arrayidx2 = getelementptr inbounds i64, i64* %da, i64 2
				store i64 2, i64* %arrayidx2, align 8
				%arrayidx3 = getelementptr inbounds i64, i64* %da, i64 3
				store i64 3, i64* %arrayidx3, align 8
				ret void
				}

				define void @foo8(i8* nocapture writeonly %da) {
				; CHECK-LABEL: @foo8(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: [[ARRAYIDX1:%.]] = getelementptr inbounds i8, i8 [[DA:%.*]], i8 1
				; CHECK-NEXT: [[TMP0:%.]] = bitcast i8 [[DA]] to <2 x i8>*
				; CHECK-NEXT: store <2 x i8> <i8 0, i8 1>, <2 x i8>* [[TMP0]], align 8
				; CHECK-NEXT: [[ARRAYIDX2:%.]] = getelementptr inbounds i8, i8 [[DA]], i8 2
				; CHECK-NEXT: ret void
				;
				entry:
				store i8 0, i8* %da, align 8
				%arrayidx1 = getelementptr inbounds i8, i8* %da, i8 1
				store i8 1, i8* %arrayidx1, align 8
				%arrayidx2 = getelementptr inbounds i8, i8* %da, i8 2
				ret void
				}