This is an archive of the discontinued LLVM Phabricator instance.

Differential D70107

[VFABI] TargetLibraryInfo mappings in IR.
ClosedPublic

Authored by fpetrogalli on Nov 11 2019, 6:32 PM.

Details

Reviewers

jdoerfert
sdesmalen
simoll

Commits

rGd6de5f12d485: [SVFS] Inject TLI Mappings in VFABI attribute.

Diff Detail

Repository

rG LLVM Github Monorepo

Build Status

Buildable 40862
Build 41000: arc lint + arc unit

Event Timeline

fpetrogalli created this revision. Nov 11 2019, 6:32 PM

Herald added a project: Restricted Project. Nov 11 2019, 6:32 PM

Herald added subscribers: llvm-commits, hiraditya, mgorny.

fpetrogalli added reviewers: jdoerfert, sdesmalen, simoll. Nov 11 2019, 6:33 PM

This patch is not functional yet, but I have added it here to show in the test what I intend to do.

I am adding a pass that adds the "vector-function-abi-variant" attribute from the mappings that are stored in the TargetLibraryInfo (TLI).

This patch is based on the work published in https://reviews.llvm.org/D69976 and https://reviews.llvm.org/D70089

There is no need to review the code for now; all I need to know is whether people agree on this approach.

andwar added a subscriber: andwar. Nov 12 2019, 12:47 AM

@fpetrogalli This means that all passes that need to care about vector variants of functions need to add a dependency on this pass, right? This seems quite similar to how TBAA info is loaded from IR metadata using an analysis pass, after which the information can be queried using the AliasAnalysis interfaces. This is doing a similar thing, but then for querying vector-variants of functions, so +1 for the approach!

We should not forget to document the new mechanism (the VFABI, metadata attributes and the SVFS mechanism), perhaps as a separate document. The metadata format should at least be described in the LangRef.

llvm/include/llvm/Transforms/Utils/InjectTLIMappings.h
20	Don't forget about the old pass manager :)

IIUC, this is a transformation pass (it does modify the module, e.g. by appendToCompilerUsed(*M, {Global});). So you probably want to register it with one of the optimisation pipelines. I _believe_ that that's how you do it:

For legacy PM:

Use INITIALIZE_PASS_BEGIN (https://github.com/llvm/llvm-project/blob/848007cfbc7509543c5b8604ae063bb6c8ffa0a9/llvm/include/llvm/PassSupport.h#L33) that will define initializeInjectTLIMappingsPass for you. Then use it where you want to add it - you can check initializeSLPVectorizerPass for reference (https://github.com/llvm/llvm-project/blob/848007cfbc7509543c5b8604ae063bb6c8ffa0a9/llvm/lib/Transforms/Vectorize/Vectorize.cpp#L28).
You can also use llvm::RegisterStandardPasses (http://llvm.org/docs/WritingAnLLVMPass.html#basic-code-required) if you want to add it in one of the available extension points (so that it runs automagically with e.g. -O1)

For the new PM, you probably want to add your pass to an existing FunctionPassManager, e.g.

OptimizePM (https://github.com/llvm/llvm-project/blob/848007cfbc7509543c5b8604ae063bb6c8ffa0a9/llvm/lib/Passes/PassBuilder.cpp#L948)

Once that's done, your Pass will be run _automagically_ together with other passes in the pipeline. This is just a quick brain-dump so please ping me if it's unclear.

llvm/include/llvm/Transforms/Utils/InjectTLIMappings.h
20	`Legacy` :) Also, I am not a fan of appending `Pass` to pass classes (it's clear what this class inherits from either way). Also, if you use `INITIALIZE_PASS_BEGIN`, `Pass` is going to be appended to the name of the _initialize_ function anyway (so you will have `initializeInjectTLIMappingsPassPass`): https://github.com/llvm/llvm-project/blob/848007cfbc7509543c5b8604ae063bb6c8ffa0a9/llvm/include/llvm/PassSupport.h#L62
llvm/lib/Transforms/Utils/InjectTLIMappings.cpp
105	[nit] 'things' -> 'thinks'
128	[nit] What follows is the definition of `InjectTLIMappingsPass::run` though.
llvm/test/Transforms/Util/add-TLI-mappings.ll
2	For this to work you need to register a command line option. Why not use `print-after` and `print-before` instead? Or maybe we do need a command line option?
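The naming remark above can be illustrated with a small stand-alone sketch. `DEMO_INITIALIZE_PASS` is a hypothetical, heavily simplified stand-in for the token pasting done by the `INITIALIZE_PASS*` macros (the real macros take more arguments and register the pass); it only shows how the class name ends up inside the generated function name.

```cpp
#include <string>

// Hypothetical, simplified stand-in for the token pasting performed by LLVM's
// INITIALIZE_PASS* macros: the generated function is named
// initialize<passName>Pass. The real macros do much more than this.
#define DEMO_INITIALIZE_PASS(passName)                                         \
  std::string initialize##passName##Pass() {                                   \
    return "initialize" #passName "Pass";                                      \
  }

// Class named without a trailing "Pass": the generated name reads naturally.
DEMO_INITIALIZE_PASS(InjectTLIMappingsLegacy)

// Class named with a trailing "Pass": the suffix ends up doubled.
DEMO_INITIALIZE_PASS(InjectTLIMappingsPass)
```

With these two expansions, the first function is `initializeInjectTLIMappingsLegacyPass` and the second is `initializeInjectTLIMappingsPassPass`, which is the doubled suffix the reviewer is warning about.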

sdesmalen added inline comments. Nov 12 2019, 4:05 AM

llvm/lib/Transforms/Utils/InjectTLIMappings.cpp
67	When you think the patch is ready to be reviewed, can you address the comments that I added in D69976 before you removed it?
llvm/test/Transforms/Util/add-TLI-mappings.ll
18	I don't think you need to add a loop here to prove the IR contains the vectorized versions of the IR, a call to `@sin` should be sufficient.

This is an update in which I have tried to add the legacy pass manager.

It is not working yet, but I think I am on a good path!

Thank you @andwar for all the pointers.

For reference, this is the linking error I am getting:

llvm-project/llvm/include/llvm/Transforms/Utils/InjectTLIMappings.h:31: error: undefined reference to 'vtable for llvm::InjectTLIMappingsLegacy'
/usr/bin/ld: the vtable symbol may be undefined because the class is missing its key function
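For readers unfamiliar with this linker message: with the Itanium C++ ABI, the vtable of a class is emitted in the translation unit that defines its key function (the first virtual member function that is neither pure nor inline). One common cause of the error above is a virtual member that is declared in the header but never defined in any source file. The sketch below uses hypothetical class names and is not the code from this patch; it shows the working arrangement.

```cpp
#include <string>

// Hypothetical base class standing in for a pass interface.
struct DemoPassBase {
  virtual ~DemoPassBase() = default;
  virtual std::string run() = 0;
};

// "Header" part: run() is declared but not defined inline, so it is the key
// function of DemoLegacyPass. The vtable is emitted wherever run() is defined.
struct DemoLegacyPass : DemoPassBase {
  std::string run() override;
};

// "Source file" part: if this out-of-line definition is missing from the
// build, constructing a DemoLegacyPass fails at link time with an
// "undefined reference to vtable" error similar to the one quoted above.
std::string DemoLegacyPass::run() { return "ran"; }
```

Checking that every declared virtual member has a definition that is actually compiled and linked (for example, that the new .cpp file is listed in CMakeLists.txt) is usually the first thing to try.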

fpetrogalli marked 5 inline comments as done. Nov 12 2019, 9:28 PM

fpetrogalli added inline comments.

llvm/include/llvm/Transforms/Utils/InjectTLIMappings.h
20	I am working on it! :)

Harbormaster completed remote builds in B40862: Diff 229004. Nov 12 2019, 9:32 PM

New pass manager command line option missing.

FWIW, you can check out D69930 for an example of adding a pass with more hookup into the system.

llvm/include/llvm/Transforms/Utils/InjectTLIMappings.h
22	I doubt you need the constructor or the name function.
llvm/include/llvm/Transforms/Utils/ModuleUtils.h
115	Unrelated?
llvm/lib/Transforms/Utils/InjectTLIMappings.cpp
1	TODO, also below
19	Is there precedent for not having it in all lower-case letters?
74	Wasn't that checked below but with a different call to the module?
80	Where are the VarArgs restrictions checked? If it is implicit, please add an assertion that the original callee is not vararg.
92	Why do you need to query Global here? Global is VectorF isn't it?
118	For brevity and without the need to assign: `SetVector<StringRef> OriginalSetOfMappings(Mappings.begin(), Mappings.end());`
133	Can we make 16 a return value of a (maybe static) function in TLI? Style, above: if (!...count(...))
152	I doubt you need the llvm. Above, `auto *CI` please :)
llvm/test/Transforms/Util/add-TLI-mappings.ll
2	command line options are good for various things, please make sure they work. (new and old PM)

Thank you all for the review. The pass is now functional, working for
the libraries supported by the TLI: SVML, Accelerate and MASSV.

I haven't added it to any of the optimization pipelines at the moment,
under the assumption that once this pass is listed in the required
passes of the loop vectorizer, it will be automatically loaded after
the TLI wrapper pass.

Harbormaster completed remote builds in B40924: Diff 229188. Nov 13 2019, 2:21 PM

fpetrogalli added inline comments. Nov 13 2019, 2:31 PM

llvm/include/llvm/Transforms/Utils/ModuleUtils.h
115	Yes, but still useful. I have also removed a group comment. I'd rather not create a separate patch for this?
llvm/lib/Transforms/Utils/InjectTLIMappings.cpp
19	The pass debug option is all lower case now (and consequently, the command line too).
74	This avoids running the creation of the function if the function already exists. Otherwise I expect `Function::Create` to have some problems.
92	Ah right, good catch. Fixed.
152	"I doubt you need the llvm." You mean I don't need to wrap the pass in the llvm namespace? I have done it in the header file too. Is that wrong?

jdoerfert added inline comments. Nov 13 2019, 3:06 PM

llvm/include/llvm/Transforms/Utils/ModuleUtils.h
115	create an RFC one and commit it w/o review if it is trivial.
llvm/lib/Transforms/Utils/InjectTLIMappings.cpp
74	I mean you checked that already: `Function *VariantF = M->getFunction(TLIName); if (!VariantF) addVariantDeclaration(CI, VF, TLIName);` but once with `getFunction` and once with `getNamedValue`. I think you don't need two checks, and you should consistently pick the one lookup call you want.
92	Forgot to update or not fixed?
152	You don't need it here because you opened the namespace. You should not open namespaces in headers, which is why you wrap it in the namespace there. Plus, you don't need the explicit qualification here because these are not top-level declarations as you have in the header.
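The "one lookup" advice on line 74 can be sketched outside of LLVM as a get-or-create helper. `ToyModule` and its members are hypothetical names used only for illustration; LLVM's own `Module::getOrInsertFunction` follows a similar idea, but this sketch does not reproduce its API.

```cpp
#include <map>
#include <string>

// Hypothetical toy model of a module's symbol table: name -> signature text.
struct ToyModule {
  std::map<std::string, std::string> Decls;
  unsigned Created = 0;

  // One lookup decides everything: emplace() only inserts when the name is
  // not present yet, and reports whether an insertion happened.
  const std::string &getOrInsertDecl(const std::string &Name,
                                     const std::string &Signature) {
    auto Result = Decls.emplace(Name, Signature);
    if (Result.second)
      ++Created; // A new declaration was added.
    return Result.first->second;
  }
};
```

Calling `getOrInsertDecl` twice with the same name creates the declaration once and returns the existing entry the second time, so no separate "does it exist?" check with a different lookup function is needed.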

fpetrogalli marked 3 inline comments as done. Nov 13 2019, 8:39 PM

fpetrogalli added inline comments.

llvm/include/llvm/Transforms/Utils/ModuleUtils.h
115	Done in https://reviews.llvm.org/D70218

andwar added inline comments. Nov 14 2019, 4:06 AM

llvm/include/llvm/Transforms/Utils/InjectTLIMappings.h
22	What about this one? You don't need the `name` function, the one provided by the base class is fine: https://github.com/llvm/llvm-project/blob/848007cfbc7509543c5b8604ae063bb6c8ffa0a9/llvm/include/llvm/IR/PassManager.h#L373 As for the constructor, the auto-generated one should be sufficient here. Is it not?
llvm/lib/Transforms/Utils/InjectTLIMappings.cpp
166	[nit] `Legacy PM implementation` (the pass manager is `legacy`, not the pass :) ).

Address last round of reviews.

Harbormaster completed remote builds in B40975: Diff 229375. Nov 14 2019, 11:57 AM

fpetrogalli added inline comments. Nov 14 2019, 8:36 PM

llvm/include/llvm/Transforms/Utils/InjectTLIMappings.h
22	Constructor and name function removed. That was too much copy and paste.
llvm/lib/Transforms/Utils/InjectTLIMappings.cpp
74	I was using `Global` mostly to be able to call `appendToCompilerUsed`, which works on `global`s. Then, I realized that `Function` inherits from `Global`, so there is no need to use `Global` at all. I also have replaced the check on having an empty body with an assertion, just in case someone modifies the function in the middle and populates the body of the function before invoking `appendToCompilerUsed`.
92	Now it is fixed, no more `Global`.
152	Facepalm myself at the `using namespace llvm;` on top of this cpp file. Thanks for the explanation.

Some minor comments below, otherwise LGTM.

@sdesmalen any more comments?

llvm/lib/Transforms/Utils/InjectTLIMappings.cpp
121	The call is not free but invariant: `for (unsigned VF = 2, MaxVF = TLI.getWidestVF(ScalarName); VF <= MaxVF; VF *= 2) {`
155	Remove `Changed` or do something with it.
171	You can preserve more here, e.g. all CFG analysis.
183	I think one of the false could be a true, unclear if that will make a difference though.
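The suggestion on line 121 (query the widest VF once, then walk the powers of two) can be tried in isolation. `ToyTLI` below is a hypothetical stand-in that only counts how often it is queried; it is not the real TargetLibraryInfo interface, and the returned value of 8 is an arbitrary assumption.

```cpp
#include <string>
#include <vector>

// Hypothetical stand-in for the TLI query; it counts how often it is called.
struct ToyTLI {
  mutable unsigned Queries = 0;
  unsigned getWidestVF(const std::string & /*ScalarName*/) const {
    ++Queries;
    return 8; // Assumed widest vectorization factor for this toy example.
  }
};

// Collect the candidate VFs 2, 4, ..., MaxVF. MaxVF is computed once in the
// loop initializer instead of on every evaluation of the loop condition.
std::vector<unsigned> candidateVFs(const ToyTLI &TLI,
                                   const std::string &ScalarName) {
  std::vector<unsigned> VFs;
  for (unsigned VF = 2, MaxVF = TLI.getWidestVF(ScalarName); VF <= MaxVF;
       VF *= 2)
    VFs.push_back(VF);
  return VFs;
}
```

For a widest VF of 8, the loop visits 2, 4 and 8 while calling `getWidestVF` a single time.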

This revision is now accepted and ready to land. Nov 14 2019, 8:52 PM

fpetrogalli marked 2 inline comments as done. Nov 14 2019, 9:41 PM

fpetrogalli added inline comments.

llvm/lib/Transforms/Utils/InjectTLIMappings.cpp
29	TODO: The description of the counter is misleading, only declarations are added in this pass.
140–143	TODO: remove braces.

Update according to last round of review from @jdoerfert.

Thank you.

llvm/lib/Transforms/Utils/InjectTLIMappings.cpp
183	It is not clear what the cfg parameter is for. I'll leave it `false` for now. If we need to revise it, we will change it later.

Harbormaster completed remote builds in B41044: Diff 229592. Nov 15 2019, 10:41 AM

Closed by commit rGd6de5f12d485: [SVFS] Inject TLI Mappings in VFABI attribute. (authored by fpetrogalli). Nov 15 2019, 10:50 AM

This revision was automatically updated to reflect the committed changes.

@fpetrogalli I have a high-level question: why does this pass require that the function be from one of the vector libraries (MASSV, SVML, etc.)? I'd like to piggy-back on the vector-function-abi-variant attribute to vectorize a call to a scalar function, once the front-end adds this attribute to the callsite. So, if we have the following code in IR, with a scalar call to a function trivially.vectorizable.func with the vector-function-abi-variant attribute, we will generate the vector function declarations and then LoopVectorizer/SLPVectorizer would do its thing.

Test IR:

; ModuleID = 'chk.ll'
source_filename = "chk.ll"
target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
target triple = "x86_64-unknown-linux-gnu"
define dso_local double @test(float* %Arr) {
entry:
  br label %for.cond

for.cond:
  %Sum.0 = phi double [ 0.000000e+00, %entry ], [ %add, %for.inc ]
  %i.0 = phi i32 [ 0, %entry ], [ %inc, %for.inc ]
  %cmp = icmp slt i32 %i.0, 128
  br i1 %cmp, label %for.body, label %for.cond.cleanup

for.cond.cleanup:
  br label %for.end

for.body:
  %idxprom = sext i32 %i.0 to i64
  %arrayidx = getelementptr inbounds float, float* %Arr, i64 %idxprom
  %0 = load float, float* %arrayidx, align 4
  %conv = fpext float %0 to double
  %1 = call fast double @trivially.vectorizable.func(double %conv) #2 ; <-- CALL OF INTEREST
  %add = fadd fast double %Sum.0, %1
  br label %for.inc

for.inc:
  %inc = add nsw i32 %i.0, 1
  br label %for.cond

for.end:
  ret double %Sum.0
}

declare double @trivially.vectorizable.func(double) #1
attributes #2 = { "vector-function-abi-variant"="_ZGV_LLVM_N2v_trivially.vectorizable.func(trivially.vectorizable.func.v2)" }
attributes #1 = { nounwind readnone speculatable willreturn }

The way I'm thinking: this pass will add the declaration for the vector function in IR, `declare <2 x double> @trivially.vectorizable.func.v2(<2 x double>)`.
I've confirmed that once this is done, LoopVectorizer will vectorize that call, i.e. convert the scalar call to the vector version.
Is there any reason to avoid updating this pass with such a functionality? Basically, I'm trying to use something similar to simd pragma in clang (https://clang.llvm.org/docs/AttributeReference.html#id211), where we can specify any function as vectorizable.

Do we already have such support for front-ends to specify any scalar function as 'vectorizable' in IR? This is the only existing attribute that I see which can be reused for this purpose.

Also, to clarify, this vectorized call is finally lowered before passing to the backend/linker etc. So, you can think of this as I have a handwritten nice vectorized form for "trivially.vectorizable.call", which will be inlined before passing to codegen (so these declarations needn't be kept around after the inlining). The main reason for such a usecase is if we want the "high level function call" remaining in IR form until some late pass, which is just before codegen. It is not to bypass some vectorizer code generation limitation. We have such usecases internally.
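To make the attribute string in the test IR easier to read, here is a small stand-alone parser sketch. It assumes the general layout `_ZGV<isa><mask><vlen><parameters>_<scalarname>(<vectorname>)` and handles only the `_LLVM_` ISA token, a numeric `<vlen>`, and a single mapping per string; `VariantMapping` and `parseMapping` are hypothetical names, not LLVM APIs, and the real demangler covers many more cases.

```cpp
#include <cctype>
#include <cstddef>
#include <string>

// Hypothetical result type for this sketch (not an LLVM type).
struct VariantMapping {
  bool Valid = false;
  bool Masked = false;    // 'M' = masked, 'N' = unmasked.
  unsigned VF = 0;        // Vectorization factor (<vlen>).
  std::string Params;     // Parameter tokens, e.g. "v" for one vector param.
  std::string ScalarName;
  std::string VectorName;
};

// Simplified parser: only the "_LLVM_" ISA token and a numeric <vlen> are
// recognized. Anything else yields Valid == false.
VariantMapping parseMapping(const std::string &S) {
  VariantMapping M;
  const std::string Prefix = "_ZGV_LLVM_";
  if (S.compare(0, Prefix.size(), Prefix) != 0)
    return M;
  std::size_t Pos = Prefix.size();
  if (Pos >= S.size() || (S[Pos] != 'N' && S[Pos] != 'M'))
    return M;
  M.Masked = S[Pos++] == 'M';
  const std::size_t DigitsBegin = Pos;
  while (Pos < S.size() && std::isdigit(static_cast<unsigned char>(S[Pos])))
    M.VF = M.VF * 10 + static_cast<unsigned>(S[Pos++] - '0');
  if (Pos == DigitsBegin)
    return M;
  const std::size_t Underscore = S.find('_', Pos);
  const std::size_t Open = S.find('(', Pos);
  if (Underscore == std::string::npos || Open == std::string::npos ||
      Underscore > Open || S.back() != ')')
    return M;
  M.Params = S.substr(Pos, Underscore - Pos);
  M.ScalarName = S.substr(Underscore + 1, Open - Underscore - 1);
  M.VectorName = S.substr(Open + 1, S.size() - Open - 2);
  M.Valid = !M.Params.empty() && !M.ScalarName.empty() && !M.VectorName.empty();
  return M;
}
```

Applied to the string from the test IR, this yields an unmasked mapping with VF 2, one vector parameter, scalar name `trivially.vectorizable.func` and vector name `trivially.vectorizable.func.v2`.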

In D70107#2018145, @anna wrote:

@fpetrogalli I have a high-level question: why does this pass require that the function be from one of the vector libraries (MASSV, SVML, etc.)? I'd like to piggy-back on the vector-function-abi-variant attribute to vectorize a call to a scalar function, once the front-end adds this attribute to the callsite. So, if we have the following code in IR, with a scalar call to a function trivially.vectorizable.func with the vector-function-abi-variant attribute, we will generate the vector function declarations and then LoopVectorizer/SLPVectorizer would do its thing.

Hi @anna ,

this is exactly what the attribute is for. This pass works only with the library recognized by the TLI as a shortcut implementation to avoid doing the codegeneration of the attribute for the library functions in the frontend.

As things are, if you run opt with -O2 on your IR, the code should be vectorized with a call to the vector version, if not for the fact that the IR is missing the declaration of the actual vector function. In fact, the vector-function-abi-variant attribute requires that all the mappings listed in the attribute are resolved by some declaration/definition in the IR (notice that unused _declarations_ are kept in the IR and not deleted because such declarations are listed in the intrinsic global variable `@llvm.compiler.used`). You can find more information on this attribute here: https://llvm.org/docs/LangRef.html#call-site-attributes

Test IR: [same test IR as in the quoted comment above]

Yes, this is correct. Your code will vectorize as it sees the vector declaration. Just make sure to mark it with @llvm.compiler.used so it doesn't get deleted by other optimization in the pipelines before reaching the vectorizer.

Is there any reason to avoid updating this pass with such a functionality? Basically, I'm trying to use something similar to simd pragma in clang (https://clang.llvm.org/docs/AttributeReference.html#id211), where we can specify any function as vectorizable.

This pass is not to be used as the main way to generate the IR attribute that carries the mappings. It is here only to translate the TLI mappings used by the libraries into IR information, as the loop vectorizer does not interface anymore with the TLI but uses a class called VFDatabase that loads the mappings from vector-function-abi-variant. Any other mapping should be added by the frontend. In fact, the vector-function-abi-variant attribute was originally designed to carry the information of the OpenMP declare variant and declare simd directives: http://lists.llvm.org/pipermail/llvm-dev/2019-June/133484.html (Please notice that none of the frontend changes discussed in that email have been implemented yet; all we have done for now are the backend pieces)

Do we already have such support for front-ends to specify any scalar function as 'vectorizable' in IR? This is the only existing attribute that I see which can be reused for this purpose.

As I said, no. Changing the code generation of declare simd to use vector-function-abi-variant is on my to-do list, though unfortunately not as close to the top as I would like. I suspect that you also have your own pipeline of things to do, but let me know if declare simd becomes closer to you than to me; I can always help you get things sorted! Of course, feel free to ignore my offer, which is essentially a friendly "patches are welcome!" :)

Also, to clarify, this vectorized call is finally lowered before passing to the backend/linker etc. So, you can think of this as I have a handwritten nice vectorized form for "trivially.vectorizable.call", which will be inlined before passing to codegen (so these declarations needn't be kept around after the inlining). The main reason for such a usecase is if we want the "high level function call" remaining in IR form until some late pass, which is just before codegen. It is not to bypass some vectorizer code generation limitation. We have such usecases internally.

If the function is defined in the module, even if all its uses are inlined, why would you want to remove it? If the function declaration is in the module, it is because it is present in the original source? Even if it comes from a declare simd on a scalar function, the compiler is allowed to think it is there even if it is unused.

Please let me know if you have any more questions!

Kind regards,

Francesco

Thank you for the detailed and quick response @fpetrogalli.

In D70107#2018558, @fpetrogalli wrote:

In D70107#2018145, @anna wrote:

@fpetrogalli I have a high-level question: why does this pass require that the function be from one of the vector libraries (MASSV, SVML, etc.)? I'd like to piggy-back on the vector-function-abi-variant attribute to vectorize a call to a scalar function, once the front-end adds this attribute to the callsite. So, if we have the following code in IR, with a scalar call to a function trivially.vectorizable.func with the vector-function-abi-variant attribute, we will generate the vector function declarations and then LoopVectorizer/SLPVectorizer would do its thing.

Hi @anna ,

this is exactly what the attribute is for. This pass works only with the library recognized by the TLI as a shortcut implementation to avoid doing the codegeneration of the attribute for the library functions in the frontend.

As things are, if you run opt with -O2 on your IR, the code should be vectorized with a call to the vector version, if not for the fact that the IR is missing the declaration of the actual vector function. In fact, the vector-function-abi-variant attribute requires that all the mappings listed in the attribute are resolved by some declaration/definition in the IR.

Ah, interesting. I hadn't noticed that property of the attribute. So, basically, if we were to pass the attribute through the front-end, we're required to keep the vector declarations in the IR as well.

For my purposes, our front-end converts the code into IR and our use case works with adding the attribute at that point itself. I was hoping to actually avoid having the various vector declarations in the IR - frankly it's just for IR cleanliness and some compile time (depending on number of vector variants for each such scalar function) since the vector declarations won't typically be used until we do a pass for some form of vectorization (loop/SLP etc).

what I was thinking of adding in inject-TLI-mappings or a separate pass such as "inject-vector-declarations" is:

if (CI->hasFnAttr("vector-function-abi-variant")) {
  // getVectorVariantFromAttr is a proposed helper, not an existing API.
  getVectorVariantFromAttr(CI, VariantNames);
  for (auto VName : VariantNames)
    addVariantDeclaration(CI, VF, VName);
}

So, to that end, I think it would be better to make the requirement that the "vector name mentioned in the attribute should be present in the IR as a vector function declaration" optional? This would also go better with the work of converting "pragma simd" to vector-abi attribute, since the front-end needn't add all the various vector declarations (until it is actually required by some pass, such as loop/SLP vectorizer). Again, this is just for IR compactness and compile time benefits. We do not have any functional issues with the attribute as-is, and we do not need source language support for pragma-simd (more details below).
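As a rough illustration of what deriving a declaration from the scalar signature would involve, the sketch below builds the text of a vector declaration by widening every scalar type by the VF. It assumes that all parameters are plain vector (`v`) parameters and that the return type is not void, which is exactly the kind of simplification that does not hold in general; `widen` and `vectorDeclaration` are hypothetical helper names.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical helper: spell the IR vector type for a scalar type and a VF.
std::string widen(const std::string &ScalarTy, unsigned VF) {
  return "<" + std::to_string(VF) + " x " + ScalarTy + ">";
}

// Build the text of a vector declaration, assuming every parameter and the
// return value are simply widened by VF (i.e. all parameters are 'v').
std::string vectorDeclaration(const std::string &VectorName,
                              const std::string &ScalarRetTy,
                              const std::vector<std::string> &ScalarParamTys,
                              unsigned VF) {
  std::string Decl =
      "declare " + widen(ScalarRetTy, VF) + " @" + VectorName + "(";
  for (std::size_t I = 0; I < ScalarParamTys.size(); ++I) {
    if (I != 0)
      Decl += ", ";
    Decl += widen(ScalarParamTys[I], VF);
  }
  return Decl + ")";
}
```

For the example in this thread, `vectorDeclaration("trivially.vectorizable.func.v2", "double", {"double"}, 2)` produces the `<2 x double>` declaration mentioned earlier. Linear or uniform parameters, masks, and language-level types would need information that this sketch does not have.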

Yes, this is correct. Your code will vectorize as it sees the vector declaration. Just make sure to mark it with @llvm.compiler.used so it doesn't get deleted by other optimization in the pipelines before reaching the vectorizer.

Yup, the main thing was I wanted to avoid having the declarations in the IR, but I see that it is the property of the attribute itself.

Changing the codegeneration of declare simd to use vector-function-abi-variant is on my to do list, unfortunately not as close as I would like them to be. I suspect that also you have your own pipeline of things to do, but let me know if declare simd becomes closer to you than to me, I can always help you getting things sorted!

For us, we are able to pass in the "vector-function-abi-variant" since the function we're trying to vectorize is not from the source code (Java), so that part is sorted. We are adding these vectorized versions of some internal functions we add through the front-end (not present in source code).

I was just curious if we had some other attribute I hadn't noticed :) It's just that adding these bunches of declaration from front-end seemed a lot of such declarations going through each pass (number of scalar functions * 5).

Also, this is slightly OT: the pass seems to rely on the fact that the vectorizer will always use power-of-2 VF (which is true currently), but once we start supporting any number for VF (for example in middle-end it's VF=6 and backend decides what the correct VF is), we will start having way too many declarations in the module (and the pass will also need to be updated). I think we're functionally good in that case, because we will just not generate the vectorized call, since we don't have the corresponding VF variant declaration.

In D70107#2018811, @anna wrote:

Thank you for the detailed and quick response @fpetrogalli.

You are welcome!

In D70107#2018558, @fpetrogalli wrote:

In D70107#2018145, @anna wrote:

@fpetrogalli I have a high-level question: why does this pass require that the function be from one of the vector libraries (MASSV, SVML, etc.)? I'd like to piggy-back on the vector-function-abi-variant attribute to vectorize a call to a scalar function, once the front-end adds this attribute to the callsite. So, if we have the following code in IR, with a scalar call to a function trivially.vectorizable.func with the vector-function-abi-variant attribute, we will generate the vector function declarations and then LoopVectorizer/SLPVectorizer would do its thing.

Hi @anna ,

this is exactly what the attribute is for. This pass works only with the library recognized by the TLI as a shortcut implementation to avoid doing the codegeneration of the attribute for the library functions in the frontend.

As things are, if you run opt with -O2 on your IR, the code should be vectorized with a call to the vector version, if not for the fact that the IR is missing the declaration of the actual vector function. In fact, the vector-function-abi-variant attribute requires that all the mappings listed in the attribute are resolved by some declaration/definition in the IR.

Ah, interesting. I hadn't noticed that property of the attribute. So, basically, if we were to pass the attribute through the front-end, we're required to keep the vector declarations in the IR as well.

Yep! I have added as many assertions as I could to make sure the declaration was not missing when reading the attribute.

For my purposes, our front-end converts the code into IR and our use case works with adding the attribute at that point itself. I was hoping to actually avoid having the various vector declarations in the IR - frankly it's just for IR cleanliness and some compile time (depending on number of vector variants for each such scalar function) since the vector declarations won't typically be used until we do a pass for some form of vectorization (loop/SLP etc).

I understand - this was our original intent too. Add the attribute and build the vector function signature at compile time in the middle end, out of the scalar signature in the IR and the mangled string of the vector function name. We started this work last summer, until our intern @aranisumedh found out that it was not possible to retrieve the vector signatures in some cases [1]. Essentially, it turned out that only the frontend has enough information about the language types that are needed to build the correct signature in IR. Hence, we decided to require the vector declaration in the IR.

[1] See this: http://lists.llvm.org/pipermail/llvm-dev/2019-June/133225.html, and the follow up discussion. Let me know if you have any questions!

what I was thinking of adding in inject-TLI-mappings or a separate pass such as "inject-vector-declarations" is:
if (CI->hasFnAttr("vector-function-abi-variant")) {
  // getVectorVariantFromAttr is a proposed helper, not an existing API.
  getVectorVariantFromAttr(CI, VariantNames);
  for (auto VName : VariantNames)
    addVariantDeclaration(CI, VF, VName);
}
So, to that end, I think it would be better to make the property that the "vector name mentioned in the attribute should be present in the IR as a vector function declaration" as optional?

Sorry, I don't think we can do this, for the aforementioned reason, and for the sake of consistency. It would be bad to end up having code to handle the custom behavior of the attribute.

This would also go better with the work of converting "pragma simd" to vector-abi attribute, since the front-end needn't add all the various vector declarations (until actually it is required by some pass, such as loop/SLP vectorizer). Again, this is just for IR compactness, compile time benefits. We do not have any functional issues with the attribute as-is, and we do not need source language support for pragma-simd (more details below).

Yes, this is correct. Your code will vectorize as it sees the vector declaration. Just make sure to mark it with @llvm.compiler.used so it doesn't get deleted by other optimization in the pipelines before reaching the vectorizer.

Yup, the main thing was I wanted to avoid having the declarations in the IR, but I see that it is the property of the attribute itself.

Changing the codegeneration of declare simd to use vector-function-abi-variant is on my to do list, unfortunately not as close as I would like them to be. I suspect that also you have your own pipeline of things to do, but let me know if declare simd becomes closer to you than to me, I can always help you getting things sorted!

For us, we are able to pass in the "vector-function-abi-variant" since the function we're trying to vectorize is not from the source code (Java), so that part is sorted. We are adding these vectorized versions of some internal functions we add through the front-end (not present in source code).

I think your starting point should be to make sure that the front end generates the exact list of functions you want to provide in vector form, using the attribute and relative declarations. Once you have verified the declarations are there, check that the vectorizer vectorizes as expected. If not, improve whatever part of the middle end opt that is needed to make your input IR work.

I was just curious if we had some other attribute I hadn't noticed :) It's just that adding these bunches of declaration from front-end seemed a lot of such declarations going through each pass (number of scalar functions * 5).

I can see that this might not "look nice" from some points of view, but it is the best way to guarantee that front-end and middle-end are decoupled, so that each component can be unit-tested independently. In my past I have gone through testing a front-end coupled with a backend - you don't wanna do that if you want to stay sane! :)

Also, this is slightly OT: the pass seems to rely on the fact that vectorizer will always use power-of-2 VF (which is true currently),

the pass assumes power of 2 because the TLI assumes power of two. The pass doesn't know anything about the vectorizer.

but once we start supporting any number for VF (for example in middle-end it's VF=6 and backend decides what the correct VF is),

Of course, the TLI assumes power of 2 because the vectorizer assumes power of 2. It is a chain. If you want to vectorize VF=6, I think you should start from the vectorizer.

we will start having way too many declarations in module (and the pass will also need to be updated).

I think you need to define how many are too many. Even if the IR file will seem to have many unused declarations, those will not end up in an object file, and will not be useless because they could be used by other optimization passes if needed. It seems to be the only way we can keep the scalar-to-mapping info in a useful place.

I think we're functionally good in that case, because we will just not generate the vectorized call, since we don't have the corresponding VF variant declaration.

Yep. The VFDatabase is the interface you want to use to check the availability of vector functions in the IR, for a given CallInst CI (please refer to the code in the vectorizer, and the implementation of VFDatabase and its API in llvm/Analysis/VectorUtils.h).
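
The availability check described above can be pictured with a toy lookup table. This is a minimal sketch and not the `VFDatabase` API: `ToyVariantTable`, `addVariant`, `findVariant` and `makeExampleTable` are hypothetical names, and `vec_sin2`/`vec_sin4` are made-up vector function names. It only illustrates that a caller asks for a (scalar name, VF) pair and keeps the scalar call when no variant with that VF was declared.

```cpp
#include <cassert>
#include <map>
#include <string>
#include <utility>

// Toy stand-in for a scalar-to-vector mapping table.
// Hypothetical type for illustration; this is NOT LLVM's VFDatabase.
class ToyVariantTable {
public:
  // Record that `scalar` has a vector variant `vector` with `vf` lanes.
  void addVariant(const std::string &scalar, unsigned vf,
                  const std::string &vector) {
    // The empty string is reserved as the "no variant" answer below.
    assert(!vector.empty() && "vector function name must not be empty");
    table_[std::make_pair(scalar, vf)] = vector;
  }

  // Return the vector function name, or an empty string if no variant
  // with that vectorization factor was declared.
  std::string findVariant(const std::string &scalar, unsigned vf) const {
    auto it = table_.find(std::make_pair(scalar, vf));
    return it == table_.end() ? std::string() : it->second;
  }

private:
  std::map<std::pair<std::string, unsigned>, std::string> table_;
};

// Build a small table with two made-up variants of `sin`.
inline ToyVariantTable makeExampleTable() {
  ToyVariantTable t;
  t.addVariant("sin", 2, "vec_sin2");
  t.addVariant("sin", 4, "vec_sin4");
  return t;
}
```

With this table, a query for `("sin", 6)` returns an empty string, which mirrors the point made earlier in the thread: without a declaration for that VF, the vectorized call is simply not generated.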

Sorry if some of my answers sound "on the go", these days I am working odd hours and have many things in the pipeline! :)

Please don't stop asking questions!

Thank you!

Francesco

In D70107#2019000, @fpetrogalli wrote:
In D70107#2018811, @anna wrote:

For my purposes, our front-end converts the code into IR and our use case works with adding the attribute at that point itself. I was hoping to actually avoid having the various vector declarations in the IR - frankly it's just for IR cleanliness and some compile time (depending on number of vector variants for each such scalar function) since the vector declarations won't typically be used until we do a pass for some form of vectorization (loop/SLP etc).

I understand - this was our original intent too. Add the attribute and build the vector function signature at compile time in the middle end, out of the scalar signature in the IR and the mangled string of the vector function name. We started this work last summer, until our intern @aranisumedh found out that it was not possible to retrieve the vector signatures in some cases [1]. Essentially, it turned out that only the frontend has enough information about the language types that are needed to build the correct signature in IR. Hence, we decided to require the vector declaration in the IR.

[1] See this: http://lists.llvm.org/pipermail/llvm-dev/2019-June/133225.html, and the follow up discussion. Let me know if you have any questions!
what I was thinking of adding in inject-TLI-mappings or a separate pass such as "inject-vector-declarations" is:
if (CI->hasAttr("variant-abi")) {
  getVectorVariantFromAttr(CI, VariantNames);
  for (VName : VariantNames)
    // VF is the <vlen> encoded in the mangled name VName.
    addVariantDeclaration(CI, VF, VName);
}
So, to that end, I think it would be better to make optional the property that "the vector name mentioned in the attribute should be present in the IR as a vector function declaration"?
Sorry, I don't think we can do this, for the aforementioned reason, and for the sake of consistency. It would be bad to end up having code to handle the custom behavior of the attribute.

Thanks for the clarification here.

This would also go better with the work of converting "pragma simd" to vector-abi attribute, since the front-end needn't add all the various vector declarations (until actually it is required by some pass, such as loop/SLP vectorizer). Again, this is just for IR compactness, compile time benefits. We do not have any functional issues with the attribute as-is, and we do not need source language support for pragma-simd (more details below).

Yes, this is correct. Your code will vectorize as it sees the vector declaration. Just make sure to mark it with @llvm.compiler.used so it doesn't get deleted by other optimizations in the pipeline before reaching the vectorizer.

Yup, the main thing was I wanted to avoid having the declarations in the IR, but I see that it is the property of the attribute itself.

Changing the code generation of declare simd to use vector-function-abi-variant is on my to-do list, unfortunately not as close as I would like it to be. I suspect that you also have your own pipeline of things to do, but let me know if declare simd becomes closer to you than to me, I can always help you get things sorted!

For us, we are able to pass in the "vector-function-abi-variant" since the function we're trying to vectorize is not from the source code (Java), so that part is sorted. We are adding these vectorized versions of some internal functions we add through the front-end (not present in source code).

I think your starting point should be to make sure that the front end generates the exact list of functions you want to provide in vector form, using the attribute and relative declarations. Once you have verified the declarations are there, check that the vectorizer vectorizes as expected. If not, improve whatever part of the middle end opt that is needed to make your input IR work.

I agree with all of the points. Again, to state, for a simple scalar function, we will have 5 vector forms being generated (2,4,8,16 and 32) and we'll have to start recording each of those declarations in the module. Is that right? This will functionally work for us (since we've tried a similar idea in our pipeline).

I was just curious if we had some other attribute I hadn't noticed :) It's just that adding all these declarations from the front-end means a lot of declarations going through each pass (number of scalar functions * 5).

I can see that this might not "look nice" from some points of view, but it is the best way to guarantee that front-end and middle-end are decoupled, so that each component can be unit-tested independently. In the past I have gone through testing a front-end coupled with a backend - you don't wanna do that if you want to stay sane! :)

Ah, so there is some difference here. Our front-end and LLVM are completely decoupled (more details here: https://llvm.org/devmtg/2017-10/slides/Reames-FalconKeynote.pdf), but we have a mechanism to query from LLVM to our Java VM for anything we want more information about (in this case, pass in the exact set of declarations). So, we can always guarantee the correct set of declarations is retrieved. Building the signature at compile time without any input from FE will be problematic (as you have pointed out above). I can see why the declarations are marked as required for the attribute.

the pass assumes power of 2 because the TLI assumes power of two. The pass doesn't know anything about the vectorizer.

but once we start supporting any number for VF (for example in middle-end it's VF=6 and backend decides what the correct VF is),

Of course, the TLI assumes power of 2 because the vectorizer assumes power of 2. It is a chain. If you want to vectorize VF=6, I think you should start from the vectorizer.

Agreed, I was just pointing it out (and to be clear, this seems to be the assumption in various other parts of the vectorizer as well). :)

we will start having way too many declarations in module (and the pass will also need to be updated).

I think you need to define how many are too many. Even if the IR file will seem to have many unused declarations, those will not end up in an object file, and will not be useless because they could be used by other optimization passes if needed. It seems to be the only way we can keep the scalar-to-vector mapping info in a useful place.

So, as stated previously in numbers, we have something like 5 * number of scalar functions which have vector mappings. In our case, we will have 5 per scalar because there is nothing preventing generating a vectorized power-of-2 VF. I remember seeing some "vector length agnostic function", perhaps those can be generated on the fly, if we specify something like _vN rather than _v2 or _v4 etc?
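
The count discussed above (five power-of-two VFs, 2 through 32, for every scalar function) can be sketched as follows. This is only an illustration of the arithmetic; `powerOfTwoVFs` and `declarationCount` are hypothetical helpers and are not part of the patch.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// List the power-of-two vectorization factors from 2 up to maxVF (inclusive).
std::vector<unsigned> powerOfTwoVFs(unsigned maxVF) {
  // Keep maxVF small enough that doubling vf cannot overflow.
  assert(maxVF <= (1u << 30) && "maxVF too large for this sketch");
  std::vector<unsigned> vfs;
  for (unsigned vf = 2; vf <= maxVF; vf *= 2)
    vfs.push_back(vf);
  return vfs;
}

// Number of vector declarations in the module when every scalar function
// gets one variant per power-of-two VF up to maxVF.
std::size_t declarationCount(std::size_t numScalarFunctions, unsigned maxVF) {
  return numScalarFunctions * powerOfTwoVFs(maxVF).size();
}
```

For `maxVF = 32` the list is 2, 4, 8, 16, 32, so ten scalar functions would give fifty declarations under this assumption.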

Thank you!

In D70107#2020413, @anna wrote:

[...]

I think your starting point should be to make sure that the front end generates the exact list of functions you want to provide in vector form, using the attribute and relative declarations. Once you have verified the declarations are there, check that the vectorizer vectorizes as expected. If not, improve whatever part of the middle end opt that is needed to make your input IR work.

I agree with all of the points. Again, to state, for a simple scalar function, we will have 5 vector forms being generated (2,4,8,16 and 32) and we'll have to start recording each of those declarations in the module. Is that right? This will functionally work for us (since we've tried a similar idea in our pipeline).

"is that right?" -> yes :)

I was just curious if we had some other attribute I hadn't noticed :) It's just that adding all these declarations from the front-end means a lot of declarations going through each pass (number of scalar functions * 5).

I can see that this might not "look nice" from some points of view, but it is the best way to guarantee that front-end and middle-end are decoupled, so that each component can be unit-tested independently. In the past I have gone through testing a front-end coupled with a backend - you don't wanna do that if you want to stay sane! :)

Ah, so there is some difference here. Our front-end and LLVM are completely decoupled (more details here: https://llvm.org/devmtg/2017-10/slides/Reames-FalconKeynote.pdf), but we have a mechanism to query from LLVM to our Java VM for anything we want more information about (in this case, pass in the exact set of declarations). So, we can always guarantee the correct set of declarations is retrieved. Building the signature at compile time without any input from FE will be problematic (as you have pointed out above). I can see why the declarations are marked as required for the attribute.

Good that we reached the same conclusion after going through the same process of discovery!

the pass assumes power of 2 because the TLI assumes power of two. The pass doesn't know anything about the vectorizer.

but once we start supporting any number for VF (for example in middle-end it's VF=6 and backend decides what the correct VF is),

Of course, the TLI assumes power of 2 because the vectorizer assumes power of 2. It is a chain. If you want to vectorize VF=6, I think you should start from the vectorizer.

Agreed, I was just pointing it out (and to be clear, this seems to be the assumption in various other parts of the vectorizer as well). :)

Yep.

we will start having way too many declarations in module (and the pass will also need to be updated).

I think you need to define how many are too many. Even if the IR file will seem to have many unused declarations, those will not end up in an object file, and will not be useless because they could be used by other optimization passes if needed. It seems to be the only way we can keep the scalar-to-vector mapping info in a useful place.

So, as stated previously in numbers, we have something like 5 * number of scalar functions which have vector mappings. In our case, we will have 5 per scalar because there is nothing preventing generating a vectorized power-of-2 VF.

Well, the cost model selects the one that is most appropriate for the datatype and the content of the loop. So, for example, if your loop is processing 64-bit scalars, and your vector registers are 128 bits wide, it is very unlikely that the vectorizer will select anything other than VF = 2... So if your function processes 64-bit data, you should first emit the 2-lane version of the vector function in the module, without bothering to generate the other 4/8/16/32-lane ones if the vectorizer already picks up the 2-lane version.
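
The arithmetic behind the VF = 2 example can be written down as a small sketch. It is not the vectorizer's cost model, which weighs much more than register width; `lanesPerRegister` is a hypothetical helper that only divides the register width by the element width.

```cpp
#include <cassert>

// Lanes that fit in one vector register: register width / element width.
// Hypothetical helper; the real cost model considers far more than this.
unsigned lanesPerRegister(unsigned registerBits, unsigned elementBits) {
  assert(elementBits != 0 && registerBits % elementBits == 0 &&
         "register width must be a multiple of the element width");
  return registerBits / elementBits;
}
```

Under this simplification, 64-bit elements in a 128-bit register give two lanes, which is why the 2-lane declaration is the natural first one to emit in that case.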

I remember seeing some "vector length agnostic function", perhaps those can be generated on the fly, if we specify something like _vN rather than _v2 or _v4 etc?

Vector Length Agnostic (VLA) is currently used only when targeting the Scalable Vector Extension (SVE, of AArch64), because it uses a property of the underlying hardware. You cannot use it for a fixed-width vector extension.

Francesco @fpetrogalli, I was reusing this attribute for custom scalar functions and I have a fundamental question - why is this *only* a callsite attribute? I don't see anything preventing this from being a function attribute as well. The meaning of the function attribute would be: if "variant-abi" is present on the function, all callsites calling this particular function have that attribute. Of course, if you have multiple units being finally linked, it is the responsibility of the front end to choose whether it wants to add the function attribute or a callsite attribute. However, I strongly feel we should have support for both, because there are use cases where the vector shape etc. does not change depending on the callsite. Was there a reason for not supporting it as a function attribute as well?

In D70107#2023372, @anna wrote:

Francesco @fpetrogalli, I was reusing this attribute for custom scalar functions and I have a fundamental question - why is this *only* a callsite attribute? I don't see anything preventing this from being a function attribute as well. The meaning of the function attribute would be: if "variant-abi" is present on the function, all callsites calling this particular function have that attribute. Of course, if you have multiple units being finally linked, it is the responsibility of the front end to choose whether it wants to add the function attribute or a callsite attribute. However, I strongly feel we should have support for both, because there are use cases where the vector shape etc. does not change depending on the callsite. Was there a reason for not supporting it as a function attribute as well?

And I came across exactly that suggestion for making this a function attribute as well: https://lists.llvm.org/pipermail/llvm-dev/2019-May/132631.html

I don't see anything preventing this from being a function attribute as well.

Hi @anna,

apologies in advance for the late reply.

I don't think we want this. Imagine a situation in which proto.h contains a scalar declaration foo marked with #pragma omp declare simd. Suppose you include this header in 2 compilation units, one (sourceA.c) that can be compiled with -fopenmp-simd, and one (sourceB.c) that cannot be compiled with OpenMP - for any reason, for example because of some requirements on floating-point computations. Then, you want to do some optimization that involves merging the two modules. The calls to foo in the IR generated from sourceB.c suddenly become vectorizable, which is wrong. So, overall, I think we should not attach this attribute to a function declaration.

I appreciate that what I described might be a remote possibility, but if we ever end up having to deal with it, it will be quite hard to fix.
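
The merging scenario above can be modelled with a toy example. This is a sketch under stated assumptions, not LLVM code: `ToyCall`, `vectorizableByCallSite` and `vectorizableByFunctionAttr` are hypothetical names, and the attribute is reduced to a boolean on the call site or to membership in a list of annotated functions.

```cpp
#include <cassert>
#include <string>
#include <vector>

// A call in a toy IR: which function is called, and whether the call site
// itself carries the vector-variant attribute. Hypothetical model only.
struct ToyCall {
  std::string callee;
  bool callSiteHasVariantAttr;
};

// Call-site rule: only the annotation on the call itself matters.
bool vectorizableByCallSite(const ToyCall &call) {
  return call.callSiteHasVariantAttr;
}

// Function-level rule: every call to an annotated function counts as
// annotated, regardless of which compilation unit the call came from.
bool vectorizableByFunctionAttr(
    const ToyCall &call, const std::vector<std::string> &annotatedFunctions) {
  assert(!call.callee.empty() && "a call must name its callee");
  for (const std::string &name : annotatedFunctions)
    if (name == call.callee)
      return true;
  return false;
}
```

In this model a call to `foo` coming from sourceB.c has `callSiteHasVariantAttr == false`, so the call-site rule keeps it scalar. Once the modules are merged and `foo` is in the annotated list, the function-level rule reports the same call as vectorizable, which is the behaviour the comment above wants to avoid.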

Let me know if you have any questions.

Kind regards,

Francesco

In D70107#2034194, @fpetrogalli wrote:

I don't see anything preventing this from being a function attribute as well.

I appreciate that what I described might be a remote possibility, but if we ever end up having to deal with it, it will be quite hard to fix.

FWIW, I think for generic OpenMP it is needed to annotate the call site. That said, I don't see a fundamental reason one could not annotate the function. So we need the call site attributes and later when we allow them on functions, everything should be in-place. WDYT?

In D70107#2034194, @fpetrogalli wrote:

I don't see anything preventing this from being a function attribute as well.

Hi @anna,

apologies in advance for the late reply.

I don't think we want this. Imagine a situation in which proto.h contains a scalar declaration foo marked with #pragma omp declare simd. Suppose you include this header in 2 compilation units, one (sourceA.c) that can be compiled with -fopenmp-simd, and one (sourceB.c) that cannot be compiled with OpenMP - for any reason, for example because of some requirements on floating-point computations. Then, you want to do some optimization that involves merging the two modules. The calls to foo in the IR generated from sourceB.c suddenly become vectorizable, which is wrong. So, overall, I think we should not attach this attribute to a function declaration.

I appreciate that what I described might be a remote possibility, but if we ever end up having to deal with it, it will be quite hard to fix.

Let me know if you have any questions.

Kind regards,

Francesco

Hi! I really don't think this should matter. The way I'm thinking of this - it is the responsibility of the front-end to add the correct attribute, either on callsite or function. We can repurpose this attribute for other front-ends and for things other than OpenMP. For your example and in OpenMP, you have this on the callsites. Always. So, the case you describe is handled.

spatel mentioned this in D95373: Replace vector intrinsics with call to vector library.Jan 26 2021, 5:32 AM

Revision Contents

Path                                                      Size
llvm/include/llvm/InitializePasses.h                      1 line
llvm/include/llvm/Transforms/Utils/InjectTLIMappings.h    40 lines
llvm/include/llvm/Transforms/Utils/ModuleUtils.h          2 lines
llvm/lib/Transforms/Utils/CMakeLists.txt                  1 line
llvm/lib/Transforms/Utils/InjectTLIMappings.cpp           163 lines
llvm/test/Transforms/Util/add-TLI-mappings.ll             20 lines

Diff 229004

llvm/include/llvm/InitializePasses.h

[...]
 void initializeIRTranslatorPass(PassRegistry&);
 void initializeIVUsersWrapperPassPass(PassRegistry&);
 void initializeIfConverterPass(PassRegistry&);
 void initializeImplicitNullChecksPass(PassRegistry&);
 void initializeIndVarSimplifyLegacyPassPass(PassRegistry&);
 void initializeIndirectBrExpandPassPass(PassRegistry&);
 void initializeInferAddressSpacesPass(PassRegistry&);
 void initializeInferFunctionAttrsLegacyPassPass(PassRegistry&);
+void initializeInjectTLIMappingsLegacyPass(PassRegistry &);
 void initializeInlineCostAnalysisPass(PassRegistry&);
 void initializeInstCountPass(PassRegistry&);
 void initializeInstNamerPass(PassRegistry&);
 void initializeInstSimplifyLegacyPassPass(PassRegistry &);
 void initializeInstrProfilingLegacyPassPass(PassRegistry&);
 void initializeInstrOrderFileLegacyPassPass(PassRegistry&);
 void initializeInstructionCombiningPassPass(PassRegistry&);
 void initializeInstructionSelectPass(PassRegistry&);
[...]

llvm/include/llvm/Transforms/Utils/InjectTLIMappings.h

This file was added.

				//===- InjectTLIMappings.h - TODO brief description ----------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// TODO
				//
				//===----------------------------------------------------------------------===//
				#ifndef LLVM_TRANSFORMS_UTILS_INJECTTLIMAPPINGS_H
				#define LLVM_TRANSFORMS_UTILS_INJECTTLIMAPPINGS_H

				#include "llvm/IR/PassManager.h"

				namespace llvm {
				class InjectTLIMappings : public PassInfoMixin<InjectTLIMappings> {

				public:
				sdesmalen: Don't forget about the old pass manager :)
				andwar: `Legacy` :) Also, I am not a fan of appending `Pass` to pass classes (it's clear what this class inherits from either way). Also, if you use `INITIALIZE_PASS_BEGIN`, `Pass` is going to be appended to the name of the _Initialize_ method anyway (so you will have `initializeInjectTLIMappingsPassPass`): https://github.com/llvm/llvm-project/blob/848007cfbc7509543c5b8604ae063bb6c8ffa0a9/llvm/include/llvm/PassSupport.h#L62
				fpetrogalli: I am working on it! :)
				static StringRef name() { return "InjectTLIMappings"; }
				explicit InjectTLIMappings() {}
				jdoerfert: I doubt you need the constructor or the name function.
				andwar: What about this one? You don't need the `name` function, the one provided by the base class is fine: https://github.com/llvm/llvm-project/blob/848007cfbc7509543c5b8604ae063bb6c8ffa0a9/llvm/include/llvm/IR/PassManager.h#L373 As for the constructor, the auto-generated one should be sufficient here. Is it not?
				fpetrogalli: Constructor and name function removed. That was too much copy and paste.
				PreservedAnalyses run(Function &F, FunctionAnalysisManager &AM);
				};

				// Legacy pass
				class InjectTLIMappingsLegacy : public FunctionPass {
				public:
				static char ID;

				InjectTLIMappingsLegacy() : FunctionPass(ID) {
				initializeInjectTLIMappingsLegacyPass(*PassRegistry::getPassRegistry());
				}

				void getAnalysisUsage(AnalysisUsage &AU) const override;
				bool runOnFunction(Function &F) override;
				};

				} // End namespace llvm
				#endif // LLVM_TRANSFORMS_UTILS_INJECTTLIMAPPINGS_H

llvm/include/llvm/Transforms/Utils/ModuleUtils.h

[...]
 /// This identifier is normally guaranteed to be unique, or the program would
 /// fail to link due to multiply defined symbols.
 ///
 /// If the module has no strong external symbols (such a module may still have a
 /// semantic effect if it performs global initialization), we cannot produce a
 /// unique identifier for this module, so we return the empty string.
 std::string getUniqueModuleId(Module *M);

-class TargetLibraryInfo;
 class CallInst;
 namespace VFABI {

	jdoerfert: Unrelated?
	fpetrogalli: Yes, but still useful. I have also removed a group comment. I'd rather not create a separate patch for this?
	jdoerfert: create an RFC one and commit it w/o review if it is trivial.
	fpetrogalli: Done in https://reviews.llvm.org/D70218
 /// \defgroup Vector Function ABI (VABI) Module functions.
 ///
 /// Utility functions for VFABI data that can modify the module.
 ///
 /// @{
 /// Overwrite the Vector Function ABI variants attribute with the names provide
 /// in \p VariantMappings.
 void setVectorVariantNames(CallInst *CI,
 const SmallVector<std::string, 8> &VariantMappings);

 /// @}
 } // End VFABI namespace

 } // End llvm namespace

 #endif // LLVM_TRANSFORMS_UTILS_MODULEUTILS_H

llvm/lib/Transforms/Utils/CMakeLists.txt

 add_llvm_library(LLVMTransformUtils
[...]
 Evaluator.cpp
 FlattenCFG.cpp
 FunctionComparator.cpp
 FunctionImportUtils.cpp
 GlobalStatus.cpp
 GuardUtils.cpp
 InlineFunction.cpp
 ImportedFunctionsInliningStatistics.cpp
+InjectTLIMappings.cpp
 InstructionNamer.cpp
 IntegerDivision.cpp
 LCSSA.cpp
 LibCallsShrinkWrap.cpp
 Local.cpp
 LoopRotationUtils.cpp
 LoopSimplify.cpp
 LoopUnroll.cpp
[...]

llvm/lib/Transforms/Utils/InjectTLIMappings.cpp

This file was added.

				//===- InjectTLIMappings.cpp - TODO description --------------------===//
				jdoerfert: TODO, also below
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				// TODO
				//
				//===----------------------------------------------------------------------===//

				#include "llvm/Transforms/Utils/InjectTLIMappings.h"
				#include "llvm/ADT/Statistic.h"
				#include "llvm/Analysis/VectorUtils.h"
				#include "llvm/Transforms/Utils/ModuleUtils.h"

				using namespace llvm;

				#define DEBUG_TYPE "inject-TLI-mappings"
				jdoerfert: Is there precedent for not having it in all lower-case letters?
				fpetrogalli: The pass debug option is all lower case now (and consequently, the command line too).

				STATISTIC(NumCallInjected,
				"Number of calls in which the mappings have been injected.");

				STATISTIC(NumVFDefAdded,
				"Number of function definitions/declarations that have been added.");
				STATISTIC(NumCompUsedAdded,
				"Number of `@llvm.compiler.used` operands that have been added.");

				/// Helper function to map the TLI name to a strings that holds
				fpetrogalli: TODO: The description of the counter is misleading, only declarations are added in this pass.
				/// scalar-to-vector mapping.
				///
				/// _ZGV<isa><mask><vlen><vparams>_<scalarname>(<vectorname>)
				///
				/// where:
				///
				/// <isa> = "_LLVM_"
				/// <mask> = "N". Note: TLI does not support masked interfaces.
				/// <vlen> = Number of concurrent lanes, stored in the `VectorizationFactor`
				/// field of the `VecDesc` struct.
				/// <vparams> = "v", as many as are the number of parameters of CI.
				/// <scalarname> = the name of the scalar function called by CI.
				/// <vectorname> = the name of the vector function mapped by the TLI.
				static std::string mangleTLIName(StringRef VectorName, const CallInst &CI,
				unsigned VF) {
				SmallString<256> Buffer;
				llvm::raw_svector_ostream Out(Buffer);
				Out << "_ZGV" << VFABI::_LLVM_ << "N" << VF;
				for (unsigned I = 0; I < CI.getNumArgOperands(); ++I)
				Out << "v";
				Out << "_" << CI.getCalledFunction()->getName() << "(" << VectorName << ")";
				return Out.str();
				}
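
The mangling scheme documented above can be tried out without LLVM. The following is a minimal standalone sketch, assuming plain `std::string` arguments instead of a `CallInst`; `mangleTLINameSketch` is a hypothetical name and `vec_sin`/`vec_pow` in the usage note are made-up vector function names.

```cpp
#include <cassert>
#include <string>

// Build "_ZGV<isa><mask><vlen><vparams>_<scalarname>(<vectorname>)" with
// <isa> = "_LLVM_", <mask> = "N", and one "v" per parameter.
// Standalone illustration only; the patch works on a CallInst instead.
std::string mangleTLINameSketch(const std::string &vectorName,
                                const std::string &scalarName,
                                unsigned numArgs, unsigned vf) {
  assert(vf >= 1 && "a vectorization factor of zero makes no sense");
  std::string out = "_ZGV_LLVM_N" + std::to_string(vf);
  out.append(numArgs, 'v'); // one 'v' for each vector parameter
  out += "_" + scalarName + "(" + vectorName + ")";
  return out;
}
```

For a one-argument `sin` mapped to a 4-lane `vec_sin`, this yields `_ZGV_LLVM_N4v_sin(vec_sin)`.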

				/// A helper function for converting Scalar types to vector types.
				/// If the incoming type is void, we return void. If the VF is 1, we return
				/// the scalar type.
				static Type *ToVectorTy(Type *Scalar, unsigned VF) {
				if (Scalar->isVoidTy() || VF == 1)
				return Scalar;
				return VectorType::get(Scalar, VF);
				}

				/// A helper function that adds the vector function declaration that
				/// vectorizes the CallInst CI with a vectorization factor of VF
				/// lanes. The TLI assumes that all parameters and the return type of
				/// CI (other than void) need to be widened to a VectorType of VF
				/// lanes.
				sdesmalen: When you think the patch is ready to be reviewed, can you address the comments that I added in D69976 before you removed it?
				static void addVariantDeclaration(CallInst &CI, const unsigned VF,
				const StringRef VFName) {
				Module *M = CI.getModule();
				llvm::GlobalValue *Global = M->getNamedValue(VFName);
				// Nothing to do if the function already exists in the module.
				if (Global)
				return;
				jdoerfert: Wasn't that checked below but with a different call to the module?
				fpetrogalli: This avoids running the creation of the function if the function already exists. Otherwise I expect `Function::Create` to have some problems.
				jdoerfert: I mean you checked that already: `Function *VariantF = M->getFunction(TLIName); if (!VariantF) addVariantDeclaration(CI, VF, TLIName);` but once with `getFunction` and once with `getNamedValue`. I think you don't need two checks and you should consistently pick the one lookup call you want.
				fpetrogalli: I was using `Global` mostly to be able to call `appendToCompilerUsed`, which works on `global`s. Then, I realized that `Function` inherits from `Global`, so there is no need to use `Global` at all. I also have replaced the check on having an empty body with an assertion, just in case someone modifies the function in the middle and populates the body of the function before invoking `appendToCompilerUsed`.

				Type *RetTy = ToVectorTy(CI.getType(), VF);
				SmallVector<Type *, 4> Tys;
				for (Value *ArgOperand : CI.arg_operands())
				Tys.push_back(ToVectorTy(ArgOperand->getType(), VF));
				FunctionType *FTy = FunctionType::get(RetTy, Tys, /*isVarArg=*/false);
				jdoerfert: Where is the VarArgs restriction checked? If it is implicit, please add an assertion that the original callee is not vararg.
				Function *VectorF =
				Function::Create(FTy, Function::ExternalLinkage, VFName, M);
				VectorF->copyAttributesFrom(CI.getCalledFunction());
				++NumVFDefAdded;
				LLVM_DEBUG(dbgs() << DEBUG_TYPE << ": Added to the module: `" << VFName
				<< "` of type " << *(VectorF->getType()));

				// Make function declaration (without a body) "sticky" in the IR by
				// listing them in the @llvm.compiler.used intrinsic.
				if (VectorF->size() == 0) {
				Global = M->getNamedValue(VFName);
				assert(Global && "Missing function declaration.");
				jdoerfert: Why do you need to query Global here? Global is VectorF isn't it?
				fpetrogalli: Ah right, good catch. Fixed.
				jdoerfert: Forgot to update or not fixed?
				fpetrogalli: Now it is fixed, no more `Global`.
				appendToCompilerUsed(*M, {Global});
				LLVM_DEBUG(dbgs() << DEBUG_TYPE << ": Adding `" << VFName
				<< "` to `@llvm.compiler.used`.");
				++NumCompUsedAdded;
				}
				}

				static void addMappingsFromTLI(const TargetLibraryInfo &TLI, CallInst &CI) {
				// This is needed to make sure we don't query the TLI for calls to
				// bitcast of function pointers, like `%call = call i32 (i32*, ...)
				// bitcast (i32 (...)* @goo to i32 (i32, ...))(i32* nonnull %i)`,
				// as such calls make the `isFunctionVectorizable` raise an
				// exception.
				andwar: [nit] 'things' -> 'thinks'
				if (CI.isNoBuiltin() || !CI.getCalledFunction())
				return;

				const std::string ScalarName = CI.getCalledFunction()->getName();
				// Nothing to be done if the TLI things the function is not
				// vectorizable.
				if (!TLI.isFunctionVectorizable(ScalarName))
				return;
				SmallVector<std::string, 8> Mappings;
				VFABI::getVectorVariantNames(CI, Mappings);
				Module *M = CI.getModule();
				const auto OriginalSetOfMappings =
				SetVector<StringRef>(Mappings.begin(), Mappings.end());
				jdoerfert: For brevity and without the need to assign: `SetVector<StringRef> OriginalSetOfMappings(Mappings.begin(), Mappings.end());`
				// 16 is the max number of lanes the TLI has in its VecDesc
				// listings. All VFs are powers of 2.
				for (unsigned VF = 2; VF <= 16; VF *= 2) {
				jdoerfert: The call is not free but invariant: `for (unsigned VF = 2, MaxVF = TLI.getWidestVF(ScalarName); VF <= MaxVF; VF *= 2) {`
				const std::string TLIName = TLI.getVectorizedFunction(ScalarName, VF);
				if (!TLIName.empty()) {
				std::string MangledName = mangleTLIName(TLIName, CI, VF);
				if (OriginalSetOfMappings.count(MangledName) == 0) {
				Mappings.push_back(MangledName);
				++NumCallInjected;
				}
				andwar: [nit] What follows is the definition of `InjectTLIMappingsPass::run` though.
				Function *VariantF = M->getFunction(TLIName);
				if (!VariantF)
				addVariantDeclaration(CI, VF, TLIName);
				}
				}
				jdoerfert: Can we make 16 a return value of a (maybe static) function in TLI? Style, above: `if (!...count(...))`

				VFABI::setVectorVariantNames(&CI, Mappings);
				}
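The two review suggestions on this function (hoist the invariant upper bound into the loop's init-statement, and build the set of pre-existing mappings directly) can be sketched outside of LLVM. The block below is only an illustration under stated assumptions: `MockTLI`, `widestVF`, `vectorizedFunction`, and `collectMappings` are hypothetical stand-ins, not the `TargetLibraryInfo` API, and `std::set` replaces `SetVector` so the example compiles with the standard library alone.

```cpp
#include <cassert>
#include <map>
#include <set>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for the TLI vector-function table: maps
// (scalar name, VF) to a vector function name. Not the LLVM API.
struct MockTLI {
  std::map<std::pair<std::string, unsigned>, std::string> Table;

  // Widest VF recorded for ScalarName, or 0 when there is none.
  unsigned widestVF(const std::string &ScalarName) const {
    unsigned Widest = 0;
    for (const auto &Entry : Table)
      if (Entry.first.first == ScalarName && Entry.first.second > Widest)
        Widest = Entry.first.second;
    return Widest;
  }

  // Vector function name for (ScalarName, VF), or "" when unmapped.
  std::string vectorizedFunction(const std::string &ScalarName,
                                 unsigned VF) const {
    auto It = Table.find(std::make_pair(ScalarName, VF));
    return It == Table.end() ? std::string() : It->second;
  }
};

// Appends to Mappings every vector name the table offers for ScalarName
// that was not already present, and returns how many were added.
unsigned collectMappings(const MockTLI &TLI, const std::string &ScalarName,
                         std::vector<std::string> &Mappings) {
  // Snapshot of the mappings that existed before this call.
  const std::set<std::string> Original(Mappings.begin(), Mappings.end());
  unsigned NumAdded = 0;
  // MaxVF is computed once in the init-statement, not on every iteration.
  for (unsigned VF = 2, MaxVF = TLI.widestVF(ScalarName); VF <= MaxVF;
       VF *= 2) {
    const std::string Name = TLI.vectorizedFunction(ScalarName, VF);
    if (Name.empty())
      continue;
    if (!Original.count(Name)) {
      Mappings.push_back(Name);
      ++NumAdded;
    }
  }
  return NumAdded;
}
```

With a table holding `sin` at VF 2, 4, and 8 and a call that already carries the VF 4 name, the sketch appends only the VF 2 and VF 8 names, which mirrors the "add only what is missing" intent of the reviewed code.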

				namespace llvm {
				PreservedAnalyses InjectTLIMappings::run(Function &F,
				FunctionAnalysisManager &AM) {
				const TargetLibraryInfo &TLI = AM.getResult<TargetLibraryAnalysis>(F);

				for (auto &BB : F)
				fpetrogalli (author): TODO: remove braces.
				for (auto &I : BB) {
				if (auto CI = dyn_cast<CallInst>(&I))
				addMappingsFromTLI(TLI, *CI);
				}

				// Even if the pass adds IR attributes, the analyses are preserved.
				return PreservedAnalyses::all();
				}
				} // End namespace llvm
				jdoerfert: I doubt you need the llvm. Above, `auto *CI` please :)
				fpetrogalli (author): > I doubt you need the llvm. You mean I don't need to wrap the pass in the llvm namespace? I have done it in the header file too. Is that wrong?
				jdoerfert: You don't need it here because you opened the namespace. You should not open namespaces in headers, which is why you wrap it in the namespace there. Plus, you don't need the explicit qualification here because these are not top-level declarations as you have in the header.
				fpetrogalli (author): Facepalm myself at the `using namespace llvm;` on top of this cpp file. Thanks for the explanation.
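The namespace point in this exchange can be illustrated with a small standalone sketch. Everything here is hypothetical: `demo` stands in for `llvm`, and `InjectPass` and `passName` are invented names. It shows that, after a using-directive, an out-of-class member definition attaches to the class in the namespace without a wrapping `namespace` block, whereas a free function still needs qualification.

```cpp
#include <cassert>
#include <string>

// What would live in the header: declarations wrapped in a namespace.
namespace demo { // hypothetical namespace standing in for `llvm`
struct InjectPass {
  std::string run() const;
};
std::string passName(); // a free ("top-level") function
} // namespace demo

// What would live in the .cpp file.
using namespace demo;

// Member definition: `InjectPass` is found through the using-directive and
// the definition attaches to demo::InjectPass, so no wrapper is needed.
std::string InjectPass::run() const { return "ran " + passName(); }

// Free function: it must be qualified (or wrapped in the namespace). An
// unqualified `std::string passName()` definition here would declare a new
// function in the global namespace instead of defining demo::passName.
std::string demo::passName() { return "demo-pass"; }
```

Calling `demo::InjectPass().run()` goes through both definitions, so a mismatch between declaration and definition would surface at link time.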

				////////////////////////////////////////////////////////////////////////////////
				// Legacy Pass manager initialization
				jdoerfert: Remove `Changed` or do something with it.
				////////////////////////////////////////////////////////////////////////////////
				char InjectTLIMappingsLegacy::ID = 0;

				INITIALIZE_PASS_BEGIN(InjectTLIMappingsLegacy, DEBUG_TYPE,
				"Inject TLI Mappings", false, false)
				INITIALIZE_PASS_DEPENDENCY(TargetLibraryInfoWrapperPass)
				INITIALIZE_PASS_END(InjectTLIMappingsLegacy, DEBUG_TYPE, "Inject TLI Mappings",
				false, false)
				andwar: [nit] `Legacy PM implementation` (the pass manager is `legacy`, not the pass :) ).
				jdoerfert: You can preserve more here, e.g. all CFG analyses.
				jdoerfert: I think one of the `false` arguments could be `true`; unclear whether that will make a difference, though.
				fpetrogalli (author): It is not clear what the cfg parameter is for. I'll leave it `false` for now. If we need to revise it, we will change it later.

llvm/test/Transforms/Util/add-TLI-mappings.ll

This file was added.

				; RUN: opt -vector-library=SVML -inject-TLI-mappings -S < %s | FileCheck %s

				andwar: For this to work you need to register a command line option. Why not use `print-after` and `print-before` instead? Or maybe we do need a command line option?
				jdoerfert: Command line options are good for various things; please make sure they work (new and old PM).
				target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
				target triple = "x86_64-unknown-linux-gnu"

				define double @sin_f64(double %in) {
				; CHECK-LABEL: @sin_f64(
				; CHECK: call double @sin(double %{{.*}}) #[[N:[0-9]+]]
				%call = tail call double @sin(double %in)
				ret double %call
				}

				declare double @sin(double) #0

				attributes #0 = { nounwind readnone }

				; CHECK: attributes #[[N]] = { "vector-function-abi-variant"=
				; CHECK-SAME: "_ZGV_LLVM_TLI_N2v_sin(__svml_sin2),
				sdesmalen: I don't think you need to add a loop here to prove the IR contains the vectorized versions of the IR; a call to `@sin` should be sufficient.
				; CHECK-SAME: _ZGV_LLVM_TLI_N4v_sin(__svml_sin4),
				; CHECK-SAME: _ZGV_LLVM_TLI_N8v_sin(__svml_sin8)" }
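The mangled names in the CHECK lines above follow a visible pattern: a `_ZGV_LLVM_TLI_` prefix, `N`, the vectorization factor, one `v` per argument, then `_<scalar name>(<vector name>)`. The sketch below reproduces that string layout with the standard library only. It is inferred from the test expectations, not taken from the patch's `mangleTLIName`; the function name `mangleTLINameSketch` and its parameters are assumptions made for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <string>

// Builds "_ZGV_LLVM_TLI_N<VF><one 'v' per argument>_<scalar>(<vector>)".
// Layout inferred from the CHECK lines of the test; hypothetical helper.
std::string mangleTLINameSketch(const std::string &ScalarName,
                                const std::string &VectorName,
                                std::size_t NumArgs, unsigned VF) {
  // The reviewed loop only visits powers of two, starting at 2.
  assert(VF >= 2 && (VF & (VF - 1)) == 0 && "VF must be a power of 2");
  std::string Out = "_ZGV_LLVM_TLI_N" + std::to_string(VF);
  Out.append(NumArgs, 'v'); // one 'v' token per call argument
  Out += "_" + ScalarName + "(" + VectorName + ")";
  return Out;
}
```

For the single-argument `sin` call in the test, `mangleTLINameSketch("sin", "__svml_sin2", 1, 2)` yields the first name in the expected attribute; the attribute value itself is the comma-separated list of such names.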
