This is an archive of the discontinued LLVM Phabricator instance.

Differential D77632

[TLI] Per-function fveclib for math library used for vectorization
ClosedPublic

Authored by wenlei on Apr 6 2020, 11:59 PM.

Details

Reviewers

tejohnson
hoyFB
spatel
gchatelet

Commits

rG60c642e74be6: [TLI] Per-function fveclib for math library used for vectorization

Summary

Encode the -fveclib setting as a per-function attribute so it can be threaded through to LTO backends. Accordingly, the per-function TLI now reads the attribute and populates the list of available vector functions based on it. Note that we expect functions within the same module to share the same fveclib setting, so the vector function list is still shared between functions, as part of the shared TargetLibraryInfoImpl. Inlining between functions with different veclib attributes is now blocked.
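As a rough illustration of the mechanism in the summary, the sketch below models how a per-function "veclib" attribute value could be mapped to a vector library selection. It is standalone C++, not the LLVM code: `VectorLibrary`, `FnAttrs`, and `parseVecLibAttr` are hypothetical names for this example. Only the attribute name "veclib" and the values "Accelerate", "MASSV", and "SVML" come from the patch.

```cpp
#include <map>
#include <string>

// Hypothetical stand-in for the vector libraries the patch knows about.
enum class VectorLibrary { NoLibrary, Accelerate, MASSV, SVML };

// Hypothetical stand-in for a function's string attributes (key -> value).
using FnAttrs = std::map<std::string, std::string>;

// Map the "veclib" attribute value to a library. In this sketch a missing or
// unrecognized value falls back to NoLibrary.
inline VectorLibrary parseVecLibAttr(const FnAttrs &Attrs) {
  auto It = Attrs.find("veclib");
  if (It == Attrs.end())
    return VectorLibrary::NoLibrary;
  if (It->second == "Accelerate")
    return VectorLibrary::Accelerate;
  if (It->second == "MASSV")
    return VectorLibrary::MASSV;
  if (It->second == "SVML")
    return VectorLibrary::SVML;
  return VectorLibrary::NoLibrary;
}
```

Because the setting travels with each function rather than with the compiler invocation, it survives into an LTO backend that never saw the original -fveclib flag.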

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

wenlei created this revision. Apr 6 2020, 11:59 PM

Herald added a project: Restricted Project. Apr 6 2020, 11:59 PM

Herald added subscribers: cfe-commits, dexonsmith, hiraditya.

wenlei edited the summary of this revision. Apr 7 2020, 12:05 AM

wenlei added reviewers: tejohnson, hoyFB, spatel, gchatelet.

wenlei mentioned this in D77484: [Vector] Pass VectLib to LTO backend so TLI build correct vector function list. Apr 7 2020, 12:08 AM

update

Harbormaster failed remote builds in B52155: Diff 255679! Apr 7 2020, 8:39 AM

rebase

Harbormaster failed remote builds in B52170: Diff 255703! Apr 7 2020, 10:19 AM

Needs testing of the inline handling, and of LTO linking IR with different attributes (which is going to hit your assert, see below).

clang/lib/CodeGen/CGCall.cpp
1987	Nit: why not "vec-lib" or just "veclib", to match the option?
llvm/include/llvm/Analysis/TargetLibraryInfo.h
47	Key comment about handling of TLII vs TLI. The former is computed once per module by the analysis (which is going to be the combined module in the case of LTO), the latter is the per-function data structure.
88	To avoid building and storing the VectorDescs and ScalarDescs for every function in the TLI, what I would do is keep 3 sets of VectorDescs/ScalarDescs on the TLII object (one set per possible veclib, built once per module during construction of the TLII), then move the new VectorLibrary member to the TLI and set it there per function based on the attribute, and use it to select which pair of VectorDescs/ScalarDescs is queried.
284–285	I don't think this will do anything currently since the TLII is built once per module by the analysis. You'll hit your assert about incompatibility below first, see comment there.
llvm/lib/Analysis/TargetLibraryInfo.cpp
1544	You'll certainly hit this assert if you try LTO linking two .ll files built with different -fveclib options, because the TLII is built once per module by the analysis.
1657	This is going to override the baseline TLI veclib with whatever is the latest function we build a TLI for (and you'll hit the assert as noted earlier if they conflict).
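The layout suggested in the comment on line 88 can be sketched roughly as follows: the module-level object owns one table per vector library, built once, and the per-function object only carries a pointer and a selector. This is a simplified standalone model, not the LLVM classes. `ModuleLibInfo`, `FunctionLibInfo`, and the table contents are assumptions made for illustration.

```cpp
#include <array>
#include <set>
#include <string>

enum VectorLibrary { NoLibrary, Accelerate, MASSV, SVML, NumVecLibs };

// Module-level object (the role the TLII plays): one table per library,
// built once per module.
struct ModuleLibInfo {
  std::array<std::set<std::string>, NumVecLibs> VectorizableByLib;

  ModuleLibInfo() {
    // Illustrative entries only; real tables are much larger.
    VectorizableByLib[Accelerate].insert("sqrtf");
    VectorizableByLib[MASSV].insert("sin");
    VectorizableByLib[SVML].insert("sin");
    VectorizableByLib[SVML].insert("cos");
  }
};

// Function-level view (the role the TLI plays): a pointer plus a selector,
// so it stays cheap to create and copy.
struct FunctionLibInfo {
  const ModuleLibInfo *Impl;
  VectorLibrary Lib;

  bool isFunctionVectorizable(const std::string &Name) const {
    return Impl->VectorizableByLib[Lib].count(Name) != 0;
  }
};
```

With this split, two functions in a combined LTO module can select different libraries without rebuilding or overwriting any shared table.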

Thanks for taking a look and for the suggestions, @tejohnson. Yeah, you're right about the potential conflict of attributes. Initially I thought that even though we now allow this to be per-function, it still comes from a per-module switch, so module-level consistency could be expected, which simplifies things a bit (hence the assertion). But I overlooked the combined module from LTO.

I will get back to this later in the week. The change will be a bit more involved, as there are a few other places where we populate the module-level function list directly for the TLII (clang and opt).

address feedback, allow functions within a module to have different vectlib setting. add test case for inline compatibility check.

wenlei marked 5 inline comments as done. Apr 9 2020, 11:58 AM

Harbormaster failed remote builds in B52544: Diff 256350! Apr 9 2020, 12:11 PM

Great, thanks! A few minor comments below. Looks like it needs a clang-format too.

llvm/include/llvm/Analysis/TargetLibraryInfo.h
260	Suggest moving the implementation of this constructor to the .cpp file, in which case you can just set VectLibrary directly from ClVectorLibrary there and remove the member on the Impl object.
283	This is set via a flag called "inline-caller-superset-nobuiltin". Suggest changing the name to something like "inline-caller-superset-tli" to reflect new larger scope. Also add a check with that option to your new inline test case.
llvm/lib/Analysis/TargetLibraryInfo.cpp
575–576	Why not just have "i < NumVecLibs"?
586–587	ditto
1569	Should these two be llvm_unreachable?
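To make the inline compatibility discussion concrete, here is a small standalone sketch of a check with a strict mode and a "caller superset" mode. The types and names are hypothetical, and the superset rule for the vector library (the callee may use the same library or none) is an assumption for this example; the patch itself defines the exact rule.

```cpp
#include <bitset>

enum VectorLibrary { NoLibrary, Accelerate, MASSV, SVML };

// Hypothetical per-function summary: which builtins are disabled and which
// vector library is selected.
struct FnLibInfo {
  std::bitset<8> UnavailableBuiltins; // size 8 is arbitrary for the sketch
  VectorLibrary VecLib = NoLibrary;
};

inline bool areInlineCompatible(const FnLibInfo &Caller,
                                const FnLibInfo &Callee,
                                bool AllowCallerSuperset) {
  // Strict mode: both settings must match exactly.
  if (!AllowCallerSuperset)
    return Caller.VecLib == Callee.VecLib &&
           Caller.UnavailableBuiltins == Callee.UnavailableBuiltins;

  // Superset mode (assumed rule): the callee's disabled builtins must be a
  // subset of the caller's, and the callee must use the caller's vector
  // library or none at all.
  bool BuiltinsOk = (Caller.UnavailableBuiltins | Callee.UnavailableBuiltins) ==
                    Caller.UnavailableBuiltins;
  bool VecLibOk =
      Callee.VecLib == Caller.VecLib || Callee.VecLib == NoLibrary;
  return BuiltinsOk && VecLibOk;
}
```

Folding both conditions into one predicate is what motivates a flag name that covers the whole TLI rather than only no-builtin.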

wenlei marked 4 inline comments as done. Apr 9 2020, 1:31 PM

wenlei added inline comments.

llvm/include/llvm/Analysis/TargetLibraryInfo.h
260	There're utilities that use `TargetLibraryInfo`, but don't link with `TargetLibraryInfo.o`. And looking at `TargetLibraryInfo`, all of the functions are in this header, so I assumed it's intentional to keep this type self-contained in this header, as it's public API, which is why I add `ClVectorLibrary` to Impl to pass it back to `TargetLibraryInfo`. For `TargetLibraryInfoImpl`, it's ok to have the implementation outside of the header. I can give it a try if keeping the class implementation/definition self-contained in the header isn't important.
llvm/lib/Analysis/TargetLibraryInfo.cpp
575–576	Good catch, thanks.

tejohnson added inline comments. Apr 9 2020, 1:58 PM

llvm/include/llvm/Analysis/TargetLibraryInfo.h
260	I don't think there should be anything using TLI without linking with libAnalysis, which contains TargetLibraryInfo.o. I don't think it should be important to keep the implementation in the header, any more so than for other headers in the Analysis library.

address feedback

Herald added subscribers: haicheng, eraman. Apr 9 2020, 3:56 PM

wenlei marked 4 inline comments as done. Apr 9 2020, 3:58 PM

wenlei added inline comments.

llvm/include/llvm/Analysis/TargetLibraryInfo.h
260	Ok, I moved it to .cpp, thanks!

lgtm. I think one check is missing in the test, see comment below.

llvm/test/Transforms/Inline/veclib-compat.ll
29	I think NOSUPERSET should also check that there is still a call here. You can probably replace some of the duplicated checks with COMMON in this function too.

This revision is now accepted and ready to land. Apr 9 2020, 4:21 PM

Harbormaster failed remote builds in B52589: Diff 256437! Apr 9 2020, 4:21 PM

rebase, update test

Closed by commit rG60c642e74be6: [TLI] Per-function fveclib for math library used for vectorization (authored by wenlei). Apr 9 2020, 6:31 PM

This revision was automatically updated to reflect the committed changes.

Harbormaster failed remote builds in B52606: Diff 256465! Apr 9 2020, 6:32 PM

This change causes a ~0.5% compile-time regression: http://llvm-compile-time-tracker.com/compare.php?from=5b18b6e9a84d985c0a907009fb71de7c1943bc88&to=60c642e74be6af86906d9f3d982728be7bd4329f&stat=instructions This is quite a lot as these things go, so it would be great if you could double check whether there's any optimization potential here. In particular I'm wondering why this affects normal builds so much, even though they (presumably?) don't use any veclib at all.

In reply to @nikic (D77632#1974015):

Thanks for the heads-up. This is surprising, but there is a change even when veclib is not used: in order to allow each function to use a different veclib without duplicating the work of populating the vector function list for each function, we now always pre-populate the vector function lists for the three supported vector libraries for each module. However, a 0.5% compile-time cost for that work is not expected, given that it's per-module. I suspect we may be passing/copying the TLII around more than we anticipated (now we always have more stuff to copy). I will take a look. We could also turn this into a lazy initialization: only populate the needed list for the module-level TLII when it's first queried by a function-level TLI.
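The lazy initialization idea mentioned at the end of this comment could look roughly like the sketch below: each per-library table is filled on first request instead of at construction. This is a standalone model with hypothetical names (`LazyModuleLibInfo`, `populate`, `populateCalls`) and placeholder table contents, not the code from the patch.

```cpp
#include <array>
#include <cstddef>
#include <string>
#include <vector>

enum VectorLibrary { NoLibrary, Accelerate, MASSV, SVML, NumVecLibs };

class LazyModuleLibInfo {
  std::array<std::vector<std::string>, NumVecLibs> Tables;
  std::array<bool, NumVecLibs> Populated{}; // all false initially
  std::size_t PopulateCalls = 0;            // counts populate() calls

  void populate(VectorLibrary Lib) {
    ++PopulateCalls;
    // Placeholder contents; a real table would come from the library's
    // function list and would be sorted here.
    if (Lib == SVML) {
      Tables[Lib].push_back("cos");
      Tables[Lib].push_back("sin");
    }
    Populated[Lib] = true;
  }

public:
  // Build the table for Lib on first use, then reuse it.
  const std::vector<std::string> &get(VectorLibrary Lib) {
    if (!Populated[Lib])
      populate(Lib);
    return Tables[Lib];
  }

  std::size_t populateCalls() const { return PopulateCalls; }
};
```

A build that never selects a vector library would then only ever touch the empty NoLibrary table.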

In reply to @wenlei (D77632#1974363):

Hmm, yeah that is surprising, because the TLII should be built once per module per TLI analysis, which is never invalidated. We've gone from populating one set of vec libs to 3, I wouldn't have thought that was particularly expensive, so it would be good to see what is going on here and confirm we are only building this once as expected.

Looking at the compile time data at that link, interestingly the "instructions" metric increased, but not wall time or cycles or task clock - they were all neutral.

The existing TLI provides a very convenient way to define a VecLib without LLVM knowing about it ahead of time. This feature is important for any embedded use of LLVM as a library out-of-tree (I'll add a unit-test in-tree).
I don't think it is a big change to this patch to preserve the current ability, but I wanted to check first (and in the meantime I reverted it temporarily in https://reviews.llvm.org/D77925 to avoid the feature regression).

At the moment the place where you seem to use this knowledge is the enum VectorLibrary in the TargetLibraryInfoImpl class, and the VecLibDescs array, which statically contains the known VecLibs.
It seems to me that if we replace this enum with a string to identify the VecLib, everything should still hold together, and this would fit with minor changes to this patch. The VecLibDescs could just be a StringMap<VectorLibraryDescriptors> in this case.

That way a third party (in my case the XLA compiler) can still register its own "XLA" VecLib and add all the descriptors.

How does it sound?
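A minimal sketch of the string-keyed proposal above, assuming a plain `std::unordered_map` in place of LLVM's StringMap. `VecLibRegistry`, `VecDesc`, and the vector function name `xla_vexpf` used in the usage note are hypothetical; the point is only that an out-of-tree client can register a library name LLVM has never heard of.

```cpp
#include <string>
#include <unordered_map>
#include <utility>
#include <vector>

// Hypothetical descriptor: scalar function name -> vector function name.
struct VecDesc {
  std::string ScalarName;
  std::string VectorName;
};

class VecLibRegistry {
  // Keyed by library name, so the set of libraries is open-ended.
  std::unordered_map<std::string, std::vector<VecDesc>> Libs;

public:
  void addVectorizableFunctions(const std::string &LibName,
                                std::vector<VecDesc> Descs) {
    std::vector<VecDesc> &Slot = Libs[LibName];
    for (VecDesc &D : Descs)
      Slot.push_back(std::move(D));
  }

  // Returns the vector name for Scalar in LibName, or "" if there is none.
  std::string lookup(const std::string &LibName,
                     const std::string &Scalar) const {
    auto It = Libs.find(LibName);
    if (It == Libs.end())
      return "";
    for (const VecDesc &D : It->second)
      if (D.ScalarName == Scalar)
        return D.VectorName;
    return "";
  }
};
```

A client would call `addVectorizableFunctions("XLA", ...)` with descriptors such as `VecDesc{"expf", "xla_vexpf"}` and tag its functions with the same name.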

In reply to @mehdi_amini (D77632#1975619):

I think this should work. Just reiterating something we chatted about off patch yesterday, we really need a unit test that mimics the behavior utilized by the XLA compiler, for regression testing.

In D77632#1976126, @tejohnson wrote:

I think this should work. Just reiterating something we chatted about off patch yesterday, we really need a unit test that mimics the behavior utilized by the XLA compiler, for regression testing.

Yes I pinged some of the XLA folks to make it happen.

In reply to @mehdi_amini (D77632#1975619):

Thanks for the explanation about the revert. The proposal of using a StringMap to maintain the openness sounds good to me. And agree with @tejohnson, if the openness is a feature, it should be covered in tests, otherwise it can feel somewhat like a loophole and prone to breakage, though I can see how it can be useful. I hope this patch can be restored with tweaks soon (we have workloads with very visible vectorization that rely on this).

In reply to @tejohnson (D77632#1974409):

Turns out there're a few places where we call copy ctor for TLI unnecessarily. Made some changes in D77952 to use move when possible. In addition, I should have used move for TLI.VecLibDescs in move ctor of TargetLibraryInfoImpl too.
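The copy-versus-move point can be illustrated with a small standalone type. `ModuleLibInfo` here is a hypothetical stand-in for the module-level object, with a single `VecLibDescs` member modeled as a vector of strings.

```cpp
#include <string>
#include <utility>
#include <vector>

struct ModuleLibInfo {
  std::vector<std::string> VecLibDescs;

  ModuleLibInfo() = default;

  // Copy: duplicates every descriptor.
  ModuleLibInfo(const ModuleLibInfo &Other) = default;

  // Move: take over the storage. Writing VecLibDescs(Other.VecLibDescs)
  // here would silently copy, because a named rvalue reference is an lvalue.
  ModuleLibInfo(ModuleLibInfo &&Other) noexcept
      : VecLibDescs(std::move(Other.VecLibDescs)) {}
};
```

The more tables the object carries, the more each accidental copy costs, which is why a missing `std::move` in a hand-written move constructor matters more after this change.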

In D77632#1976231, @wenlei wrote:

And agree with @tejohnson, if the openness is a feature, it should be covered in tests, otherwise it can feel somewhat like a loophole and prone to breakage

The thing is that LLVM does not have many C++ unit tests in general, so most of the "features" like this one that LLVM offers as a library are just an artifact of being "designed as a library" and being mindful about the layering.
From this point of view this patch is changing the design of a component that was modular/pluggable into a closed system. I'm perfectly fine with trying to add a unit-test, I just don't know yet where it would fit in the LLVM testing though.

In D77632#1976240, @wenlei wrote:

Turns out there're a few places where we call copy ctor for TLI unnecessarily.

I assume you mean the TargetLibraryInfoImpl (TLII) here, not the TargetLibraryInfo (TLI), right? The latter should be cheap to copy. Are these the changes in BackendUtil.cpp in D77952? I had a question about that on that patch as I think we will be calling the initializer more. Mostly we should only be copying the TargetLibraryInfo during optimization though, and not the TLII impl object.

Made some changes in D77952 to use move when possible. In addition, I should have used move for TLI.VecLibDescs in move ctor of TargetLibraryInfoImpl too.

tejohnson mentioned this in D77952: [TLII] Reduce copies of TLII for TLA. Apr 12 2020, 2:39 PM

I gave D77952 a try (on top of this one), but didn't see a significant improvement from that change.

Looking at the callgrind output for compilation of a small file, I see 52M total instructions, 4 calls to TLII initialization, where addition of the vector functions takes up the majority of the time, at 0.7M. Most of the cost is in the sorting. 2 of the initialization calls are default-constructed TLII without target triple, which seems suspect to me (are we not adding TLI early enough, and something pulls it in via analysis dependency?)

So for small files, just registering the vector functions does make up a non-trivial fraction of time, and lazy initialization might make sense. This isn't the whole truth though: While the largest regressions are indeed on small files, there are also quite a few > 1% regressions on very large files.

For a mid-size file with ~6000M instructions retired, the main difference I see is TargetLibraryAnalysis::run() going up from 82M to 126M, with the cost coming from the extra getFnAttribute("veclib") call in the TargetLibraryInfo constructor. Fetching attributes is surprisingly expensive, as it performs an iteration over all attributes internally. As this code is iterating over all attributes anyway in order to handle no-builtin-*, it might make sense to move the checks for "veclib" and "no-builtins" into that loop as well, which should make them essentially free.
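The single-pass idea suggested here can be sketched as follows. This is a standalone model, not LLVM's attribute API: `AttrList`, `FnLibSettings`, and `scanAttributes` are hypothetical names, and attributes are modeled as plain key/value string pairs.

```cpp
#include <string>
#include <utility>
#include <vector>

// Hypothetical attribute list: (key, value) string pairs.
using AttrList = std::vector<std::pair<std::string, std::string>>;

struct FnLibSettings {
  bool NoBuiltins = false;                   // "no-builtins" was seen
  std::vector<std::string> DisabledBuiltins; // names from "no-builtin-<name>"
  std::string VecLib;                        // value of "veclib", if any
};

// Collect all three kinds of settings in one loop instead of doing one
// lookup (each of which scans the list) per attribute name.
inline FnLibSettings scanAttributes(const AttrList &Attrs) {
  static const std::string Prefix = "no-builtin-";
  FnLibSettings S;
  for (const auto &KV : Attrs) {
    const std::string &Key = KV.first;
    if (Key == "veclib")
      S.VecLib = KV.second;
    else if (Key == "no-builtins")
      S.NoBuiltins = true;
    else if (Key.compare(0, Prefix.size(), Prefix) == 0)
      S.DisabledBuiltins.push_back(Key.substr(Prefix.size()));
  }
  return S;
}
```

The loop does the same amount of iteration as handling no-builtin-* alone, so the extra checks add only a couple of string comparisons per attribute.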

In D77632#1976308, @mehdi_amini wrote:

In D77632#1976231, @wenlei wrote:

And agree with @tejohnson, if the openness is a feature, it should be covered in tests, otherwise it can feel somewhat like a loophole and prone to breakage

The thing is that LLVM does not have many C++ unit tests in general, so most of the "features" like this one that LLVM offers as a library are just an artifact of being "designed as a library" and being mindful about the layering.
From this point of view this patch is changing the design of a component that was modular/pluggable into a closed system.

The interfaces being relied on were in the underlying Impl class, I think if that is expected to be pluggable and stable it really needs unit testing to reflect that usage.

I'm perfectly fine with trying to add a unit-test, I just don't know yet where it would fit in the LLVM testing though.

Presumably the testing should be in llvm/unittests/Analysis/TargetLibraryInfoTest.cpp, which already exists but only tests the LibFuncs (builtins) interfaces.

uabelho added a subscriber: uabelho. Apr 14 2020, 4:35 AM

nikic mentioned this in D78665: [TLI] Optimize no-builtins attribute check (NFC). Apr 22 2020, 12:58 PM

tejohnson mentioned this in D77925: Revert "[TLI] Per-function fveclib for math library used for vectorization". May 2 2020, 9:09 AM

Revision Contents

Path                                                           Size
clang/lib/CodeGen/BackendUtil.cpp                              29 lines
clang/lib/CodeGen/CGCall.cpp                                   21 lines
clang/test/CodeGen/libcalls-veclib.c                           14 lines
llvm/include/llvm/Analysis/TargetLibraryInfo.h                 154 lines
llvm/lib/Analysis/InlineCost.cpp                               9 lines
llvm/lib/Analysis/TargetLibraryInfo.cpp                        91 lines
llvm/test/Transforms/Inline/inline-no-builtin-compatible.ll    4 lines
llvm/test/Transforms/Inline/veclib-compat.ll                   48 lines

Diff 256468

clang/lib/CodeGen/BackendUtil.cpp

@@ static void addDataFlowSanitizerPass(const PassManagerBuilder &Builder,
   PM.add(createDataFlowSanitizerPass(LangOpts.SanitizerBlacklistFiles));
 }

 static void addMemTagOptimizationPasses(const PassManagerBuilder &Builder,
                                         legacy::PassManagerBase &PM) {
   PM.add(createStackSafetyGlobalInfoWrapperPass(/*SetMetadata=*/true));
 }

-static TargetLibraryInfoImpl *createTLII(llvm::Triple &TargetTriple,
-                                         const CodeGenOptions &CodeGenOpts) {
-  TargetLibraryInfoImpl *TLII = new TargetLibraryInfoImpl(TargetTriple);
-
-  switch (CodeGenOpts.getVecLib()) {
-  case CodeGenOptions::Accelerate:
-    TLII->addVectorizableFunctionsFromVecLib(TargetLibraryInfoImpl::Accelerate);
-    break;
-  case CodeGenOptions::MASSV:
-    TLII->addVectorizableFunctionsFromVecLib(TargetLibraryInfoImpl::MASSV);
-    break;
-  case CodeGenOptions::SVML:
-    TLII->addVectorizableFunctionsFromVecLib(TargetLibraryInfoImpl::SVML);
-    break;
-  default:
-    break;
-  }
-  return TLII;
+static TargetLibraryInfoImpl *createTLII(llvm::Triple &TargetTriple) {
+  return new TargetLibraryInfoImpl(TargetTriple);
 }

 static void addSymbolRewriterPass(const CodeGenOptions &Opts,
                                   legacy::PassManager *MPM) {
   llvm::SymbolRewriter::RewriteDescriptorList DL;

   llvm::SymbolRewriter::RewriteMapParser MapParser;
   for (const auto &MapFile : Opts.RewriteMapFiles)
@@ void EmitAssemblyHelper::CreatePasses(legacy::PassManager &MPM,
   if (CodeGenOpts.DisableLLVMPasses)
     return;

   // Figure out TargetLibraryInfo. This needs to be added to MPM and FPM
   // manually (and not via PMBuilder), since some passes (eg. InstrProfiling)
   // are inserted before PMBuilder ones - they'd get the default-constructed
   // TLI with an unknown target otherwise.
   Triple TargetTriple(TheModule->getTargetTriple());
-  std::unique_ptr<TargetLibraryInfoImpl> TLII(
-      createTLII(TargetTriple, CodeGenOpts));
+  std::unique_ptr<TargetLibraryInfoImpl> TLII(createTLII(TargetTriple));

   // If we reached here with a non-empty index file name, then the index file
   // was empty and we are not performing ThinLTO backend compilation (used in
   // testing in a distributed build environment). Drop any the type test
   // assume sequences inserted for whole program vtables so that codegen doesn't
   // complain.
   if (!CodeGenOpts.ThinLTOIndexFile.empty())
     MPM.add(createLowerTypeTestsPass(/*ExportSummary=*/nullptr,
@@
 }

 bool EmitAssemblyHelper::AddEmitPasses(legacy::PassManager &CodeGenPasses,
                                        BackendAction Action,
                                        raw_pwrite_stream &OS,
                                        raw_pwrite_stream *DwoOS) {
   // Add LibraryInfo.
   llvm::Triple TargetTriple(TheModule->getTargetTriple());
-  std::unique_ptr<TargetLibraryInfoImpl> TLII(
-      createTLII(TargetTriple, CodeGenOpts));
+  std::unique_ptr<TargetLibraryInfoImpl> TLII(createTLII(TargetTriple));
   CodeGenPasses.add(new TargetLibraryInfoWrapperPass(*TLII));

   // Normal mode, emit a .s or .o file by running the code generator. Note,
   // this also adds codegenerator level optimization passes.
   CodeGenFileType CGFT = getCodeGenFileType(Action);

   // Add ObjC ARC final-cleanup optimizations. This is done as part of the
   // "codegen" passes so that it isn't run multiple times when there is
@@ #include "llvm/Support/Extension.def"
   ModuleAnalysisManager MAM(CodeGenOpts.DebugPassManager);

   // Register the AA manager first so that our version is the one used.
   FAM.registerPass([&] { return PB.buildDefaultAAPipeline(); });

   // Register the target library analysis directly and give it a customized
   // preset TLI.
   Triple TargetTriple(TheModule->getTargetTriple());
-  std::unique_ptr<TargetLibraryInfoImpl> TLII(
-      createTLII(TargetTriple, CodeGenOpts));
+  std::unique_ptr<TargetLibraryInfoImpl> TLII(createTLII(TargetTriple));
   FAM.registerPass([&] { return TargetLibraryAnalysis(*TLII); });

   // Register all the basic analyses with the managers.
   PB.registerModuleAnalyses(MAM);
   PB.registerCGSCCAnalyses(CGAM);
   PB.registerFunctionAnalyses(FAM);
   PB.registerLoopAnalyses(LAM);
   PB.crossRegisterProxies(LAM, FAM, CGAM, MAM);

clang/lib/CodeGen/CGCall.cpp

@@ if (llvm::is_contained(NBA->builtinNames(), "*")) {
     FuncAttrs.addAttribute("no-builtins");
     return;
   }

   // And last, add the rest of the builtin names.
   llvm::for_each(NBA->builtinNames(), AddNoBuiltinAttr);
 }

+static void addVectLibAttributes(llvm::AttrBuilder &FuncAttrs,
+                                 const CodeGenOptions &CodeGenOpts) {
+  StringRef AttributeName = "veclib";
+  switch (CodeGenOpts.getVecLib()) {
+  case CodeGenOptions::Accelerate:
+    FuncAttrs.addAttribute(AttributeName, "Accelerate");
+    break;
+  case CodeGenOptions::MASSV:
+    FuncAttrs.addAttribute(AttributeName, "MASSV");
+    break;
+  case CodeGenOptions::SVML:
+    FuncAttrs.addAttribute(AttributeName, "SVML");
+    break;
+  case CodeGenOptions::NoLibrary:
+    break;
+  }
+}
+
 void CodeGenModule::ConstructAttributeList(
     StringRef Name, const CGFunctionInfo &FI, CGCalleeInfo CalleeInfo,
     llvm::AttributeList &AttrList, unsigned &CallingConv, bool AttrOnCallSite) {
   llvm::AttrBuilder FuncAttrs;
   llvm::AttrBuilder RetAttrs;

   CallingConv = FI.getEffectiveCallingConvention();
   if (FI.isNoReturn())
▲ Show 20 Lines • Show All 82 Lines • ▼ Show 20 Lines	void CodeGenModule::ConstructAttributeList(
// Attach "no-builtins" attributes to:		// Attach "no-builtins" attributes to:
// * call sites: both `nobuiltin` and "no-builtins" or "no-builtin-<name>".		// * call sites: both `nobuiltin` and "no-builtins" or "no-builtin-<name>".
// * definitions: "no-builtins" or "no-builtin-<name>" only.		// * definitions: "no-builtins" or "no-builtin-<name>" only.
// The attributes can come from:		// The attributes can come from:
// * LangOpts: -ffreestanding, -fno-builtin, -fno-builtin-<name>		// * LangOpts: -ffreestanding, -fno-builtin, -fno-builtin-<name>
// * FunctionDecl attributes: __attribute__((no_builtin(...)))		// * FunctionDecl attributes: __attribute__((no_builtin(...)))
addNoBuiltinAttributes(FuncAttrs, getLangOpts(), NBA);		addNoBuiltinAttributes(FuncAttrs, getLangOpts(), NBA);

		// Attach "veclib" attribute to function based on '-fveclib' setting.
tejohnson (inline comment, Done): Nit: why not "vec-lib" or just "veclib", to match the option?
		addVectLibAttributes(FuncAttrs, getCodeGenOpts());

ConstructDefaultFnAttrList(Name, HasOptnone, AttrOnCallSite, FuncAttrs);		ConstructDefaultFnAttrList(Name, HasOptnone, AttrOnCallSite, FuncAttrs);

// This must run after constructing the default function attribute list		// This must run after constructing the default function attribute list
// to ensure that the speculative load hardening attribute is removed		// to ensure that the speculative load hardening attribute is removed
// in the case where the -mspeculative-load-hardening flag was passed.		// in the case where the -mspeculative-load-hardening flag was passed.
if (TargetDecl) {		if (TargetDecl) {
if (TargetDecl->hasAttr<NoSpeculativeLoadHardeningAttr>())		if (TargetDecl->hasAttr<NoSpeculativeLoadHardeningAttr>())
FuncAttrs.removeAttribute(llvm::Attribute::SpeculativeLoadHardening);		FuncAttrs.removeAttribute(llvm::Attribute::SpeculativeLoadHardening);
[... 2,869 lines hidden ...]
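
For readers without a Clang checkout, the mapping done by `addVectLibAttributes` above can be approximated in isolation. This is only a minimal sketch: `VecLib`, `AttrMap`, `vecLibAttrValue` and `addVecLibAttribute` are hypothetical stand-ins for `CodeGenOptions::VectorLibrary`, `llvm::AttrBuilder` and the patch's helper, not the real Clang or LLVM types.

```cpp
#include <map>
#include <string>

// Hypothetical stand-in for the -fveclib choices held in CodeGenOptions.
enum class VecLib { NoLibrary, Accelerate, MASSV, SVML };

// Hypothetical stand-in for llvm::AttrBuilder: string attributes as key/value.
using AttrMap = std::map<std::string, std::string>;

// Returns the value for the "veclib" attribute, or nullptr when no vector
// library is selected (in which case no attribute is emitted at all).
const char *vecLibAttrValue(VecLib Lib) {
  switch (Lib) {
  case VecLib::Accelerate:
    return "Accelerate";
  case VecLib::MASSV:
    return "MASSV";
  case VecLib::SVML:
    return "SVML";
  case VecLib::NoLibrary:
    return nullptr;
  }
  return nullptr;
}

// Attaches "veclib"="<name>" only when a library was requested.
void addVecLibAttribute(AttrMap &Attrs, VecLib Lib) {
  if (const char *Value = vecLibAttrValue(Lib))
    Attrs["veclib"] = Value;
}
```

Emitting nothing for `NoLibrary` matches the `NOLIB-NOT: "veclib"` check in the test file that follows.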

clang/test/CodeGen/libcalls-veclib.c

This file was added.

				// RUN: %clang_cc1 -S -emit-llvm -fveclib=SVML -o - %s \| FileCheck --check-prefixes=SVML %s
				// RUN: %clang_cc1 -S -emit-llvm -fveclib=Accelerate -o - %s \| FileCheck --check-prefixes=ACCELERATE %s
				// RUN: %clang_cc1 -S -emit-llvm -fveclib=MASSV -o - %s \| FileCheck --check-prefixes=MASSV %s
				// RUN: %clang_cc1 -S -emit-llvm -fveclib=none -o - %s \| FileCheck --check-prefixes=NOLIB %s
				// RUN: %clang_cc1 -S -emit-llvm -o - %s \| FileCheck --check-prefixes=NOLIB %s

				int main() {
				return 0;
				}

				// SVML: "veclib"="SVML"
				// ACCELERATE: "veclib"="Accelerate"
				// MASSV: "veclib"="MASSV"
				// NOLIB-NOT: "veclib"
				No newline at end of file

llvm/include/llvm/Analysis/TargetLibraryInfo.h

[... 38 lines hidden ...]	#include "llvm/Analysis/TargetLibraryInfo.def"
NotLibFunc		NotLibFunc
};		};

/// Implementation of the target library information.		/// Implementation of the target library information.
///		///
/// This class constructs tables that hold the target library information and		/// This class constructs tables that hold the target library information and
/// make it available. However, it is somewhat expensive to compute and only		/// make it available. However, it is somewhat expensive to compute and only
/// depends on the triple. So users typically interact with the \c		/// depends on the triple. So users typically interact with the \c
/// TargetLibraryInfo wrapper below.		/// TargetLibraryInfo wrapper below.
tejohnson (inline comment, Not Done): Key comment about handling of TLII vs TLI. The former is computed once per module by the analysis (which is going to be the combined module in the case of LTO), the latter is the per-function data structure.
class TargetLibraryInfoImpl {		class TargetLibraryInfoImpl {
friend class TargetLibraryInfo;		friend class TargetLibraryInfo;

		public:
		/// List of known vector-functions libraries.
		///
		/// The vector-functions library defines, which functions are vectorizable
		/// and with which factor. The library can be specified by either frontend,
		/// or a commandline option, and then used by
		/// addVectorizableFunctionsFromVecLib for filling up the tables of
		/// vectorizable functions.
		enum VectorLibrary {
		Accelerate, // Use Accelerate framework.
		MASSV, // IBM MASS vector library.
		SVML, // Intel short vector math library.
		NumVecLibs, // Number of supported vector libraries.
		NoLibrary // Don't use any vector library.
		};

		private:
unsigned char AvailableArray[(NumLibFuncs+3)/4];		unsigned char AvailableArray[(NumLibFuncs+3)/4];
llvm::DenseMap<unsigned, std::string> CustomNames;		llvm::DenseMap<unsigned, std::string> CustomNames;
static StringLiteral const StandardNames[NumLibFuncs];		static StringLiteral const StandardNames[NumLibFuncs];
bool ShouldExtI32Param, ShouldExtI32Return, ShouldSignExtI32Param;		bool ShouldExtI32Param, ShouldExtI32Return, ShouldSignExtI32Param;

enum AvailabilityState {		enum AvailabilityState {
StandardName = 3, // (memset to all ones)		StandardName = 3, // (memset to all ones)
CustomName = 1,		CustomName = 1,
Unavailable = 0 // (memset to all zeros)		Unavailable = 0 // (memset to all zeros)
};		};
void setState(LibFunc F, AvailabilityState State) {		void setState(LibFunc F, AvailabilityState State) {
AvailableArray[F/4] &= ~(3 << 2*(F&3));		AvailableArray[F/4] &= ~(3 << 2*(F&3));
AvailableArray[F/4] \|= State << 2*(F&3);		AvailableArray[F/4] \|= State << 2*(F&3);
}		}
AvailabilityState getState(LibFunc F) const {		AvailabilityState getState(LibFunc F) const {
return static_cast<AvailabilityState>((AvailableArray[F/4] >> 2*(F&3)) & 3);		return static_cast<AvailabilityState>((AvailableArray[F/4] >> 2*(F&3)) & 3);
}		}

		/// Vector library descriptor for all supported ones.
		struct VectorLibraryDescriptors {
/// Vectorization descriptors - sorted by ScalarFnName.		/// Vectorization descriptors - sorted by ScalarFnName.
tejohnson (inline comment, Done): To avoid building and storing the VectorDescs and ScalarDescs for every function in the TLI, what I would do is keep 3 sets of VectorDescs/ScalarDescs on the TLII object (one set per possible veclib, built once per module during construction of the TLII), then move the new VectorLibrary member to the TLI and set it there per function based on the attribute, and use it to select which pair of VectorDescs/ScalarDescs is queried.
std::vector<VecDesc> VectorDescs;		std::vector<VecDesc> VectorDescs;
/// Scalarization descriptors - same content as VectorDescs but sorted based		/// Scalarization descriptors - same content as VectorDescs but sorted based
/// on VectorFnName rather than ScalarFnName.		/// on VectorFnName rather than ScalarFnName.
std::vector<VecDesc> ScalarDescs;		std::vector<VecDesc> ScalarDescs;
		} VecLibDescs[NumVecLibs];

/// Return true if the function type FTy is valid for the library function		/// Return true if the function type FTy is valid for the library function
/// F, regardless of whether the function is available.		/// F, regardless of whether the function is available.
bool isValidProtoForLibFunc(const FunctionType &FTy, LibFunc F,		bool isValidProtoForLibFunc(const FunctionType &FTy, LibFunc F,
const DataLayout *DL) const;		const DataLayout *DL) const;

public:		/// Add a set of scalar -> vector mappings, queryable via
/// List of known vector-functions libraries.		/// getVectorizedFunction and getScalarizedFunction.
///		void addVectorizableFunctions(ArrayRef<VecDesc> Fns,
/// The vector-functions library defines, which functions are vectorizable		VectorLibraryDescriptors &VetLibDescs);
/// and with which factor. The library can be specified by either frontend,
/// or a commandline option, and then used by
/// addVectorizableFunctionsFromVecLib for filling up the tables of
/// vectorizable functions.
enum VectorLibrary {
NoLibrary, // Don't use any vector library.
Accelerate, // Use Accelerate framework.
MASSV, // IBM MASS vector library.
SVML // Intel short vector math library.
};

		/// Calls addVectorizableFunctionsFromVecLib with a known preset of functions
		/// for the given vector library.
		void addVectorizableFunctionsFromVecLib(enum VectorLibrary VecLib,
		VectorLibraryDescriptors &VetLibDesc);

		public:
TargetLibraryInfoImpl();		TargetLibraryInfoImpl();
explicit TargetLibraryInfoImpl(const Triple &T);		explicit TargetLibraryInfoImpl(const Triple &T);

// Provide value semantics.		// Provide value semantics.
TargetLibraryInfoImpl(const TargetLibraryInfoImpl &TLI);		TargetLibraryInfoImpl(const TargetLibraryInfoImpl &TLI);
TargetLibraryInfoImpl(TargetLibraryInfoImpl &&TLI);		TargetLibraryInfoImpl(TargetLibraryInfoImpl &&TLI);
TargetLibraryInfoImpl &operator=(const TargetLibraryInfoImpl &TLI);		TargetLibraryInfoImpl &operator=(const TargetLibraryInfoImpl &TLI);
TargetLibraryInfoImpl &operator=(TargetLibraryInfoImpl &&TLI);		TargetLibraryInfoImpl &operator=(TargetLibraryInfoImpl &&TLI);
[... 33 lines hidden ...]	void setAvailableWithName(LibFunc F, StringRef Name) {
}		}
}		}

/// Disables all builtins.		/// Disables all builtins.
///		///
/// This can be used for options like -fno-builtin.		/// This can be used for options like -fno-builtin.
void disableAllFunctions();		void disableAllFunctions();

/// Add a set of scalar -> vector mappings, queryable via		/// Populate VectorLibraryDescriptors for all supported vector libraries.
/// getVectorizedFunction and getScalarizedFunction.		void addAllVectorizableFunctions();
void addVectorizableFunctions(ArrayRef<VecDesc> Fns);

/// Calls addVectorizableFunctions with a known preset of functions for the
/// given vector library.
void addVectorizableFunctionsFromVecLib(enum VectorLibrary VecLib);

/// Return true if the function F has a vector equivalent with vectorization		/// Return true if the function F has a vector equivalent with vectorization
/// factor VF.		/// factor VF.
bool isFunctionVectorizable(StringRef F, unsigned VF) const {		bool isFunctionVectorizable(StringRef F, unsigned VF,
return !getVectorizedFunction(F, VF).empty();		VectorLibrary vecLib) const {
		return !getVectorizedFunction(F, VF, vecLib).empty();
}		}

/// Return true if the function F has a vector equivalent with any		/// Return true if the function F has a vector equivalent with any
/// vectorization factor.		/// vectorization factor.
bool isFunctionVectorizable(StringRef F) const;		bool isFunctionVectorizable(StringRef F, VectorLibrary vecLib) const;

/// Return the name of the equivalent of F, vectorized with factor VF. If no		/// Return the name of the equivalent of F, vectorized with factor VF. If no
/// such mapping exists, return the empty string.		/// such mapping exists, return the empty string.
StringRef getVectorizedFunction(StringRef F, unsigned VF) const;		StringRef getVectorizedFunction(StringRef F, unsigned VF,
		VectorLibrary vecLib) const;

/// Return true if the function F has a scalar equivalent, and set VF to be		/// Return true if the function F has a scalar equivalent, and set VF to be
/// the vectorization factor.		/// the vectorization factor.
bool isFunctionScalarizable(StringRef F, unsigned &VF) const {		bool isFunctionScalarizable(StringRef F, unsigned &VF,
return !getScalarizedFunction(F, VF).empty();		VectorLibrary vecLib) const {
		return !getScalarizedFunction(F, VF, vecLib).empty();
}		}

/// Return the name of the equivalent of F, scalarized. If no such mapping		/// Return the name of the equivalent of F, scalarized. If no such mapping
/// exists, return the empty string.		/// exists, return the empty string.
///		///
/// Set VF to the vectorization factor.		/// Set VF to the vectorization factor.
StringRef getScalarizedFunction(StringRef F, unsigned &VF) const;		StringRef getScalarizedFunction(StringRef F, unsigned &VF,
		VectorLibrary vecLib) const;

/// Set to true iff i32 parameters to library functions should have signext		/// Set to true iff i32 parameters to library functions should have signext
/// or zeroext attributes if they correspond to C-level int or unsigned int,		/// or zeroext attributes if they correspond to C-level int or unsigned int,
/// respectively.		/// respectively.
void setShouldExtI32Param(bool Val) {		void setShouldExtI32Param(bool Val) {
ShouldExtI32Param = Val;		ShouldExtI32Param = Val;
}		}

[... 11 lines hidden ...]	public:
}		}

/// Returns the size of the wchar_t type in bytes or 0 if the size is unknown.		/// Returns the size of the wchar_t type in bytes or 0 if the size is unknown.
/// This queries the 'wchar_size' metadata.		/// This queries the 'wchar_size' metadata.
unsigned getWCharSize(const Module &M) const;		unsigned getWCharSize(const Module &M) const;

/// Returns the largest vectorization factor used in the list of		/// Returns the largest vectorization factor used in the list of
/// vector functions.		/// vector functions.
unsigned getWidestVF(StringRef ScalarF) const;		unsigned getWidestVF(StringRef ScalarF, VectorLibrary vecLib) const;
};		};

/// Provides information about what library functions are available for		/// Provides information about what library functions are available for
/// the current target.		/// the current target.
///		///
/// This both allows optimizations to handle them specially and frontends to		/// This both allows optimizations to handle them specially and frontends to
/// disable such optimizations through -fno-builtin etc.		/// disable such optimizations through -fno-builtin etc.
class TargetLibraryInfo {		class TargetLibraryInfo {
friend class TargetLibraryAnalysis;		friend class TargetLibraryAnalysis;
friend class TargetLibraryInfoWrapperPass;		friend class TargetLibraryInfoWrapperPass;

/// The global (module level) TLI info.		/// The global (module level) TLI info.
const TargetLibraryInfoImpl *Impl;		const TargetLibraryInfoImpl *Impl;

		/// Vector library available for vectorization.
		TargetLibraryInfoImpl::VectorLibrary VectLibrary =
		TargetLibraryInfoImpl::NoLibrary;

/// Support for -fno-builtin* options as function attributes, overrides		/// Support for -fno-builtin* options as function attributes, overrides
/// information in global TargetLibraryInfoImpl.		/// information in global TargetLibraryInfoImpl.
BitVector OverrideAsUnavailable;		BitVector OverrideAsUnavailable;

		TargetLibraryInfoImpl::VectorLibrary
		getVecLibFromName(const StringRef &VecLibName) {
		if (VecLibName == "Accelerate")
		return TargetLibraryInfoImpl::Accelerate;
		else if (VecLibName == "MASSV")
		return TargetLibraryInfoImpl::MASSV;
		else if (VecLibName == "SVML")
		return TargetLibraryInfoImpl::SVML;
		return TargetLibraryInfoImpl::NoLibrary;
		}

public:		public:
explicit TargetLibraryInfo(const TargetLibraryInfoImpl &Impl,		explicit TargetLibraryInfo(const TargetLibraryInfoImpl &Impl,
Optional<const Function *> F = None)		Optional<const Function *> F = None);
: Impl(&Impl), OverrideAsUnavailable(NumLibFuncs) {
if (!F)
return;
if ((*F)->hasFnAttribute("no-builtins"))
disableAllFunctions();
else {
// Disable individual libc/libm calls in TargetLibraryInfo.
LibFunc LF;
AttributeSet FnAttrs = (*F)->getAttributes().getFnAttributes();
for (const Attribute &Attr : FnAttrs) {
if (!Attr.isStringAttribute())
continue;
auto AttrStr = Attr.getKindAsString();
if (!AttrStr.consume_front("no-builtin-"))
continue;
if (getLibFunc(AttrStr, LF))
setUnavailable(LF);
}
}
}

// Provide value semantics.		// Provide value semantics.
TargetLibraryInfo(const TargetLibraryInfo &TLI)		TargetLibraryInfo(const TargetLibraryInfo &TLI)
: Impl(TLI.Impl), OverrideAsUnavailable(TLI.OverrideAsUnavailable) {}		: Impl(TLI.Impl), VectLibrary(TLI.VectLibrary),
		OverrideAsUnavailable(TLI.OverrideAsUnavailable) {}
tejohnson (inline comment, Not Done): Suggest moving the implementation of this constructor to the .cpp file, in which case you can just set VectLibrary directly from ClVectorLibrary there and remove the member on the Impl object.
wenlei (author, inline comment, Done): There are utilities that use `TargetLibraryInfo` but don't link with `TargetLibraryInfo.o`. Looking at `TargetLibraryInfo`, all of its functions are in this header, so I assumed it is intentional to keep this type self-contained in the header, as it's public API. That is why I added `ClVectorLibrary` to Impl, to pass it back to `TargetLibraryInfo`. For `TargetLibraryInfoImpl`, it's OK to have the implementation outside of the header. I can give it a try if keeping the class implementation/definition self-contained in the header isn't important.
tejohnson (inline comment, Done): I don't think there should be anything using TLI without linking with libAnalysis, which contains TargetLibraryInfo.o. I don't think it should be important to keep the implementation in the header, any more so than for other headers in the Analysis library.
wenlei (author, inline comment, Done): OK, I moved it to the .cpp file, thanks!
TargetLibraryInfo(TargetLibraryInfo &&TLI)		TargetLibraryInfo(TargetLibraryInfo &&TLI)
: Impl(TLI.Impl), OverrideAsUnavailable(TLI.OverrideAsUnavailable) {}		: Impl(TLI.Impl), VectLibrary(TLI.VectLibrary),
		OverrideAsUnavailable(TLI.OverrideAsUnavailable) {}
TargetLibraryInfo &operator=(const TargetLibraryInfo &TLI) {		TargetLibraryInfo &operator=(const TargetLibraryInfo &TLI) {
Impl = TLI.Impl;		Impl = TLI.Impl;
		VectLibrary = TLI.VectLibrary;
OverrideAsUnavailable = TLI.OverrideAsUnavailable;		OverrideAsUnavailable = TLI.OverrideAsUnavailable;
return *this;		return *this;
}		}
TargetLibraryInfo &operator=(TargetLibraryInfo &&TLI) {		TargetLibraryInfo &operator=(TargetLibraryInfo &&TLI) {
Impl = TLI.Impl;		Impl = TLI.Impl;
		VectLibrary = TLI.VectLibrary;
OverrideAsUnavailable = TLI.OverrideAsUnavailable;		OverrideAsUnavailable = TLI.OverrideAsUnavailable;
return *this;		return *this;
}		}

/// Determine whether a callee with the given TLI can be inlined into		/// Determine whether a callee with the given TLI can be inlined into
/// caller with this TLI, based on 'nobuiltin' attributes. When requested,		/// caller with this TLI, based on 'nobuiltin', `veclib` attributes.
/// allow inlining into a caller with a superset of the callee's nobuiltin		/// When requested, allow inlining into a caller with a superset of the
/// attributes, which is conservatively correct.		/// callee's attributes, which is conservatively correct.
bool areInlineCompatible(const TargetLibraryInfo &CalleeTLI,		bool areInlineCompatible(const TargetLibraryInfo &CalleeTLI,
bool AllowCallerSuperset) const {		bool AllowCallerSuperset) const {
if (!AllowCallerSuperset)		if (!AllowCallerSuperset)
tejohnson (inline comment, Done): This is set via a flag called "inline-caller-superset-nobuiltin". Suggest changing the name to something like "inline-caller-superset-tli" to reflect new larger scope. Also add a check with that option to your new inline test case.
return OverrideAsUnavailable == CalleeTLI.OverrideAsUnavailable;		return VectLibrary == CalleeTLI.VectLibrary &&
		OverrideAsUnavailable == CalleeTLI.OverrideAsUnavailable;
tejohnson (inline comment, Done): I don't think this will do anything currently since the TLII is built once per module by the analysis. You'll hit your assert about incompatibility below first, see comment there.
BitVector B = OverrideAsUnavailable;		BitVector B = OverrideAsUnavailable;
B \|= CalleeTLI.OverrideAsUnavailable;		B \|= CalleeTLI.OverrideAsUnavailable;
// We can inline if the union of the caller and callee's nobuiltin		// We can inline if the union of the caller and callee's attributes
// attributes is no stricter than the caller's nobuiltin attributes.		// is no stricter than the caller's attributes.
return B == OverrideAsUnavailable;		bool VecLibCompatible =
		(VectLibrary == CalleeTLI.VectLibrary) \|\|
		CalleeTLI.VectLibrary == TargetLibraryInfoImpl::NoLibrary;
		return B == OverrideAsUnavailable && VecLibCompatible;
}		}

/// Searches for a particular function name.		/// Searches for a particular function name.
///		///
/// If it is one of the known library functions, return true and set F to the		/// If it is one of the known library functions, return true and set F to the
/// corresponding value.		/// corresponding value.
bool getLibFunc(StringRef funcName, LibFunc &F) const {		bool getLibFunc(StringRef funcName, LibFunc &F) const {
return Impl->getLibFunc(funcName, F);		return Impl->getLibFunc(funcName, F);
[... 28 lines hidden ...]	TargetLibraryInfoImpl::AvailabilityState getState(LibFunc F) const {
return Impl->getState(F);		return Impl->getState(F);
}		}

/// Tests whether a library function is available.		/// Tests whether a library function is available.
bool has(LibFunc F) const {		bool has(LibFunc F) const {
return getState(F) != TargetLibraryInfoImpl::Unavailable;		return getState(F) != TargetLibraryInfoImpl::Unavailable;
}		}
bool isFunctionVectorizable(StringRef F, unsigned VF) const {		bool isFunctionVectorizable(StringRef F, unsigned VF) const {
return Impl->isFunctionVectorizable(F, VF);		return Impl->isFunctionVectorizable(F, VF, VectLibrary);
}		}
bool isFunctionVectorizable(StringRef F) const {		bool isFunctionVectorizable(StringRef F) const {
return Impl->isFunctionVectorizable(F);		return Impl->isFunctionVectorizable(F, VectLibrary);
}		}
StringRef getVectorizedFunction(StringRef F, unsigned VF) const {		StringRef getVectorizedFunction(StringRef F, unsigned VF) const {
return Impl->getVectorizedFunction(F, VF);		return Impl->getVectorizedFunction(F, VF, VectLibrary);
}		}

/// Tests if the function is both available and a candidate for optimized code		/// Tests if the function is both available and a candidate for optimized code
/// generation.		/// generation.
bool hasOptimizedCodeGen(LibFunc F) const {		bool hasOptimizedCodeGen(LibFunc F) const {
if (getState(F) == TargetLibraryInfoImpl::Unavailable)		if (getState(F) == TargetLibraryInfoImpl::Unavailable)
return false;		return false;
switch (F) {		switch (F) {
[... 68 lines hidden ...]	public:
}		}
bool invalidate(Function &, const PreservedAnalyses &,		bool invalidate(Function &, const PreservedAnalyses &,
FunctionAnalysisManager::Invalidator &) {		FunctionAnalysisManager::Invalidator &) {
return false;		return false;
}		}
/// Returns the largest vectorization factor used in the list of		/// Returns the largest vectorization factor used in the list of
/// vector functions.		/// vector functions.
unsigned getWidestVF(StringRef ScalarF) const {		unsigned getWidestVF(StringRef ScalarF) const {
return Impl->getWidestVF(ScalarF);		return Impl->getWidestVF(ScalarF, VectLibrary);
}		}

/// Check if the function "F" is listed in a library known to LLVM.		/// Check if the function "F" is listed in a library known to LLVM.
bool isKnownVectorFunctionInLibrary(StringRef F) const {		bool isKnownVectorFunctionInLibrary(StringRef F) const {
return this->isFunctionVectorizable(F);		return this->isFunctionVectorizable(F);
}		}
};		};

[... 51 lines hidden ...]
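
The split discussed in the inline comments above (descriptor tables built once per module on the Impl object, with a per-function selector choosing which table to query) can be sketched outside of LLVM roughly as follows. This is an illustration under stated assumptions, not the patch's code: `LibInfoImpl`, `LibInfo`, the simplified `VecDesc` and every function name stored in the tables are hypothetical, and only the enum layout mirrors the header above.

```cpp
#include <algorithm>
#include <array>
#include <string>
#include <utility>
#include <vector>

// Simplified, hypothetical version of the VecDesc record.
struct VecDesc {
  std::string ScalarFnName;
  std::string VectorFnName;
  unsigned VectorizationFactor;
};

// Same ordering as the enum in the patch: real libraries first, so that
// NumVecLibs can size the per-library table array.
enum VectorLibrary { Accelerate, MASSV, SVML, NumVecLibs, NoLibrary };

// Module-level data: one descriptor table per library, built once.
class LibInfoImpl {
  std::array<std::vector<VecDesc>, NumVecLibs> VectorDescs;

public:
  void add(VectorLibrary Lib, const std::vector<VecDesc> &Fns) {
    auto &Descs = VectorDescs[Lib];
    Descs.insert(Descs.end(), Fns.begin(), Fns.end());
    // Keep the table sorted by scalar name so lookups can binary search.
    std::sort(Descs.begin(), Descs.end(),
              [](const VecDesc &A, const VecDesc &B) {
                return A.ScalarFnName < B.ScalarFnName;
              });
  }

  // Returns the vector variant of F for factor VF, or "" if there is none.
  std::string getVectorizedFunction(const std::string &F, unsigned VF,
                                    VectorLibrary Lib) const {
    if (Lib >= NumVecLibs) // NoLibrary (or the count sentinel): no table.
      return "";
    const auto &Descs = VectorDescs[Lib];
    auto I = std::lower_bound(Descs.begin(), Descs.end(), F,
                              [](const VecDesc &D, const std::string &S) {
                                return D.ScalarFnName < S;
                              });
    for (; I != Descs.end() && I->ScalarFnName == F; ++I)
      if (I->VectorizationFactor == VF)
        return I->VectorFnName;
    return "";
  }
};

// Per-function view: stores only a pointer and the selected library.
class LibInfo {
  const LibInfoImpl *Impl;
  VectorLibrary Lib;

public:
  // VecLibAttr plays the role of the function's "veclib" attribute value.
  LibInfo(const LibInfoImpl &I, const std::string &VecLibAttr)
      : Impl(&I), Lib(parse(VecLibAttr)) {}

  static VectorLibrary parse(const std::string &Name) {
    if (Name == "Accelerate")
      return Accelerate;
    if (Name == "MASSV")
      return MASSV;
    if (Name == "SVML")
      return SVML;
    return NoLibrary;
  }

  std::string getVectorizedFunction(const std::string &F, unsigned VF) const {
    return Impl->getVectorizedFunction(F, VF, Lib);
  }
};
```

The point of the layout is that the per-function object stays cheap to construct and copy, while functions carrying different "veclib" attributes in one (possibly LTO-combined) module still see different tables.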

llvm/lib/Analysis/InlineCost.cpp

[... 98 lines hidden ...]	cl::desc("Minimum block frequency, expressed as a multiple of caller's "
"entry frequency, for a callsite to be hot in the absence of "		"entry frequency, for a callsite to be hot in the absence of "
"profile information."));		"profile information."));

static cl::opt<bool> OptComputeFullInlineCost(		static cl::opt<bool> OptComputeFullInlineCost(
"inline-cost-full", cl::Hidden, cl::init(false), cl::ZeroOrMore,		"inline-cost-full", cl::Hidden, cl::init(false), cl::ZeroOrMore,
cl::desc("Compute the full inline cost of a call site even when the cost "		cl::desc("Compute the full inline cost of a call site even when the cost "
"exceeds the threshold."));		"exceeds the threshold."));

static cl::opt<bool> InlineCallerSupersetNoBuiltin(		static cl::opt<bool> InlineCallerSupersetTLI(
"inline-caller-superset-nobuiltin", cl::Hidden, cl::init(true),		"inline-caller-superset-tli", cl::Hidden, cl::init(true), cl::ZeroOrMore,
cl::ZeroOrMore,		cl::desc("Allow inlining when caller has a superset of callee's TLI "
cl::desc("Allow inlining when caller has a superset of callee's nobuiltin "
"attributes."));		"attributes."));

namespace {		namespace {
class InlineCostCallAnalyzer;		class InlineCostCallAnalyzer;

// This struct is used to store information about inline cost of a		// This struct is used to store information about inline cost of a
// particular instruction		// particular instruction
struct InstructionCostDetail {		struct InstructionCostDetail {
[... 2,045 lines hidden ...]	static bool functionsHaveCompatibleAttributes(
function_ref<const TargetLibraryInfo &(Function &)> &GetTLI) {		function_ref<const TargetLibraryInfo &(Function &)> &GetTLI) {
// Note that CalleeTLI must be a copy not a reference. The legacy pass manager		// Note that CalleeTLI must be a copy not a reference. The legacy pass manager
// caches the most recently created TLI in the TargetLibraryInfoWrapperPass		// caches the most recently created TLI in the TargetLibraryInfoWrapperPass
// object, and always returns the same object (which is overwritten on each		// object, and always returns the same object (which is overwritten on each
// GetTLI call). Therefore we copy the first result.		// GetTLI call). Therefore we copy the first result.
auto CalleeTLI = GetTLI(*Callee);		auto CalleeTLI = GetTLI(*Callee);
return TTI.areInlineCompatible(Caller, Callee) &&		return TTI.areInlineCompatible(Caller, Callee) &&
GetTLI(*Caller).areInlineCompatible(CalleeTLI,		GetTLI(*Caller).areInlineCompatible(CalleeTLI,
InlineCallerSupersetNoBuiltin) &&		InlineCallerSupersetTLI) &&
AttributeFuncs::areInlineCompatible(Caller, Callee);		AttributeFuncs::areInlineCompatible(Caller, Callee);
}		}

int llvm::getCallsiteCost(CallBase &Call, const DataLayout &DL) {		int llvm::getCallsiteCost(CallBase &Call, const DataLayout &DL) {
int Cost = 0;		int Cost = 0;
for (unsigned I = 0, E = Call.arg_size(); I != E; ++I) {		for (unsigned I = 0, E = Call.arg_size(); I != E; ++I) {
if (Call.isByValArgument(I)) {		if (Call.isByValArgument(I)) {
// We approximate the number of loads and stores needed by dividing the		// We approximate the number of loads and stores needed by dividing the
[... 256 lines hidden ...]
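
The compatibility rule that `functionsHaveCompatibleAttributes` relies on (see `areInlineCompatible` in TargetLibraryInfo.h above) can be exercised in isolation with a small model. This is a sketch only: `FnLibInfo` is a hypothetical stand-in for the per-function `TargetLibraryInfo`, and the 8-entry `std::bitset` is an assumed replacement for the `OverrideAsUnavailable` BitVector.

```cpp
#include <bitset>

enum VectorLibrary { Accelerate, MASSV, SVML, NumVecLibs, NoLibrary };

// Hypothetical per-function summary of the TLI-relevant attributes.
struct FnLibInfo {
  VectorLibrary VecLib = NoLibrary; // from the "veclib" attribute
  std::bitset<8> Unavailable;       // from "no-builtin-<name>" attributes
};

// Mirrors the logic shown in the diff: in strict mode both settings must be
// identical; in superset mode the callee may not disable any builtin that the
// caller keeps enabled, and the callee's vector library must either match the
// caller's or be absent.
bool areInlineCompatible(const FnLibInfo &Caller, const FnLibInfo &Callee,
                         bool AllowCallerSuperset) {
  if (!AllowCallerSuperset)
    return Caller.VecLib == Callee.VecLib &&
           Caller.Unavailable == Callee.Unavailable;
  bool NoBuiltinCompatible =
      (Caller.Unavailable | Callee.Unavailable) == Caller.Unavailable;
  bool VecLibCompatible =
      Caller.VecLib == Callee.VecLib || Callee.VecLib == NoLibrary;
  return NoBuiltinCompatible && VecLibCompatible;
}
```

Under this model an SVML caller may absorb a callee with no vector library in superset mode, while a callee built for a different library is rejected in either mode.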

llvm/lib/Analysis/TargetLibraryInfo.cpp

[... 544 lines hidden ...]	static void initialize(TargetLibraryInfoImpl &TLI, const Triple &T,
// optimizations, so this situation should be fixed.		// optimizations, so this situation should be fixed.
if (T.isNVPTX()) {		if (T.isNVPTX()) {
TLI.disableAllFunctions();		TLI.disableAllFunctions();
TLI.setAvailable(LibFunc_nvvm_reflect);		TLI.setAvailable(LibFunc_nvvm_reflect);
} else {		} else {
TLI.setUnavailable(LibFunc_nvvm_reflect);		TLI.setUnavailable(LibFunc_nvvm_reflect);
}		}

TLI.addVectorizableFunctionsFromVecLib(ClVectorLibrary);		TLI.addAllVectorizableFunctions();
}		}

TargetLibraryInfoImpl::TargetLibraryInfoImpl() {		TargetLibraryInfoImpl::TargetLibraryInfoImpl() {
// Default to everything being available.		// Default to everything being available.
memset(AvailableArray, -1, sizeof(AvailableArray));		memset(AvailableArray, -1, sizeof(AvailableArray));

initialize(*this, Triple(), StandardNames);		initialize(*this, Triple(), StandardNames);
}		}

TargetLibraryInfoImpl::TargetLibraryInfoImpl(const Triple &T) {		TargetLibraryInfoImpl::TargetLibraryInfoImpl(const Triple &T) {
// Default to everything being available.		// Default to everything being available.
memset(AvailableArray, -1, sizeof(AvailableArray));		memset(AvailableArray, -1, sizeof(AvailableArray));

initialize(*this, T, StandardNames);		initialize(*this, T, StandardNames);
}		}

TargetLibraryInfoImpl::TargetLibraryInfoImpl(const TargetLibraryInfoImpl &TLI)		TargetLibraryInfoImpl::TargetLibraryInfoImpl(const TargetLibraryInfoImpl &TLI)
: CustomNames(TLI.CustomNames), ShouldExtI32Param(TLI.ShouldExtI32Param),		: CustomNames(TLI.CustomNames), ShouldExtI32Param(TLI.ShouldExtI32Param),
ShouldExtI32Return(TLI.ShouldExtI32Return),		ShouldExtI32Return(TLI.ShouldExtI32Return),
ShouldSignExtI32Param(TLI.ShouldSignExtI32Param) {		ShouldSignExtI32Param(TLI.ShouldSignExtI32Param) {
memcpy(AvailableArray, TLI.AvailableArray, sizeof(AvailableArray));		memcpy(AvailableArray, TLI.AvailableArray, sizeof(AvailableArray));
VectorDescs = TLI.VectorDescs;		for (unsigned i = 0; i < NumVecLibs; i++)
ScalarDescs = TLI.ScalarDescs;		VecLibDescs[i] = TLI.VecLibDescs[i];
tejohnson (inline comment, Done): Why not just have "i < NumVecLibs"?
wenlei (author, inline comment, Done): Good catch, thanks.
}		}

TargetLibraryInfoImpl::TargetLibraryInfoImpl(TargetLibraryInfoImpl &&TLI)		TargetLibraryInfoImpl::TargetLibraryInfoImpl(TargetLibraryInfoImpl &&TLI)
: CustomNames(std::move(TLI.CustomNames)),		: CustomNames(std::move(TLI.CustomNames)),
ShouldExtI32Param(TLI.ShouldExtI32Param),		ShouldExtI32Param(TLI.ShouldExtI32Param),
ShouldExtI32Return(TLI.ShouldExtI32Return),		ShouldExtI32Return(TLI.ShouldExtI32Return),
ShouldSignExtI32Param(TLI.ShouldSignExtI32Param) {		ShouldSignExtI32Param(TLI.ShouldSignExtI32Param) {
std::move(std::begin(TLI.AvailableArray), std::end(TLI.AvailableArray),		std::move(std::begin(TLI.AvailableArray), std::end(TLI.AvailableArray),
AvailableArray);		AvailableArray);
VectorDescs = TLI.VectorDescs;		for (unsigned i = 0; i < NumVecLibs; i++)
ScalarDescs = TLI.ScalarDescs;		VecLibDescs[i] = TLI.VecLibDescs[i];
tejohnson (inline comment, Done): ditto
}		}

TargetLibraryInfoImpl &TargetLibraryInfoImpl::operator=(const TargetLibraryInfoImpl &TLI) {		TargetLibraryInfoImpl &TargetLibraryInfoImpl::operator=(const TargetLibraryInfoImpl &TLI) {
CustomNames = TLI.CustomNames;		CustomNames = TLI.CustomNames;
ShouldExtI32Param = TLI.ShouldExtI32Param;		ShouldExtI32Param = TLI.ShouldExtI32Param;
ShouldExtI32Return = TLI.ShouldExtI32Return;		ShouldExtI32Return = TLI.ShouldExtI32Return;
ShouldSignExtI32Param = TLI.ShouldSignExtI32Param;		ShouldSignExtI32Param = TLI.ShouldSignExtI32Param;
memcpy(AvailableArray, TLI.AvailableArray, sizeof(AvailableArray));		memcpy(AvailableArray, TLI.AvailableArray, sizeof(AvailableArray));
[... 919 lines hidden ...]
static bool compareWithScalarFnName(const VecDesc &LHS, StringRef S) {		static bool compareWithScalarFnName(const VecDesc &LHS, StringRef S) {
return LHS.ScalarFnName < S;		return LHS.ScalarFnName < S;
}		}

static bool compareWithVectorFnName(const VecDesc &LHS, StringRef S) {		static bool compareWithVectorFnName(const VecDesc &LHS, StringRef S) {
return LHS.VectorFnName < S;		return LHS.VectorFnName < S;
}		}

void TargetLibraryInfoImpl::addVectorizableFunctions(ArrayRef<VecDesc> Fns) {		void TargetLibraryInfoImpl::addAllVectorizableFunctions() {
		addVectorizableFunctionsFromVecLib(Accelerate, VecLibDescs[Accelerate]);
		addVectorizableFunctionsFromVecLib(MASSV, VecLibDescs[MASSV]);
		addVectorizableFunctionsFromVecLib(SVML, VecLibDescs[SVML]);
		}

		void TargetLibraryInfoImpl::addVectorizableFunctions(
		ArrayRef<VecDesc> Fns, VectorLibraryDescriptors &VecLibDesc) {
		auto &VectorDescs = VecLibDesc.VectorDescs;
		auto &ScalarDescs = VecLibDesc.ScalarDescs;
VectorDescs.insert(VectorDescs.end(), Fns.begin(), Fns.end());		VectorDescs.insert(VectorDescs.end(), Fns.begin(), Fns.end());
llvm::sort(VectorDescs, compareByScalarFnName);		llvm::sort(VectorDescs, compareByScalarFnName);

ScalarDescs.insert(ScalarDescs.end(), Fns.begin(), Fns.end());		ScalarDescs.insert(ScalarDescs.end(), Fns.begin(), Fns.end());
llvm::sort(ScalarDescs, compareByVectorFnName);		llvm::sort(ScalarDescs, compareByVectorFnName);
}		}

void TargetLibraryInfoImpl::addVectorizableFunctionsFromVecLib(		void TargetLibraryInfoImpl::addVectorizableFunctionsFromVecLib(
enum VectorLibrary VecLib) {		enum VectorLibrary VecLib, VectorLibraryDescriptors &VetLibDesc) {
switch (VecLib) {		switch (VecLib) {
case Accelerate: {		case Accelerate: {
const VecDesc VecFuncs[] = {		const VecDesc VecFuncs[] = {
		tejohnson (Done): You'll certainly hit this assert if you try LTO linking two .ll files built with different -fveclib options, because the TLII is built once per module by the analysis.
#define TLI_DEFINE_ACCELERATE_VECFUNCS		#define TLI_DEFINE_ACCELERATE_VECFUNCS
#include "llvm/Analysis/VecFuncs.def"		#include "llvm/Analysis/VecFuncs.def"
};		};
addVectorizableFunctions(VecFuncs);		addVectorizableFunctions(VecFuncs, VetLibDesc);
break;		break;
}		}
case MASSV: {		case MASSV: {
const VecDesc VecFuncs[] = {		const VecDesc VecFuncs[] = {
#define TLI_DEFINE_MASSV_VECFUNCS		#define TLI_DEFINE_MASSV_VECFUNCS
#include "llvm/Analysis/VecFuncs.def"		#include "llvm/Analysis/VecFuncs.def"
};		};
addVectorizableFunctions(VecFuncs);		addVectorizableFunctions(VecFuncs, VetLibDesc);
break;		break;
}		}
case SVML: {		case SVML: {
const VecDesc VecFuncs[] = {		const VecDesc VecFuncs[] = {
#define TLI_DEFINE_SVML_VECFUNCS		#define TLI_DEFINE_SVML_VECFUNCS
#include "llvm/Analysis/VecFuncs.def"		#include "llvm/Analysis/VecFuncs.def"
};		};
addVectorizableFunctions(VecFuncs);		addVectorizableFunctions(VecFuncs, VetLibDesc);
break;		break;
}		}
case NoLibrary:		default:
		llvm_unreachable("Unexpected vector library");
break;		break;
		tejohnson (Done): Should these two be llvm_unreachable?
}		}
}		}

bool TargetLibraryInfoImpl::isFunctionVectorizable(StringRef funcName) const {		bool TargetLibraryInfoImpl::isFunctionVectorizable(StringRef funcName,
		VectorLibrary vecLib) const {
funcName = sanitizeFunctionName(funcName);		funcName = sanitizeFunctionName(funcName);
if (funcName.empty())		if (funcName.empty() || vecLib >= NumVecLibs)
return false;		return false;

		auto &VectorDescs = VecLibDescs[vecLib].VectorDescs;
std::vector<VecDesc>::const_iterator I =		std::vector<VecDesc>::const_iterator I =
llvm::lower_bound(VectorDescs, funcName, compareWithScalarFnName);		llvm::lower_bound(VectorDescs, funcName, compareWithScalarFnName);
return I != VectorDescs.end() && StringRef(I->ScalarFnName) == funcName;		return I != VectorDescs.end() && StringRef(I->ScalarFnName) == funcName;
}		}

StringRef TargetLibraryInfoImpl::getVectorizedFunction(StringRef F,		StringRef
unsigned VF) const {		TargetLibraryInfoImpl::getVectorizedFunction(StringRef F, unsigned VF,
		VectorLibrary vecLib) const {
F = sanitizeFunctionName(F);		F = sanitizeFunctionName(F);
if (F.empty())		if (F.empty() || vecLib >= NumVecLibs)
return F;		return F;
		auto &VectorDescs = VecLibDescs[vecLib].VectorDescs;
std::vector<VecDesc>::const_iterator I =		std::vector<VecDesc>::const_iterator I =
llvm::lower_bound(VectorDescs, F, compareWithScalarFnName);		llvm::lower_bound(VectorDescs, F, compareWithScalarFnName);
while (I != VectorDescs.end() && StringRef(I->ScalarFnName) == F) {		while (I != VectorDescs.end() && StringRef(I->ScalarFnName) == F) {
if (I->VectorizationFactor == VF)		if (I->VectorizationFactor == VF)
return I->VectorFnName;		return I->VectorFnName;
++I;		++I;
}		}
return StringRef();		return StringRef();
}		}

StringRef TargetLibraryInfoImpl::getScalarizedFunction(StringRef F,		StringRef
unsigned &VF) const {		TargetLibraryInfoImpl::getScalarizedFunction(StringRef F, unsigned &VF,
		VectorLibrary vecLib) const {
F = sanitizeFunctionName(F);		F = sanitizeFunctionName(F);
if (F.empty())		if (F.empty() || vecLib >= NumVecLibs)
return F;		return F;

		auto &VectorDescs = VecLibDescs[vecLib].VectorDescs;
		auto &ScalarDescs = VecLibDescs[vecLib].ScalarDescs;
std::vector<VecDesc>::const_iterator I =		std::vector<VecDesc>::const_iterator I =
llvm::lower_bound(ScalarDescs, F, compareWithVectorFnName);		llvm::lower_bound(ScalarDescs, F, compareWithVectorFnName);
if (I == VectorDescs.end() || StringRef(I->VectorFnName) != F)		if (I == VectorDescs.end() || StringRef(I->VectorFnName) != F)
return StringRef();		return StringRef();
VF = I->VectorizationFactor;		VF = I->VectorizationFactor;
return I->ScalarFnName;		return I->ScalarFnName;
}		}
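
The lookup functions above all follow the same pattern: pick the descriptor table for the requested library, binary-search it by scalar name, then scan the run of matching entries. A minimal standalone sketch of that pattern follows. It uses std:: containers instead of the LLVM ADT types, the class name VecLibTables is hypothetical, the enum ordering (NoLibrary placed after NumVecLibs) is an assumption, and the table entries in the usage note are illustrative rather than copied from VecFuncs.def.

```cpp
#include <algorithm>
#include <array>
#include <cassert>
#include <string>
#include <vector>

// Stand-in for the patch's enum; the ordering is an assumption.
enum VectorLibrary { Accelerate, MASSV, SVML, NumVecLibs, NoLibrary };

// Simplified analogue of llvm::VecDesc.
struct VecDesc {
  std::string ScalarFnName;
  std::string VectorFnName;
  unsigned VectorizationFactor;
};

// Hypothetical container: one descriptor table per vector library,
// each kept sorted by scalar function name.
class VecLibTables {
  std::array<std::vector<VecDesc>, NumVecLibs> Tables;

public:
  void add(VectorLibrary Lib, std::vector<VecDesc> Fns) {
    assert(Lib < NumVecLibs && "no table exists for this library");
    std::vector<VecDesc> &T = Tables[Lib];
    T.insert(T.end(), Fns.begin(), Fns.end());
    std::stable_sort(T.begin(), T.end(),
                     [](const VecDesc &A, const VecDesc &B) {
                       return A.ScalarFnName < B.ScalarFnName;
                     });
  }

  // Returns the vector variant of F at width VF in library Lib,
  // or an empty string if there is none.
  std::string getVectorizedFunction(const std::string &F, unsigned VF,
                                    VectorLibrary Lib) const {
    if (F.empty() || Lib >= NumVecLibs)
      return "";
    const std::vector<VecDesc> &T = Tables[Lib];
    auto I = std::lower_bound(T.begin(), T.end(), F,
                              [](const VecDesc &D, const std::string &S) {
                                return D.ScalarFnName < S;
                              });
    // Entries for the same scalar name are adjacent; scan them for VF.
    for (; I != T.end() && I->ScalarFnName == F; ++I)
      if (I->VectorizationFactor == VF)
        return I->VectorFnName;
    return "";
  }
};
```

With tables populated for SVML and MASSV, a query for the same scalar name can give different answers depending on the library argument, and any library value at or beyond NumVecLibs yields an empty result without touching a table.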

		TargetLibraryInfo::TargetLibraryInfo(const TargetLibraryInfoImpl &Impl,
		Optional<const Function *> F)
		: Impl(&Impl), OverrideAsUnavailable(NumLibFuncs) {
		if (!F)
		return;

		StringRef VectLib = (*F)->getFnAttribute("veclib").getValueAsString();
		if (!VectLib.empty())
		VectLibrary = getVecLibFromName(VectLib);
		else
		VectLibrary = ClVectorLibrary;

		if ((*F)->hasFnAttribute("no-builtins"))
		disableAllFunctions();
		else {
		// Disable individual libc/libm calls in TargetLibraryInfo.
		LibFunc LF;
		AttributeSet FnAttrs = (*F)->getAttributes().getFnAttributes();
		for (const Attribute &Attr : FnAttrs) {
		if (!Attr.isStringAttribute())
		continue;
		auto AttrStr = Attr.getKindAsString();
		if (!AttrStr.consume_front("no-builtin-"))
		continue;
		if (getLibFunc(AttrStr, LF))
		setUnavailable(LF);
		}
		}
		}
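
The constructor above resolves the vector library in two steps: a non-empty "veclib" attribute wins, otherwise the command-line default (ClVectorLibrary) applies. The sketch below isolates that decision. The body of getVecLibFromName is not shown in this diff, so vecLibFromName here is a hypothetical stand-in, and mapping unknown names to NoLibrary is an assumption rather than the patch's confirmed behavior.

```cpp
#include <cassert>
#include <string>

// Stand-in for the patch's enum; the ordering is an assumption.
enum VectorLibrary { Accelerate, MASSV, SVML, NumVecLibs, NoLibrary };

// Hypothetical analogue of getVecLibFromName. Treating unrecognized
// names as NoLibrary is an assumption made for this sketch.
VectorLibrary vecLibFromName(const std::string &Name) {
  assert(!Name.empty() && "the empty case is handled by the caller");
  if (Name == "Accelerate")
    return Accelerate;
  if (Name == "MASSV")
    return MASSV;
  if (Name == "SVML")
    return SVML;
  return NoLibrary;
}

// AttrValue is the string value of the function's "veclib" attribute;
// it is empty when the attribute is absent.
VectorLibrary resolveVecLib(const std::string &AttrValue,
                            VectorLibrary CommandLineDefault) {
  return AttrValue.empty() ? CommandLineDefault : vecLibFromName(AttrValue);
}
```

Keeping the fallback in one place makes the precedence explicit: the per-function attribute overrides the global option, which is what lets the setting survive into LTO backends.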

TargetLibraryInfo TargetLibraryAnalysis::run(const Function &F,		TargetLibraryInfo TargetLibraryAnalysis::run(const Function &F,
FunctionAnalysisManager &) {		FunctionAnalysisManager &) {
if (!BaselineInfoImpl)		if (!BaselineInfoImpl)
BaselineInfoImpl =		BaselineInfoImpl =
TargetLibraryInfoImpl(Triple(F.getParent()->getTargetTriple()));		TargetLibraryInfoImpl(Triple(F.getParent()->getTargetTriple()));
return TargetLibraryInfo(*BaselineInfoImpl, &F);		return TargetLibraryInfo(*BaselineInfoImpl, &F);
}		}

unsigned TargetLibraryInfoImpl::getWCharSize(const Module &M) const {		unsigned TargetLibraryInfoImpl::getWCharSize(const Module &M) const {
		tejohnson (Done): This is going to override the baseline TLI veclib with whatever is the latest function we build a TLI for (and you'll hit the assert as noted earlier if they conflict).
if (auto *ShortWChar = cast_or_null<ConstantAsMetadata>(		if (auto *ShortWChar = cast_or_null<ConstantAsMetadata>(
M.getModuleFlag("wchar_size")))		M.getModuleFlag("wchar_size")))
return cast<ConstantInt>(ShortWChar->getValue())->getZExtValue();		return cast<ConstantInt>(ShortWChar->getValue())->getZExtValue();
return 0;		return 0;
}		}

TargetLibraryInfoWrapperPass::TargetLibraryInfoWrapperPass()		TargetLibraryInfoWrapperPass::TargetLibraryInfoWrapperPass()
: ImmutablePass(ID), TLA(TargetLibraryInfoImpl()) {		: ImmutablePass(ID), TLA(TargetLibraryInfoImpl()) {
[... 15 lines elided ...]

// Register the basic pass.		// Register the basic pass.
INITIALIZE_PASS(TargetLibraryInfoWrapperPass, "targetlibinfo",		INITIALIZE_PASS(TargetLibraryInfoWrapperPass, "targetlibinfo",
"Target Library Information", false, true)		"Target Library Information", false, true)
char TargetLibraryInfoWrapperPass::ID = 0;		char TargetLibraryInfoWrapperPass::ID = 0;

void TargetLibraryInfoWrapperPass::anchor() {}		void TargetLibraryInfoWrapperPass::anchor() {}

unsigned TargetLibraryInfoImpl::getWidestVF(StringRef ScalarF) const {		unsigned TargetLibraryInfoImpl::getWidestVF(StringRef ScalarF,
		VectorLibrary vecLib) const {
ScalarF = sanitizeFunctionName(ScalarF);		ScalarF = sanitizeFunctionName(ScalarF);
if (ScalarF.empty())		if (ScalarF.empty() || vecLib >= NumVecLibs)
return 1;		return 1;

unsigned VF = 1;		unsigned VF = 1;
		auto &VectorDescs = VecLibDescs[vecLib].VectorDescs;
std::vector<VecDesc>::const_iterator I =		std::vector<VecDesc>::const_iterator I =
llvm::lower_bound(VectorDescs, ScalarF, compareWithScalarFnName);		llvm::lower_bound(VectorDescs, ScalarF, compareWithScalarFnName);
while (I != VectorDescs.end() && StringRef(I->ScalarFnName) == ScalarF) {		while (I != VectorDescs.end() && StringRef(I->ScalarFnName) == ScalarF) {
if (I->VectorizationFactor > VF)		if (I->VectorizationFactor > VF)
VF = I->VectorizationFactor;		VF = I->VectorizationFactor;
++I;		++I;
}		}
return VF;		return VF;
}		}

llvm/test/Transforms/Inline/inline-no-builtin-compatible.ll

	; Test to ensure no inlining is allowed into a caller with fewer nobuiltin attributes.			; Test to ensure no inlining is allowed into a caller with fewer nobuiltin attributes.
	; RUN: opt < %s -mtriple=x86_64-unknown-linux-gnu -S -inline | FileCheck %s			; RUN: opt < %s -mtriple=x86_64-unknown-linux-gnu -S -inline | FileCheck %s
	; RUN: opt < %s -mtriple=x86_64-unknown-linux-gnu -S -passes='cgscc(inline)' | FileCheck %s			; RUN: opt < %s -mtriple=x86_64-unknown-linux-gnu -S -passes='cgscc(inline)' | FileCheck %s

	; Make sure we don't inline callees into a caller with a superset of the			; Make sure we don't inline callees into a caller with a superset of the
	; no builtin attributes when -inline-caller-superset-nobuiltin=false.			; no builtin attributes when -inline-caller-superset-tli=false.
	; RUN: opt < %s -inline-caller-superset-nobuiltin=false -mtriple=x86_64-unknown-linux-gnu -S -passes='cgscc(inline)' | FileCheck %s --check-prefix=NOSUPERSET			; RUN: opt < %s -inline-caller-superset-tli=false -mtriple=x86_64-unknown-linux-gnu -S -passes='cgscc(inline)' | FileCheck %s --check-prefix=NOSUPERSET

	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	define i32 @allbuiltins() {			define i32 @allbuiltins() {
	entry:			entry:
	%call = call i32 (...) @externalfunc()			%call = call i32 (...) @externalfunc()
	ret i32 %call			ret i32 %call
	[... 79 lines elided ...]

llvm/test/Transforms/Inline/veclib-compat.ll

This file was added.

				; RUN: opt < %s -inline -inline-caller-superset-tli=true -S | FileCheck %s --check-prefixes=COMMON
				; RUN: opt < %s -passes='cgscc(inline)' -inline-caller-superset-tli=true -S | FileCheck %s --check-prefixes=COMMON
				; RUN: opt < %s -inline -inline-caller-superset-tli=false -S | FileCheck %s --check-prefixes=NOSUPERSET,COMMON
				; RUN: opt < %s -passes='cgscc(inline)' -inline-caller-superset-tli=false -S | FileCheck %s --check-prefixes=NOSUPERSET,COMMON



				define i32 @callee_svml(i8 %X) #0 {
				entry:
				ret i32 1
				}

				define i32 @callee_massv(i8 %X) #1 {
				entry:
				ret i32 1
				}

				define i32 @callee_nolibrary(i8 %X) {
				entry:
				ret i32 1
				}

				define i32 @caller_svml() #0 {
				; COMMON-LABEL: define i32 @caller_svml()
				entry:
				%rslt = call i32 @callee_massv(i8 123)
				; COMMON: call i32 @callee_massv
				%tmp1 = call i32 @callee_nolibrary(i8 123)
				; NOSUPERSET: call i32 @callee_nolibrary
				tejohnson (Not Done): I think NOSUPERSET should also check that there is still a call here. You can probably replace some of the duplicated checks with COMMON in this function too.
				%tmp2 = call i32 @callee_svml(i8 123)
				; COMMON-NOT: call
				ret i32 %rslt
				}

				define i32 @caller_nolibrary() {
				; COMMON-LABEL: define i32 @caller_nolibrary()
				entry:
				%rslt = call i32 @callee_svml(i8 123)
				; COMMON: call i32 @callee_svml
				%tmp1 = call i32 @callee_massv(i8 123)
				; COMMON: call i32 @callee_massv
				%tmp2 = call i32 @callee_nolibrary(i8 123)
				; COMMON-NOT: call
				ret i32 %rslt
				}

				attributes #0 = { "veclib"="SVML" }
				attributes #1 = { "veclib"="MASSV" }
				No newline at end of file
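
The expectations in veclib-compat.ll imply a simple compatibility rule for the inliner. The sketch below is one reading of those CHECK lines, not the code from InlineCost.cpp or TargetLibraryInfo.h, which is not shown in this part of the diff. The function name and the AllowCallerSuperset parameter (standing in for -inline-caller-superset-tli) are hypothetical.

```cpp
// Stand-in for the patch's enum; the ordering is an assumption.
enum VectorLibrary { Accelerate, MASSV, SVML, NumVecLibs, NoLibrary };

// Hypothetical predicate inferred from the test expectations:
//  - identical settings are always compatible;
//  - a callee with no vector library may be inlined into a caller that
//    has one, but only when the caller-superset option is enabled;
//  - every other mismatch blocks inlining.
bool areVecLibsInlineCompatible(VectorLibrary Caller, VectorLibrary Callee,
                                bool AllowCallerSuperset) {
  if (Caller == Callee)
    return true;
  return AllowCallerSuperset && Callee == NoLibrary;
}
```

Under this reading, caller_svml keeps its call to callee_massv in every configuration, keeps its call to callee_nolibrary only when the superset option is false, and caller_nolibrary never inlines a callee that carries a "veclib" attribute.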