This is an archive of the discontinued LLVM Phabricator instance.

Differential D89031

[SVE] Add support to vectorize_width loop pragma for scalable vectors
ClosedPublic

Authored by david-arm on Oct 8 2020, 3:00 AM.


Details

Reviewers

sdesmalen
ctetreau
fhahn
c-rhodes
efriedma
aaron.ballman
SjoerdMeijer
paulwalker-arm

Commits

rG38d18d93534d: [SVE] Add support to vectorize_width loop pragma for scalable vectors

Summary

This patch adds support for two new variants of the vectorize_width
pragma:

1. vectorize_width(X[, fixed|scalable]) where an optional second
parameter is passed to the vectorize_width pragma, which indicates if
the user wishes to use fixed width or scalable vectorization. For
example the user can now write something like:

#pragma clang loop vectorize_width(4, fixed)
#pragma clang loop vectorize_width(4, scalable)

In the absence of a second parameter it is assumed the user wants
fixed width vectorization, in order to maintain compatibility with
existing code.

2. vectorize_width(fixed|scalable) where the width is left unspecified,
but the user hints what type of vectorization they prefer, either
fixed width or scalable.
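
As a rough illustration of how the first variant might appear in user code, the sketch below applies the pragma to a simple loop. The function name `scale_add` and the compiler-version guard are assumptions made for this example and are not part of the patch. The pragma is only a hint, so the computed values are the same whether or not the loop is vectorized.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical helper for illustration: computes y[i] += a * x[i].
void scale_add(float a, const std::vector<float> &x, std::vector<float> &y) {
  const std::size_t n = x.size() < y.size() ? x.size() : y.size();
// Assumption: only emit the two-argument form on compilers that are
// expected to parse it; other compilers simply see a plain loop.
#if defined(__clang__) && !defined(__apple_build_version__) && \
    __clang_major__ >= 12
#pragma clang loop vectorize_width(4, scalable)
#endif
  for (std::size_t i = 0; i < n; ++i)
    y[i] += a * x[i];
}
```

Replacing `scalable` with `fixed`, or dropping the second argument, requests fixed width vectorization as described in the summary.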

I have implemented this by making use of the LLVM loop hint attribute:

llvm.loop.vectorize.scalable.enable

Tests were added to clang/test/CodeGenCXX/pragma-loop.cpp for both the
'fixed' and 'scalable' optional parameters.

See this thread for context: http://lists.llvm.org/pipermail/cfe-dev/2020-November/067262.html

Diff Detail

Event Timeline

david-arm created this revision. Oct 8 2020, 3:00 AM

Herald added a reviewer: efriedma. Oct 8 2020, 3:00 AM

Herald added a reviewer: aaron.ballman.

Herald added a project: Restricted Project.

Herald added subscribers: cfe-commits, psnobl, tschuett.

david-arm requested review of this revision. Oct 8 2020, 3:00 AM

david-arm added a parent revision: D88962: [SVE] Add support for scalable vectors with vectorize.scalable.enable loop attribute.

Harbormaster completed remote builds in B74410: Diff 296907. Oct 8 2020, 4:10 AM

david-arm added a reviewer: SjoerdMeijer. Oct 15 2020, 5:41 AM

fhahn added inline comments. Oct 15 2020, 1:56 PM

clang/lib/Parse/ParsePragma.cpp
1098	nit: unrelated change?
clang/lib/Sema/SemaStmtAttr.cpp
145	Is there a way to only accept `fixed_width/scalable` for targets that support it? Not sure if we have enough information here, but we might be able to reject it e.g. on a per-target basis or something.

Rebase.

david-arm marked an inline comment as done. Oct 19 2020, 5:16 AM

david-arm added inline comments.

clang/lib/Sema/SemaStmtAttr.cpp
145	Hi @fhahn, I think if possible we'd prefer not to reject scalable vectors at this point. Theoretically there is no reason why we can't perform scalable vectorisation for targets that don't have hardware support for scalable vectors. In this case it simply means that vscale is 1. If you want we could add some kind of opt-remark in the vectoriser that says something like "target does not support scalable vectors, vectorising for vscale=1"?
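
To make the `vscale` remark concrete, here is a small sketch of how the lane count of a scalable vector could be modelled. The `VectorShape` struct and the `lanes` function are hypothetical names for this illustration and are not LLVM's own classes. The only assumption taken from the discussion is that a target without scalable-vector hardware behaves as if `vscale` were 1.

```cpp
#include <cstdint>

// Hypothetical model of a vector type: <MinLanes x T> when fixed,
// <vscale x MinLanes x T> when scalable.
struct VectorShape {
  std::uint32_t MinLanes;
  bool Scalable;
};

// Number of lanes processed per vector for a given runtime vscale.
// For fixed-width shapes vscale is irrelevant.
constexpr std::uint64_t lanes(VectorShape S, std::uint32_t VScale) {
  return S.Scalable ? std::uint64_t{S.MinLanes} * VScale : S.MinLanes;
}
```

With `vscale` equal to 1, `lanes({4, true}, 1)` matches the fixed-width `lanes({4, false}, 1)`. On 256-bit SVE hardware, where `vscale` is 2, the same scalable shape covers 8 lanes.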

sdesmalen added a child revision: D90342: [POC][LoopVectorizer] Propagate ElementCount to interfaces in preparation for scalable auto-vec. Oct 28 2020, 2:12 PM

LGTM

clang/lib/Sema/SemaStmtAttr.cpp
145	I agree with @david-arm that we shouldn't prevent this in the front-end. Even if the architecture may not support scalable vectors natively, there may still be reasons to want to create scalable vectors in IR, for example to have more portable IR.
clang/test/CodeGenCXX/pragma-loop.cpp
167	out of curiosity, is there a particular reason you're testing it with a do-while loop instead of a shorter for-loop like the tests in `for_template_constant_expression_test` ?

This revision is now accepted and ready to land. Oct 30 2020, 8:42 AM

fhahn added inline comments. Oct 30 2020, 8:56 AM

clang/lib/Sema/SemaStmtAttr.cpp
145	Hm, I am just a bit worried that it might be a bit confusing to users that do not know what scalable vectors are (it is obvious when knowing all about SVE, but I would assume most people don't necessarily know what this means). I guess that is not the biggest deal, as long as `vectorize_width(X, scalable)` works for every target.
"Even if the architecture may not support scalable vectors natively, there may still be reasons to want to create scalable vectors in IR, for example to have more portable IR."
Sure, but there are so many other target-specific things encoded that make the IR really un-portable between targets. Granted, it is not impossible to convert IR between some architectures (as in arm64_32).

sdesmalen requested changes to this revision. Nov 3 2020, 1:18 AM

sdesmalen added inline comments.

clang/docs/LanguageExtensions.rst
3049	Can you add a comment saying that the use of `"scalable"` is still experimental and is currently only intended to work for targets that support scalable vectors?
clang/lib/Sema/SemaStmtAttr.cpp
145	Sorry, forgot to reply to this.
"Hm, I am just a bit worried that it might be a bit confusing to users that do not know what scalable vectors are (it is obvious when knowing all about SVE, but I would assume most people don't necessarily know what this means). I guess that is not the biggest deal, as long vectorize_width(X, scalable) works for every target."
At the moment this feature is still experimental, so I don't think any target would be able to return `true` to the question if this is supported :) That said, I agree that the compiler shouldn't crash for other targets after support in the loop-vectorizer stops being experimental. So I'm changing my mind here, and am happy to go with your suggestion to ignore the flag for other targets. When some default mechanism is added to lower scalable vectors to fixed-width vectors (for targets that don't natively support them), this check can probably be removed. @david-arm can you add some target hook to ignore the hint?
"Sure, but there are so many other target-specific things encoded that make the IR really un-portable between targets. Granted, it is not impossible to convert IR between some architectures (as in arm64_32)"
I didn't mean portable between targets, but more as keeping the length of the vector agnostic in IR and leaving it until code-generation to pick a suitable/available vector extension, so that the same IR could be used to generate code for Neon or 256bit SVE for example. This is more a hypothetical use-case at the moment though.

This revision now requires changes to proceed. Nov 3 2020, 1:18 AM

david-arm updated this revision to Diff 302773. Nov 4 2020, 1:20 AM

david-arm marked an inline comment as done.

david-arm edited the summary of this revision.

sdesmalen added inline comments. Nov 4 2020, 7:44 AM

clang/include/clang/Basic/DiagnosticSemaKinds.td
939	From what I can see, the vectorize_width flag is not ignored, only the scalable property is. That means this should be: 'scalable' not supported by the target so assuming 'fixed' instead.
clang/lib/Sema/SemaStmtAttr.cpp
148	If the target does not support scalable vectors, it currently assumes `"fixed"`. If we want to stick with that approach, the diagnostic message should be changed (see my other comment). The alternative is dropping the hint entirely by returning `nullptr` and changing the diagnostic message to say the hint is ignored. I could live with both options. @fhahn do you have a preference here?
nit: to reduce nesting, can you hoist this out one level, e.g.
if (StateLoc && StateLoc->Ident & ...)
  State = LoopHintAttr::ScalableNumeric;
else
  State = LoopHintAttr::Numeric;
if (State == LoopHintAttr::ScalableNumeric &&
    !S.Context.getTargetInfo().supportsScalableVectors()) {
  S.Diag(....);
  State = LoopHintAttr::Numeric;
}
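
The restructuring suggested in this nit can be sketched in isolation as follows. This is a simplified stand-in, not the code in `SemaStmtAttr.cpp`: `HintState`, `MockTarget` and `resolveWidthState` are hypothetical names, and the diagnostic is reduced to a callback. Only `supportsScalableVectors()` and the `Numeric`/`ScalableNumeric` states are taken from the review.

```cpp
#include <functional>
#include <string>

// Mirrors the two LoopHintAttr states discussed in the review.
enum class HintState { Numeric, ScalableNumeric };

// Hypothetical stand-in for the target query used in the review.
struct MockTarget {
  bool HasScalableVectors;
  bool supportsScalableVectors() const { return HasScalableVectors; }
};

// Pick the state from the optional second argument first, then downgrade
// to fixed width (with a warning) if the target cannot honour 'scalable'.
HintState
resolveWidthState(const std::string &Kind, const MockTarget &Target,
                  const std::function<void(const std::string &)> &Warn) {
  HintState State =
      Kind == "scalable" ? HintState::ScalableNumeric : HintState::Numeric;

  if (State == HintState::ScalableNumeric &&
      !Target.supportsScalableVectors()) {
    Warn("'scalable' not supported by the target so assuming 'fixed' instead");
    State = HintState::Numeric;
  }
  return State;
}
```

Separating "what did the user ask for" from "what can the target do" keeps the fallback in one place, which is the point of hoisting the check out one level.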

I'll hold off on any more changes for now to give @fhahn a chance to reply to your comment @sdesmalen about the fallback behaviour when scalable vectorisation is unsupported.

clang/include/clang/Basic/DiagnosticSemaKinds.td
939	OK. I guess it's just when the warning comes out it appears at the start of the line so I wanted to emphasise that this relates to the scalable property passed to the vectorize_width attribute (rather than other attributes) as there could potentially be several pragmas on one line. I think it would be good to mention the vectorize_width pragma/attribute somewhere in the warning message to make it clear. I'll see if I can reword it.

sdesmalen added a comment. Nov 6 2020, 3:49 AM

This comment was removed by sdesmalen.

sdesmalen mentioned this in D88962: [SVE] Add support for scalable vectors with vectorize.scalable.enable loop attribute. Nov 9 2020, 1:40 PM

david-arm updated this revision to Diff 304488. Nov 11 2020, 5:16 AM

david-arm marked an inline comment as done.

c-rhodes added inline comments. Nov 11 2020, 5:57 AM

clang/docs/LanguageExtensions.rst
3032	nit: `s/__value__/_value_/`

I am very sorry that I am late to this... but I do have some concerns.

The concern that I have is that we extend vectorize_width with a scalable/fixed boolean, but there are more vectorisation pragmas that set vectorisation options which imply enabling vectorisation:

Pragmas setting transformation options imply the transformation is enabled, as if it was enabled via the corresponding transformation pragma (e.g. vectorize(enable))

Thus, unless I miss something, I don't think this should be an option to vectorize_width, but to me it looks like we need a separate one, e.g.:

vectorize_scalable(enable|disable)

what do you think?

Hi @SjoerdMeijer I think that given we now support scalable vectors we thought it made sense to be able to specify whether the user wants 'fixed' or 'scalable' vectorisation along with the vector width, although without specifying the additional property the default continues to remain 'fixed'. However, what you said about having a vectorize_scalable pragma is correct and we are intending to also add a pragma like this in a future patch.

Okay, I haven't looked at the implementation to be honest, but am just trying to understand the different use cases of this first.
I just seem to be missing or not understanding why fixed/scalable is an option to only vectorize_width, why not to vectorize(enable) or just a separate one like vectorize_scalable? By making scalable/fixed an option to vectorize_width, you can't toggle this for other pragmas like interleave(enable) that enable vectorisation, which would be inconsistent? It also seems to be more work to me to do this first for vectorize_width, and then fix up other pragmas later. But I might be missing something (obvious) here.

Hi @SjoerdMeijer, all valid and good questions. We think it makes sense to allow specifying explicitly what the meaning of '4' is when specifying the width, so that vectorize_width(4, fixed) means vectorizing with <4 x eltty> and vectorize_width(4, scalable) means vectorizing with <vscale x 4 x eltty>. Like @david-arm said, we also plan to add something like vectorize_style(fixed|scalable). This approach should be fully complementary to vectorize_width so that it would be possible to have:

// Use scalable vectors, but leave it to the cost-model to choose the most efficient N in <vscale x N x eltty>.
// If the pragma is not specified, it defaults to vectorize_style(fixed).
#pragma clang loop vectorize_style(scalable)

// Use <4 x eltty>
#pragma clang loop vectorize_width(4, fixed)

// Use <vscale x 4 x eltty>
#pragma clang loop vectorize_width(4, scalable)

// If vectorize_style(scalable) is specified, then use <vscale x 4 x eltty>, otherwise <4 x eltty>
#pragma clang loop vectorize_width(4)                           // uses <4 x eltty>
#pragma clang loop vectorize_width(4) vectorize_style(scalable) // uses <vscale x 4 x eltty>

// Conflicting options, clang should print diagnostic and error or ignore the hint.
#pragma clang loop vectorize_width(4, fixed) vectorize_style(scalable)

I hope that gives a bit more context.
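
The combinations listed above could be modelled roughly as below. Note that `vectorize_style` is only a proposal in this thread, so everything here is a hypothetical sketch: the `Style` enum and the `resolve` function are invented for illustration, and reporting a conflict as "no result" stands in for either of the two behaviours mentioned (diagnose and error, or ignore the hint).

```cpp
#include <optional>
#include <string>

enum class Style { Fixed, Scalable };

// Returns the vector type chosen for `vectorize_width(Width[, WidthKind])`
// combined with an optional `vectorize_style(StyleHint)`, or std::nullopt
// when the two hints contradict each other.
std::optional<std::string> resolve(unsigned Width,
                                   std::optional<Style> WidthKind,
                                   std::optional<Style> StyleHint) {
  if (WidthKind && StyleHint && *WidthKind != *StyleHint)
    return std::nullopt; // conflicting options

  // An explicit kind on vectorize_width is used if present; otherwise the
  // style hint applies; with neither present the default is fixed width.
  Style Chosen = WidthKind ? *WidthKind : StyleHint.value_or(Style::Fixed);
  std::string N = std::to_string(Width);
  return Chosen == Style::Scalable ? "<vscale x " + N + " x eltty>"
                                   : "<" + N + " x eltty>";
}
```

The case where only `vectorize_style(scalable)` is given and the cost model picks the width is deliberately left out, since it depends on the vectorizer rather than on the hints.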

Ok, thanks for clarifying that!

If:

// Use <vscale x 4 x eltty>
#pragma clang loop vectorize_width(4, scalable)

is equivalent to:

// uses <vscale x 4 x eltty>
#pragma clang loop vectorize_width(4) vectorize_style(scalable)

then I think that illustrates that I don't see the point of extending vectorize_width because we still can't express scalable vectorisation for:

// <VF x eltty>
#pragma clang loop vectorize_predicate(enable)

and also for interleave_count(4)?

Again, when the idea is to have vectorize_style anyway, wouldn't it be easier not to bother extending vectorize_width and just go for vectorize_style? It allows you to specify fixed/scalable vectorisation in one way, and avoids having conflicting options.

The other thing I thought about: this is extending an existing user-facing pragma, and notifying the list would probably be the best thing to do.

As I see it there are a bunch of pragmas that all enable vectorisation, with each pragma providing a unit of information. One component of this information is the vectorisation factor hint provided by vectorize_width.

With the introduction of scalable vectors this hint is using the wrong datatype and thus needs to be updated to allow vectorize_width(#num,[fixed|scalable]) and vectorize_width([fixed|scalable]) alongside the existing vectorize_width(#num) representation that effectively becomes an alias to vectorize_width(#num, fixed).

Doing this means all existing usages work as expected and there's now extra power to better guide the chosen vectorisation factor.

Because I was not understanding, we have discussed this further offline.

I think the conclusion was: pragma vectorize_width controls the vectorisation factor VF in <vscale x VF x elty>, where vscale is not just a separate thing but defines a VectorType. That's why it would make sense to attach scalable|fixed to vectorize_width. I agree with this and it seems reasonable.

I still don't see how the proposed extension here allows you to specify fixed width vectorisation for:

#pragma clang loop interleave_count(4)

targeting SVE and would appreciate if someone can comment on this example, but I won't be holding up this work anymore as this might be addressed later.

Thanks @david-arm for posting this proposal to the cfe list.
My confusion has been cleared up. The (new) proposal is to have:

vectorize_width(X) where X is an integer.
vectorize_width(X, fixed|scalable)
vectorize_width(fixed|scalable)

And with that 3rd option I agree that this allows us to express everything we want, and this patch needs adapting to this new proposal (just stating the obvious for clarity/completeness)
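
A rough sketch of how the three accepted forms might be distinguished is shown below. It works on the text between the parentheses and is not how Clang's parser is written; `WidthHint` and `parseVectorizeWidth` are hypothetical names for this illustration. Following the summary, a missing second argument is treated as fixed.

```cpp
#include <cctype>
#include <cstddef>
#include <optional>
#include <string>

struct WidthHint {
  std::optional<unsigned> Width; // absent for vectorize_width(fixed|scalable)
  bool Scalable = false;         // fixed is the default
};

// Parses "X", "X, fixed", "X, scalable", "fixed" or "scalable".
// Returns std::nullopt for anything else.
std::optional<WidthHint> parseVectorizeWidth(const std::string &Args) {
  // Simplification: strip all whitespace so "4, scalable" and "4,scalable"
  // are handled alike.
  std::string S;
  for (unsigned char C : Args)
    if (!std::isspace(C))
      S += static_cast<char>(C);

  WidthHint Hint;
  std::size_t Pos = 0;
  while (Pos < S.size() && std::isdigit(static_cast<unsigned char>(S[Pos])))
    ++Pos;
  if (Pos > 0) {
    Hint.Width = static_cast<unsigned>(std::stoul(S.substr(0, Pos)));
    if (*Hint.Width == 0)
      return std::nullopt; // the width must be a positive integer
    if (Pos == S.size())
      return Hint; // vectorize_width(X)
    if (S[Pos] != ',')
      return std::nullopt;
    ++Pos;
  }

  const std::string Kind = S.substr(Pos);
  if (Kind == "scalable")
    Hint.Scalable = true;
  else if (Kind != "fixed")
    return std::nullopt;
  return Hint;
}
```

Representing the width as an optional value keeps the third form, where only the kind is given, distinct from a width of any particular number.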

If scalable is used on a target that doesn't support this, a warning and falling back to fixed seems like the right thing to do.

I did have concerns about this, similar to @fhahn's:

Hm, I am just a bit worried that it might be a bit confusing to users that do not know what scalable vectors are (it is obvious when knowing all about SVE, but I would assume most people don't necessarily know what this means). I guess that is not the biggest deal, as long vectorize_width(X, scalable) works for every target.

But since the new scalable option is opt-in, people don't need to know about this if they don't want/need to, so this should (hopefully) not be the case.

sdesmalen mentioned this in rGd568cff696e8: [LoopVectorizer][SVE] Vectorize a simple loop with with a scalable VF. Dec 9 2020, 3:26 AM

david-arm updated this revision to Diff 312802. Dec 18 2020, 8:11 AM

david-arm edited the summary of this revision.

Herald added a subscriber: NickHung. Dec 18 2020, 8:11 AM

aaron.ballman added inline comments. Dec 18 2020, 8:16 AM

clang/include/clang/Basic/Attr.td
3302–3303	Should the documentation in AttrDocs.td be updated for this change?

david-arm added a reviewer: paulwalker-arm. Dec 21 2020, 8:20 AM

david-arm added inline comments. Dec 21 2020, 8:29 AM

clang/include/clang/Basic/Attr.td
3302–3303	Hi @aaron.ballman I had a look at LoopHintDocs in AttrDocs.td and it didn't explicitly mention these states, i.e. "assume_safety", "numeric", etc., so I'm not sure if it's necessary to add anything there?

aaron.ballman added inline comments. Dec 21 2020, 10:54 AM

clang/include/clang/Basic/Attr.td
3302–3303	Oh, I see now, we're deferring to the documentation in the language extensions document. I suppose that's fine as far as this patch goes, sorry for the noise.

Hi everyone, I realise that most people have probably been on holiday recently, but just a gentle ping here to see if anyone could take another look? Thanks!

LGTM, perhaps wait a day with committing in case there are more comments.

clang/include/clang/Basic/Attr.td
3302–3303	Nit: formatting, exceeding 80 columns?
3303	same?

sdesmalen added inline comments. Jan 6 2021, 2:02 PM

clang/docs/LanguageExtensions.rst
3034	nit: s/In this case//
3036	nit: Another use of vectorize_width is
3038–3040	nit: In both variants of the pragma the vectorizer may decide to fall back on fixed width vectorization if the target does not support scalable vectors.
clang/include/clang/Basic/DiagnosticParseKinds.td
1390	`use vectorize_width(X, fixed) or vectorize_width(X, scalable)` (it may otherwise lead to confusion whether fixed/scalable needs quotes, same below)
clang/lib/AST/AttrImpl.cpp
46 ↗	(On Diff #312802)	is there always a value, even when "vectorize_width(scalable)" is specified?
clang/lib/CodeGen/CGLoopInfo.cpp
754–756	is that not something to fix in the code that conditionally sets vectorize.enable later on instead of working around it here?

david-arm added inline comments. Jan 7 2021, 12:30 AM

clang/lib/CodeGen/CGLoopInfo.cpp
754–756	I did originally try to do that, but I had trouble with it and found it broke other places too. It ended up being simpler to fix it here, but I can play around with it again. Even if this is still the simplest solution I can come back with a more detailed explanation at least!

Updated documentation as per review comments.
Fixed an issue with using value->printPretty on a null ptr.
Reworked the code that sets vectorize.enable.

david-arm marked 8 inline comments as done. Jan 7 2021, 6:25 AM

LGTM, thanks for all the changes @david-arm!

clang/lib/AST/AttrImpl.cpp
46 ↗	(On Diff #315121)	nit:
if (value) {
  value->printPretty(OS, nullptr, Policy);
  OS << ", ";
}
OS << (state == ScalableWidth ? "scalable" : "fixed");
?
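
The suggested shape of the printing logic can be tried out in isolation. The sketch below swaps Clang's expression printing for a plain optional integer; `WidthState` and `printWidthArgs` are hypothetical names, and only the "print the value if there is one, then the fixed/scalable keyword" structure comes from the nit.

```cpp
#include <optional>
#include <sstream>
#include <string>

enum class WidthState { FixedWidth, ScalableWidth };

// Builds the text that would appear between the parentheses of
// vectorize_width(...). The value is optional because
// vectorize_width(scalable) carries no width expression.
std::string printWidthArgs(std::optional<unsigned> Value, WidthState State) {
  std::ostringstream OS;
  if (Value) {
    OS << *Value;
    OS << ", ";
  }
  OS << (State == WidthState::ScalableWidth ? "scalable" : "fixed");
  return OS.str();
}
```

Guarding the value this way avoids dereferencing a missing width, which is the null pointer issue mentioned in the update notes.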
clang/lib/CodeGen/CGLoopInfo.cpp
307	nit: `!= 1` (it should be functionally the same because the >1 is already caught above, but this is specifically testing that VF=1 (scalar) is not specified)

This revision is now accepted and ready to land. Jan 7 2021, 7:19 AM

Closed by commit rG38d18d93534d: [SVE] Add support to vectorize_width loop pragma for scalable vectors (authored by david-arm). Jan 8 2021, 3:58 AM

This revision was automatically updated to reflect the committed changes.

david-arm added a commit: rG38d18d93534d: [SVE] Add support to vectorize_width loop pragma for scalable vectors.

Revision Contents

Path	Size
clang/docs/LanguageExtensions.rst	11 lines
clang/include/clang/Basic/Attr.td	4 lines
clang/include/clang/Basic/DiagnosticParseKinds.td	3 lines
clang/include/clang/Basic/DiagnosticSemaKinds.td	3 lines
clang/include/clang/Basic/TargetInfo.h	2 lines
clang/lib/Basic/Targets/AArch64.h	2 lines
clang/lib/CodeGen/CGLoopInfo.h	7 lines
clang/lib/CodeGen/CGLoopInfo.cpp	55 lines
clang/lib/Parse/ParsePragma.cpp	18 lines
clang/lib/Sema/SemaStmtAttr.cpp	20 lines
clang/test/CodeGenCXX/pragma-loop.cpp	28 lines
clang/test/CodeGenCXX/pragma-scalable-loop.cpp	18 lines

Diff 302773

clang/docs/LanguageExtensions.rst

(3,022 lines not shown)
 .. code-block:: c++

 #pragma clang loop vectorize(enable)
 #pragma clang loop interleave(enable)
 for(...) {
 ...
 }

-The vector width is specified by ``vectorize_width(_value_)`` and the interleave
-count is specified by ``interleave_count(_value_)``, where
+The vector width is specified by
+``vectorize_width(_value_[, fixed|scalable])``, where __value__ is a positive
+integer and the type of vectorization can be specified with an optional
+second parameter. In this case 'fixed' is the default and refers to fixed width
+vectorization, whereas 'scalable' indicates the compiler should use scalable
+vectors instead. The 'scalable' option is currently experimental and is only
+intended to work for targets that support scalable vectors.
+
+The interleave count is specified by ``interleave_count(_value_)``, where
 _value_ is a positive integer. This is useful for specifying the optimal
 width/count of the set of target architectures supported by your application.

 .. code-block:: c++

 #pragma clang loop vectorize_width(2)
 #pragma clang loop interleave_count(2)
 for(...) {
 ...
 }

 Specifying a width/count of 1 disables the optimization, and is equivalent to
 ``vectorize(disable)`` or ``interleave(disable)``.

 Vector predication is enabled by ``vectorize_predicate(enable)``, for example:

 .. code-block:: c++

(649 lines not shown)

clang/include/clang/Basic/Attr.td

(3,293 lines not shown; enclosing context: let Args = [EnumArgument<"Option", "OptionType",)
 "unroll", "unroll_count", "unroll_and_jam", "unroll_and_jam_count",
 "pipeline", "pipeline_initiation_interval", "distribute",
 "vectorize_predicate"],
 ["Vectorize", "VectorizeWidth", "Interleave", "InterleaveCount",
 "Unroll", "UnrollCount", "UnrollAndJam", "UnrollAndJamCount",
 "PipelineDisabled", "PipelineInitiationInterval", "Distribute",
 "VectorizePredicate"]>,
 EnumArgument<"State", "LoopHintState",
-["enable", "disable", "numeric", "assume_safety", "full"],
-["Enable", "Disable", "Numeric", "AssumeSafety", "Full"]>,
+["enable", "disable", "numeric", "scalable_numeric", "assume_safety", "full"],
+["Enable", "Disable", "Numeric", "ScalableNumeric", "AssumeSafety", "Full"]>,
 ExprArgument<"Value">];

 let AdditionalMembers = [{
 static const char *getOptionName(int Option) {
 switch(Option) {
 case Vectorize: return "vectorize";
 case VectorizeWidth: return "vectorize_width";
 case Interleave: return "interleave";
(258 lines not shown)

clang/include/clang/Basic/DiagnosticParseKinds.td

(1,380 lines not shown)
 // Pragma loop support.
 def err_pragma_loop_missing_argument : Error<
 "missing argument; expected %select{an integer value|"
 "'enable'%select{|, 'full'}1%select{|, 'assume_safety'}2 or 'disable'}0">;
 def err_pragma_loop_invalid_option : Error<
 "%select{invalid|missing}0 option%select{ %1|}0; expected vectorize, "
 "vectorize_width, interleave, interleave_count, unroll, unroll_count, "
 "pipeline, pipeline_initiation_interval, vectorize_predicate, or distribute">;
+def err_pragma_loop_invalid_vectorize_option : Error<
+"vectorize_width loop hint malformed; use 'vectorize_width(X, scalable)' or "
+"'vectorize_width(X, fixed)' where X is an integer">;

 def err_pragma_fp_invalid_option : Error<
 "%select{invalid|missing}0 option%select{ %1|}0; expected 'contract', 'reassociate' or 'exceptions'">;
 def err_pragma_fp_invalid_argument : Error<
 "unexpected argument '%0' to '#pragma clang fp %1'; expected "
 "%select{"
 "'fast' or 'on' or 'off'|"
 "'on' or 'off'|"
(89 lines not shown)

clang/include/clang/Basic/DiagnosticSemaKinds.td

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 929 Lines • ▼ Show 20 Lines	def note_pragma_attribute_region_ends_here : Note<
"'#pragma clang attribute push' regions ends here">;		"'#pragma clang attribute push' regions ends here">;
def err_pragma_attribute_no_pop_eof : Error<"unterminated "		def err_pragma_attribute_no_pop_eof : Error<"unterminated "
"'#pragma clang attribute push' at end of file">;		"'#pragma clang attribute push' at end of file">;
def note_pragma_attribute_applied_decl_here : Note<		def note_pragma_attribute_applied_decl_here : Note<
"when applied to this declaration">;		"when applied to this declaration">;
def err_pragma_attr_attr_no_push : Error<		def err_pragma_attr_attr_no_push : Error<
"'#pragma clang attribute' attribute with no matching "		"'#pragma clang attribute' attribute with no matching "
"'#pragma clang attribute push'">;		"'#pragma clang attribute push'">;
		def warn_pragma_attribute_scalable_unused : Warning<
		"ignoring scalable vectorize_width flag due to lack of target support">,
		sdesmalen (Done): From what I can see, the vectorize_width flag is not ignored; only the scalable property is. That means this should be: 'scalable' not supported by the target so assuming 'fixed' instead.
		david-arm (Author, Done): OK. I guess it's just that when the warning comes out it appears at the start of the line, so I wanted to emphasise that this relates to the scalable property passed to the vectorize_width attribute (rather than other attributes), as there could potentially be several pragmas on one line. I think it would be good to mention the vectorize_width pragma/attribute somewhere in the warning message to make it clear. I'll see if I can reword it.
		InGroup<PragmaClangAttribute>;

/// Objective-C parser diagnostics		/// Objective-C parser diagnostics
def err_duplicate_class_def : Error<		def err_duplicate_class_def : Error<
"duplicate interface definition for class %0">;		"duplicate interface definition for class %0">;
def err_undef_superclass : Error<		def err_undef_superclass : Error<
"cannot find interface declaration for %0, superclass of %1">;		"cannot find interface declaration for %0, superclass of %1">;
def err_forward_superclass : Error<		def err_forward_superclass : Error<
"attempting to use the forward class %0 as superclass of %1">;		"attempting to use the forward class %0 as superclass of %1">;
▲ Show 20 Lines • Show All 10,105 Lines • Show Last 20 Lines

clang/include/clang/Basic/TargetInfo.h

Show First 20 Lines • Show All 1,231 Lines • ▼ Show 20 Lines	public:

/// Identify whether this target supports multiversioning of functions,		/// Identify whether this target supports multiversioning of functions,
/// which requires support for cpu_supports and cpu_is functionality.		/// which requires support for cpu_supports and cpu_is functionality.
bool supportsMultiVersioning() const { return getTriple().isX86(); }		bool supportsMultiVersioning() const { return getTriple().isX86(); }

/// Identify whether this target supports IFuncs.		/// Identify whether this target supports IFuncs.
bool supportsIFunc() const { return getTriple().isOSBinFormatELF(); }		bool supportsIFunc() const { return getTriple().isOSBinFormatELF(); }

		virtual bool supportsScalableVectors() const { return false; }

// Validate the contents of the __builtin_cpu_supports(const char*)		// Validate the contents of the __builtin_cpu_supports(const char*)
// argument.		// argument.
virtual bool validateCpuSupports(StringRef Name) const { return false; }		virtual bool validateCpuSupports(StringRef Name) const { return false; }

// Return the target-specific priority for features/cpus/vendors so		// Return the target-specific priority for features/cpus/vendors so
// that they can be properly sorted for checking.		// that they can be properly sorted for checking.
virtual unsigned multiVersionSortPriority(StringRef Name) const {		virtual unsigned multiVersionSortPriority(StringRef Name) const {
return 0;		return 0;
▲ Show 20 Lines • Show All 258 Lines • Show Last 20 Lines

clang/lib/Basic/Targets/AArch64.h

Show First 20 Lines • Show All 124 Lines • ▼ Show 20 Lines	public:
}		}

int getEHDataRegisterNumber(unsigned RegNo) const override;		int getEHDataRegisterNumber(unsigned RegNo) const override;

const char *getBFloat16Mangling() const override { return "u6__bf16"; };		const char *getBFloat16Mangling() const override { return "u6__bf16"; };
bool hasInt128Type() const override;		bool hasInt128Type() const override;

bool hasExtIntType() const override { return true; }		bool hasExtIntType() const override { return true; }

		bool supportsScalableVectors() const override { return FPU & SveMode; }
};		};

class LLVM_LIBRARY_VISIBILITY AArch64leTargetInfo : public AArch64TargetInfo {		class LLVM_LIBRARY_VISIBILITY AArch64leTargetInfo : public AArch64TargetInfo {
public:		public:
AArch64leTargetInfo(const llvm::Triple &Triple, const TargetOptions &Opts);		AArch64leTargetInfo(const llvm::Triple &Triple, const TargetOptions &Opts);

void getTargetDefines(const LangOptions &Opts,		void getTargetDefines(const LangOptions &Opts,
MacroBuilder &Builder) const override;		MacroBuilder &Builder) const override;
▲ Show 20 Lines • Show All 78 Lines • Show Last 20 Lines
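
The two header changes above amount to a default-off virtual hook on the target description plus a single override that tests a feature bit. The following is a minimal standalone sketch of that pattern, combined with the fall-back-to-fixed behaviour discussed in the review for targets without support. `TargetDesc`, `SveLikeTarget`, `FeatureBits`, and `resolveWidthKind` are hypothetical names invented for illustration; they are not Clang classes, and the real hook lives on `TargetInfo`.

```cpp
#include <cstdint>

// Hypothetical stand-in for a target description; not clang::TargetInfo.
struct TargetDesc {
  virtual ~TargetDesc() = default;
  // Default: a target does not claim scalable-vector support.
  virtual bool supportsScalableVectors() const { return false; }
};

// Hypothetical feature bits, mirroring the idea behind `FPU & SveMode`.
enum FeatureBits : std::uint32_t { NeonBit = 1u << 0, SveBit = 1u << 1 };

// A target that opts in only when its feature mask has the SVE-like bit set.
struct SveLikeTarget : TargetDesc {
  std::uint32_t Features;
  explicit SveLikeTarget(std::uint32_t F) : Features(F) {}
  bool supportsScalableVectors() const override {
    return (Features & SveBit) != 0;
  }
};

enum class WidthKind { Fixed, Scalable };

// Honour a 'scalable' request only when the target supports it. Otherwise
// set Warned (standing in for emitting a diagnostic) and fall back to fixed.
WidthKind resolveWidthKind(bool RequestedScalable, const TargetDesc &T,
                           bool &Warned) {
  Warned = false;
  if (!RequestedScalable)
    return WidthKind::Fixed;
  if (!T.supportsScalableVectors()) {
    Warned = true;
    return WidthKind::Fixed;
  }
  return WidthKind::Scalable;
}
```

Keeping the default at `false` means every existing target keeps its current behaviour until it explicitly overrides the hook, which matches the compatibility goal stated in the summary.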

clang/lib/CodeGen/CGLoopInfo.h

Show All 13 Lines
#ifndef LLVM_CLANG_LIB_CODEGEN_CGLOOPINFO_H		#ifndef LLVM_CLANG_LIB_CODEGEN_CGLOOPINFO_H
#define LLVM_CLANG_LIB_CODEGEN_CGLOOPINFO_H		#define LLVM_CLANG_LIB_CODEGEN_CGLOOPINFO_H

#include "llvm/ADT/ArrayRef.h"		#include "llvm/ADT/ArrayRef.h"
#include "llvm/ADT/SmallVector.h"		#include "llvm/ADT/SmallVector.h"
#include "llvm/IR/DebugLoc.h"		#include "llvm/IR/DebugLoc.h"
#include "llvm/IR/Value.h"		#include "llvm/IR/Value.h"
#include "llvm/Support/Compiler.h"		#include "llvm/Support/Compiler.h"
		#include "llvm/Support/TypeSize.h"

namespace llvm {		namespace llvm {
class BasicBlock;		class BasicBlock;
class Instruction;		class Instruction;
class MDNode;		class MDNode;
} // end namespace llvm		} // end namespace llvm

namespace clang {		namespace clang {
Show All 21 Lines	struct LoopAttributes {

/// Value for llvm.loop.unroll_and_jam.* metadata (enable, disable, or full).		/// Value for llvm.loop.unroll_and_jam.* metadata (enable, disable, or full).
LVEnableState UnrollAndJamEnable;		LVEnableState UnrollAndJamEnable;

/// Value for llvm.loop.vectorize.predicate metadata		/// Value for llvm.loop.vectorize.predicate metadata
LVEnableState VectorizePredicateEnable;		LVEnableState VectorizePredicateEnable;

/// Value for llvm.loop.vectorize.width metadata.		/// Value for llvm.loop.vectorize.width metadata.
unsigned VectorizeWidth;		llvm::ElementCount VectorizeWidth;

/// Value for llvm.loop.interleave.count metadata.		/// Value for llvm.loop.interleave.count metadata.
unsigned InterleaveCount;		unsigned InterleaveCount;

/// llvm.unroll.		/// llvm.unroll.
unsigned UnrollCount;		unsigned UnrollCount;

/// llvm.unroll.		/// llvm.unroll.
▲ Show 20 Lines • Show All 180 Lines • ▼ Show 20 Lines	public:
}		}

/// Set the next pushed loop unroll_and_jam state.		/// Set the next pushed loop unroll_and_jam state.
void setUnrollAndJamState(const LoopAttributes::LVEnableState &State) {		void setUnrollAndJamState(const LoopAttributes::LVEnableState &State) {
StagedAttrs.UnrollAndJamEnable = State;		StagedAttrs.UnrollAndJamEnable = State;
}		}

/// Set the vectorize width for the next loop pushed.		/// Set the vectorize width for the next loop pushed.
void setVectorizeWidth(unsigned W) { StagedAttrs.VectorizeWidth = W; }		void setVectorizeWidth(unsigned W, bool IsScalable = false) {
		StagedAttrs.VectorizeWidth = llvm::ElementCount::get(W, IsScalable);
		}

/// Set the interleave count for the next loop pushed.		/// Set the interleave count for the next loop pushed.
void setInterleaveCount(unsigned C) { StagedAttrs.InterleaveCount = C; }		void setInterleaveCount(unsigned C) { StagedAttrs.InterleaveCount = C; }

/// Set the unroll count for the next loop pushed.		/// Set the unroll count for the next loop pushed.
void setUnrollCount(unsigned C) { StagedAttrs.UnrollCount = C; }		void setUnrollCount(unsigned C) { StagedAttrs.UnrollCount = C; }

/// \brief Set the unroll count for the next loop pushed.		/// \brief Set the unroll count for the next loop pushed.
Show All 26 Lines
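
The header change above replaces a plain `unsigned` width with a (minimum count, scalable flag) pair. The sketch below shows that idea in isolation, assuming a simplified stand-in type. `WidthHint`, `StagedLoopAttrs`, and `LoopStack` are hypothetical names; `WidthHint` only imitates the handful of `llvm::ElementCount` accessors used in this diff and is not the LLVM class.

```cpp
// Simplified stand-in for llvm::ElementCount: a minimum element count plus a
// flag saying whether the real count is a runtime multiple of that minimum.
class WidthHint {
  unsigned MinVal;
  bool Scalable;
  WidthHint(unsigned M, bool S) : MinVal(M), Scalable(S) {}

public:
  static WidthHint get(unsigned M, bool S) { return WidthHint(M, S); }
  static WidthHint getFixed(unsigned M) { return WidthHint(M, false); }

  unsigned getKnownMinValue() const { return MinVal; }
  bool isScalable() const { return Scalable; }
  bool isZero() const { return MinVal == 0; }
  bool isNonZero() const { return MinVal != 0; }
};

// Hypothetical container for the attributes staged for the next loop.
struct StagedLoopAttrs {
  // A fixed width of 0 means "no width hint given".
  WidthHint VectorizeWidth = WidthHint::getFixed(0);
};

struct LoopStack {
  StagedLoopAttrs Staged;

  // The defaulted flag keeps existing one-argument callers on fixed width.
  void setVectorizeWidth(unsigned W, bool IsScalable = false) {
    Staged.VectorizeWidth = WidthHint::get(W, IsScalable);
  }
};
```

Because the pair no longer converts implicitly to an integer, comparisons such as `!= 0` have to be spelled as `isNonZero()` or `getKnownMinValue() > 0`, which is why the accompanying `.cpp` changes touch every use of the field.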

clang/lib/CodeGen/CGLoopInfo.cpp

Show First 20 Lines • Show All 211 Lines • ▼ Show 20 Lines	LoopInfo::createLoopVectorizeMetadata(const LoopAttributes &Attrs,
bool &HasUserTransforms) {		bool &HasUserTransforms) {
LLVMContext &Ctx = Header->getContext();		LLVMContext &Ctx = Header->getContext();

Optional<bool> Enabled;		Optional<bool> Enabled;
if (Attrs.VectorizeEnable == LoopAttributes::Disable)		if (Attrs.VectorizeEnable == LoopAttributes::Disable)
Enabled = false;		Enabled = false;
else if (Attrs.VectorizeEnable != LoopAttributes::Unspecified ||		else if (Attrs.VectorizeEnable != LoopAttributes::Unspecified ||
Attrs.VectorizePredicateEnable != LoopAttributes::Unspecified ||		Attrs.VectorizePredicateEnable != LoopAttributes::Unspecified ||
Attrs.InterleaveCount != 0 || Attrs.VectorizeWidth != 0)		Attrs.InterleaveCount != 0 || Attrs.VectorizeWidth.isNonZero())
Enabled = true;		Enabled = true;

if (Enabled != true) {		if (Enabled != true) {
SmallVector<Metadata *, 4> NewLoopProperties;		SmallVector<Metadata *, 4> NewLoopProperties;
if (Enabled == false) {		if (Enabled == false) {
NewLoopProperties.append(LoopProperties.begin(), LoopProperties.end());		NewLoopProperties.append(LoopProperties.begin(), LoopProperties.end());
NewLoopProperties.push_back(		NewLoopProperties.push_back(
MDNode::get(Ctx, {MDString::get(Ctx, "llvm.loop.vectorize.enable"),		MDNode::get(Ctx, {MDString::get(Ctx, "llvm.loop.vectorize.enable"),
Show All 19 Lines	LoopInfo::createLoopVectorizeMetadata(const LoopAttributes &Attrs,
SmallVector<Metadata *, 4> Args;		SmallVector<Metadata *, 4> Args;
Args.push_back(nullptr);		Args.push_back(nullptr);
Args.append(LoopProperties.begin(), LoopProperties.end());		Args.append(LoopProperties.begin(), LoopProperties.end());

// Setting vectorize.predicate		// Setting vectorize.predicate
bool IsVectorPredicateEnabled = false;		bool IsVectorPredicateEnabled = false;
if (Attrs.VectorizePredicateEnable != LoopAttributes::Unspecified &&		if (Attrs.VectorizePredicateEnable != LoopAttributes::Unspecified &&
Attrs.VectorizeEnable != LoopAttributes::Disable &&		Attrs.VectorizeEnable != LoopAttributes::Disable &&
Attrs.VectorizeWidth < 1) {		Attrs.VectorizeWidth.getKnownMinValue() < 1) {

IsVectorPredicateEnabled =		IsVectorPredicateEnabled =
(Attrs.VectorizePredicateEnable == LoopAttributes::Enable);		(Attrs.VectorizePredicateEnable == LoopAttributes::Enable);

Metadata *Vals[] = {		Metadata *Vals[] = {
MDString::get(Ctx, "llvm.loop.vectorize.predicate.enable"),		MDString::get(Ctx, "llvm.loop.vectorize.predicate.enable"),
ConstantAsMetadata::get(ConstantInt::get(llvm::Type::getInt1Ty(Ctx),		ConstantAsMetadata::get(ConstantInt::get(llvm::Type::getInt1Ty(Ctx),
IsVectorPredicateEnabled))};		IsVectorPredicateEnabled))};
Args.push_back(MDNode::get(Ctx, Vals));		Args.push_back(MDNode::get(Ctx, Vals));
}		}

// Setting vectorize.width		// Setting vectorize.width
if (Attrs.VectorizeWidth > 0) {		if (Attrs.VectorizeWidth.getKnownMinValue() > 0) {
Metadata *Vals[] = {		if (Attrs.VectorizeWidth.isScalable()) {
MDString::get(Ctx, "llvm.loop.vectorize.width"),		Metadata *ECArgs[] = {
ConstantAsMetadata::get(ConstantInt::get(llvm::Type::getInt32Ty(Ctx),		ConstantAsMetadata::get(
Attrs.VectorizeWidth))};		ConstantInt::get(llvm::Type::getInt32Ty(Ctx),
		Attrs.VectorizeWidth.getKnownMinValue())),
		ConstantAsMetadata::get(ConstantInt::get(
		llvm::Type::getInt1Ty(Ctx), Attrs.VectorizeWidth.isScalable()))};
		MDNode *EC = MDNode::get(Ctx, ECArgs);
		Metadata *Vals[] = {MDString::get(Ctx, "llvm.loop.vectorize.width"), EC};
Args.push_back(MDNode::get(Ctx, Vals));		Args.push_back(MDNode::get(Ctx, Vals));
		} else {
		Metadata *Vals[] = {MDString::get(Ctx, "llvm.loop.vectorize.width"),
		ConstantAsMetadata::get(ConstantInt::get(
		llvm::Type::getInt32Ty(Ctx),
		Attrs.VectorizeWidth.getKnownMinValue()))};

		Args.push_back(MDNode::get(Ctx, Vals));
		}
}		}

// Setting interleave.count		// Setting interleave.count
if (Attrs.InterleaveCount > 0) {		if (Attrs.InterleaveCount > 0) {
Metadata *Vals[] = {		Metadata *Vals[] = {
MDString::get(Ctx, "llvm.loop.interleave.count"),		MDString::get(Ctx, "llvm.loop.interleave.count"),
ConstantAsMetadata::get(ConstantInt::get(llvm::Type::getInt32Ty(Ctx),		ConstantAsMetadata::get(ConstantInt::get(llvm::Type::getInt32Ty(Ctx),
Attrs.InterleaveCount))};		Attrs.InterleaveCount))};
Args.push_back(MDNode::get(Ctx, Vals));		Args.push_back(MDNode::get(Ctx, Vals));
}		}

// vectorize.enable is set if:		// vectorize.enable is set if:
// 1) loop hint vectorize.enable is set, or		// 1) loop hint vectorize.enable is set, or
// 2) it is implied when vectorize.predicate is set, or		// 2) it is implied when vectorize.predicate is set, or
// 3) it is implied when vectorize.width is set.		// 3) it is implied when vectorize.width is set.
if (Attrs.VectorizeEnable != LoopAttributes::Unspecified ||		if (Attrs.VectorizeEnable != LoopAttributes::Unspecified ||
IsVectorPredicateEnabled ||		IsVectorPredicateEnabled || Attrs.VectorizeWidth.getKnownMinValue() > 1 ||
Attrs.VectorizeWidth > 1 ) {		(Attrs.VectorizeWidth.getKnownMinValue() == 1 &&
		Attrs.VectorizeWidth.isScalable())) {
bool AttrVal = Attrs.VectorizeEnable != LoopAttributes::Disable;		bool AttrVal = Attrs.VectorizeEnable != LoopAttributes::Disable;
		sdesmalen (Not Done): nit: `!= 1` (it should be functionally the same because the >1 is already caught above, but this is specifically testing that VF=1 (scalar) is not specified)
Args.push_back(		Args.push_back(
MDNode::get(Ctx, {MDString::get(Ctx, "llvm.loop.vectorize.enable"),		MDNode::get(Ctx, {MDString::get(Ctx, "llvm.loop.vectorize.enable"),
ConstantAsMetadata::get(ConstantInt::get(		ConstantAsMetadata::get(ConstantInt::get(
llvm::Type::getInt1Ty(Ctx), AttrVal))}));		llvm::Type::getInt1Ty(Ctx), AttrVal))}));
}		}

if (FollowupHasTransforms)		if (FollowupHasTransforms)
Args.push_back(MDNode::get(		Args.push_back(MDNode::get(
▲ Show 20 Lines • Show All 121 Lines • ▼ Show 20 Lines	LoopProperties.insert(LoopProperties.end(), AdditionalLoopProperties.begin(),
AdditionalLoopProperties.end());		AdditionalLoopProperties.end());
return createFullUnrollMetadata(Attrs, LoopProperties, HasUserTransforms);		return createFullUnrollMetadata(Attrs, LoopProperties, HasUserTransforms);
}		}

LoopAttributes::LoopAttributes(bool IsParallel)		LoopAttributes::LoopAttributes(bool IsParallel)
: IsParallel(IsParallel), VectorizeEnable(LoopAttributes::Unspecified),		: IsParallel(IsParallel), VectorizeEnable(LoopAttributes::Unspecified),
UnrollEnable(LoopAttributes::Unspecified),		UnrollEnable(LoopAttributes::Unspecified),
UnrollAndJamEnable(LoopAttributes::Unspecified),		UnrollAndJamEnable(LoopAttributes::Unspecified),
VectorizePredicateEnable(LoopAttributes::Unspecified), VectorizeWidth(0),		VectorizePredicateEnable(LoopAttributes::Unspecified),
InterleaveCount(0), UnrollCount(0), UnrollAndJamCount(0),		VectorizeWidth(ElementCount::getFixed(0)), InterleaveCount(0),
		UnrollCount(0), UnrollAndJamCount(0),
DistributeEnable(LoopAttributes::Unspecified), PipelineDisabled(false),		DistributeEnable(LoopAttributes::Unspecified), PipelineDisabled(false),
PipelineInitiationInterval(0) {}		PipelineInitiationInterval(0) {}

void LoopAttributes::clear() {		void LoopAttributes::clear() {
IsParallel = false;		IsParallel = false;
VectorizeWidth = 0;		VectorizeWidth = ElementCount::getFixed(0);
InterleaveCount = 0;		InterleaveCount = 0;
UnrollCount = 0;		UnrollCount = 0;
UnrollAndJamCount = 0;		UnrollAndJamCount = 0;
VectorizeEnable = LoopAttributes::Unspecified;		VectorizeEnable = LoopAttributes::Unspecified;
UnrollEnable = LoopAttributes::Unspecified;		UnrollEnable = LoopAttributes::Unspecified;
UnrollAndJamEnable = LoopAttributes::Unspecified;		UnrollAndJamEnable = LoopAttributes::Unspecified;
VectorizePredicateEnable = LoopAttributes::Unspecified;		VectorizePredicateEnable = LoopAttributes::Unspecified;
DistributeEnable = LoopAttributes::Unspecified;		DistributeEnable = LoopAttributes::Unspecified;
PipelineDisabled = false;		PipelineDisabled = false;
PipelineInitiationInterval = 0;		PipelineInitiationInterval = 0;
}		}

LoopInfo::LoopInfo(BasicBlock *Header, const LoopAttributes &Attrs,		LoopInfo::LoopInfo(BasicBlock *Header, const LoopAttributes &Attrs,
const llvm::DebugLoc &StartLoc, const llvm::DebugLoc &EndLoc,		const llvm::DebugLoc &StartLoc, const llvm::DebugLoc &EndLoc,
LoopInfo *Parent)		LoopInfo *Parent)
: Header(Header), Attrs(Attrs), StartLoc(StartLoc), EndLoc(EndLoc),		: Header(Header), Attrs(Attrs), StartLoc(StartLoc), EndLoc(EndLoc),
Parent(Parent) {		Parent(Parent) {

if (Attrs.IsParallel) {		if (Attrs.IsParallel) {
// Create an access group for this loop.		// Create an access group for this loop.
LLVMContext &Ctx = Header->getContext();		LLVMContext &Ctx = Header->getContext();
AccGroup = MDNode::getDistinct(Ctx, {});		AccGroup = MDNode::getDistinct(Ctx, {});
}		}

if (!Attrs.IsParallel && Attrs.VectorizeWidth == 0 &&		if (!Attrs.IsParallel && Attrs.VectorizeWidth.isZero() &&
Attrs.InterleaveCount == 0 && Attrs.UnrollCount == 0 &&		Attrs.InterleaveCount == 0 && Attrs.UnrollCount == 0 &&
Attrs.UnrollAndJamCount == 0 && !Attrs.PipelineDisabled &&		Attrs.UnrollAndJamCount == 0 && !Attrs.PipelineDisabled &&
Attrs.PipelineInitiationInterval == 0 &&		Attrs.PipelineInitiationInterval == 0 &&
Attrs.VectorizePredicateEnable == LoopAttributes::Unspecified &&		Attrs.VectorizePredicateEnable == LoopAttributes::Unspecified &&
Attrs.VectorizeEnable == LoopAttributes::Unspecified &&		Attrs.VectorizeEnable == LoopAttributes::Unspecified &&
Attrs.UnrollEnable == LoopAttributes::Unspecified &&		Attrs.UnrollEnable == LoopAttributes::Unspecified &&
Attrs.UnrollAndJamEnable == LoopAttributes::Unspecified &&		Attrs.UnrollAndJamEnable == LoopAttributes::Unspecified &&
Attrs.DistributeEnable == LoopAttributes::Unspecified && !StartLoc &&		Attrs.DistributeEnable == LoopAttributes::Unspecified && !StartLoc &&
▲ Show 20 Lines • Show All 61 Lines • ▼ Show 20 Lines	if (Parent && (Parent->Attrs.UnrollAndJamEnable \|\|
if (!Parent->UnrollAndJamInnerFollowup) {		if (!Parent->UnrollAndJamInnerFollowup) {
// Splitting the attributes into a BeforeJam and an AfterJam part will		// Splitting the attributes into a BeforeJam and an AfterJam part will
// stop 'llvm.loop.isvectorized' (generated by vectorization in BeforeJam)		// stop 'llvm.loop.isvectorized' (generated by vectorization in BeforeJam)
// to be forwarded to the AfterJam part. We detect the situation here and		// to be forwarded to the AfterJam part. We detect the situation here and
// add it manually.		// add it manually.
SmallVector<Metadata *, 1> BeforeLoopProperties;		SmallVector<Metadata *, 1> BeforeLoopProperties;
if (BeforeJam.VectorizeEnable != LoopAttributes::Unspecified ||		if (BeforeJam.VectorizeEnable != LoopAttributes::Unspecified ||
BeforeJam.VectorizePredicateEnable != LoopAttributes::Unspecified ||		BeforeJam.VectorizePredicateEnable != LoopAttributes::Unspecified ||
BeforeJam.InterleaveCount != 0 || BeforeJam.VectorizeWidth != 0)		BeforeJam.InterleaveCount != 0 || BeforeJam.VectorizeWidth.isNonZero())
BeforeLoopProperties.push_back(		BeforeLoopProperties.push_back(
MDNode::get(Ctx, MDString::get(Ctx, "llvm.loop.isvectorized")));		MDNode::get(Ctx, MDString::get(Ctx, "llvm.loop.isvectorized")));

bool InnerFollowupHasTransform = false;		bool InnerFollowupHasTransform = false;
MDNode *InnerFollowup = createMetadata(AfterJam, BeforeLoopProperties,		MDNode *InnerFollowup = createMetadata(AfterJam, BeforeLoopProperties,
InnerFollowupHasTransform);		InnerFollowupHasTransform);
if (InnerFollowupHasTransform)		if (InnerFollowupHasTransform)
Parent->UnrollAndJamInnerFollowup = InnerFollowup;		Parent->UnrollAndJamInnerFollowup = InnerFollowup;
▲ Show 20 Lines • Show All 162 Lines • ▼ Show 20 Lines	case LoopHintAttr::Full:
case LoopHintAttr::Distribute:		case LoopHintAttr::Distribute:
case LoopHintAttr::PipelineDisabled:		case LoopHintAttr::PipelineDisabled:
case LoopHintAttr::PipelineInitiationInterval:		case LoopHintAttr::PipelineInitiationInterval:
case LoopHintAttr::VectorizePredicate:		case LoopHintAttr::VectorizePredicate:
llvm_unreachable("Options cannot be used with 'full' hint.");		llvm_unreachable("Options cannot be used with 'full' hint.");
break;		break;
}		}
break;		break;
		case LoopHintAttr::ScalableNumeric:
		switch (Option) {
		case LoopHintAttr::VectorizeWidth:
		setVectorizeWidth(ValueInt, true);
		break;
		default:
		llvm_unreachable("Options cannot be used with 'scalable' hint.");
		break;
		}
		break;
case LoopHintAttr::Numeric:		case LoopHintAttr::Numeric:
switch (Option) {		switch (Option) {
case LoopHintAttr::VectorizeWidth:		case LoopHintAttr::VectorizeWidth:
setVectorizeWidth(ValueInt);		setVectorizeWidth(ValueInt);
break;		break;
case LoopHintAttr::InterleaveCount:		case LoopHintAttr::InterleaveCount:
setInterleaveCount(ValueInt);		setInterleaveCount(ValueInt);
break;		break;
case LoopHintAttr::UnrollCount:		case LoopHintAttr::UnrollCount:
setUnrollCount(ValueInt);		setUnrollCount(ValueInt);
break;		break;
case LoopHintAttr::UnrollAndJamCount:		case LoopHintAttr::UnrollAndJamCount:
		sdesmalen (Done): Is that not something to fix in the code that conditionally sets vectorize.enable later on, instead of working around it here?
		david-arm (Author, Done): I did originally try to do that, but I had trouble with it and found it broke other places too. It ended up being simpler to fix it here, but I can play around with it again. Even if this is still the simplest solution, I can at least come back with a more detailed explanation!
setUnrollAndJamCount(ValueInt);		setUnrollAndJamCount(ValueInt);
break;		break;
case LoopHintAttr::PipelineInitiationInterval:		case LoopHintAttr::PipelineInitiationInterval:
setPipelineInitiationInterval(ValueInt);		setPipelineInitiationInterval(ValueInt);
break;		break;
case LoopHintAttr::Unroll:		case LoopHintAttr::Unroll:
case LoopHintAttr::UnrollAndJam:		case LoopHintAttr::UnrollAndJam:
case LoopHintAttr::VectorizePredicate:		case LoopHintAttr::VectorizePredicate:
▲ Show 20 Lines • Show All 61 Lines • Show Last 20 Lines
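
As a reading aid for the `createLoopVectorizeMetadata` changes above, the sketch below models two decisions from this revision of the diff: the shape of the `llvm.loop.vectorize.width` operand, and when a width hint alone implies `llvm.loop.vectorize.enable`. Both helper names are hypothetical, and the strings are an informal text rendering rather than output from LLVM. The patch summary mentions `llvm.loop.vectorize.scalable.enable`, so the encoding that was finally committed may differ from what this revision builds.

```cpp
#include <string>

// Informal text form of the width node as this revision constructs it:
// a fixed width stays a plain i32, while a scalable width becomes a nested
// {i32 min, i1 true} node.
std::string describeWidthOperand(unsigned MinVal, bool Scalable) {
  const std::string Key = "!\"llvm.loop.vectorize.width\"";
  const std::string Min = "i32 " + std::to_string(MinVal);
  if (Scalable)
    return "!{" + Key + ", !{" + Min + ", i1 true}}";
  return "!{" + Key + ", " + Min + "}";
}

// Mirrors the updated condition: any width above 1 implies enabling
// vectorization, and so does a scalable width of exactly 1, since
// <vscale x 1> is still a vector. A fixed width of 1 means scalar code.
bool widthImpliesVectorizeEnable(unsigned MinVal, bool Scalable) {
  return MinVal > 1 || (MinVal == 1 && Scalable);
}
```

For example, `describeWidthOperand(4, true)` yields `!{!"llvm.loop.vectorize.width", !{i32 4, i1 true}}`, and `widthImpliesVectorizeEnable(1, false)` is `false` while `widthImpliesVectorizeEnable(1, true)` is `true`.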

clang/lib/Parse/ParsePragma.cpp

Show First 20 Lines • Show All 1,089 Lines • ▼ Show 20 Lines	return std::string(llvm::StringSwitch<StringRef>(Str)
.Case("unroll", Str)		.Case("unroll", Str)
.Default(""));		.Default(""));
}		}

bool Parser::HandlePragmaLoopHint(LoopHint &Hint) {		bool Parser::HandlePragmaLoopHint(LoopHint &Hint) {
assert(Tok.is(tok::annot_pragma_loop_hint));		assert(Tok.is(tok::annot_pragma_loop_hint));
PragmaLoopHintInfo *Info =		PragmaLoopHintInfo *Info =
static_cast<PragmaLoopHintInfo *>(Tok.getAnnotationValue());		static_cast<PragmaLoopHintInfo *>(Tok.getAnnotationValue());

fhahn (Not Done): nit: unrelated change?
IdentifierInfo *PragmaNameInfo = Info->PragmaName.getIdentifierInfo();		IdentifierInfo *PragmaNameInfo = Info->PragmaName.getIdentifierInfo();
Hint.PragmaNameLoc = IdentifierLoc::create(		Hint.PragmaNameLoc = IdentifierLoc::create(
Actions.Context, Info->PragmaName.getLocation(), PragmaNameInfo);		Actions.Context, Info->PragmaName.getLocation(), PragmaNameInfo);

// It is possible that the loop hint has no option identifier, such as		// It is possible that the loop hint has no option identifier, such as
// #pragma unroll(4).		// #pragma unroll(4).
IdentifierInfo *OptionInfo = Info->Option.is(tok::identifier)		IdentifierInfo *OptionInfo = Info->Option.is(tok::identifier)
? Info->Option.getIdentifierInfo()		? Info->Option.getIdentifierInfo()
▲ Show 20 Lines • Show All 83 Lines • ▼ Show 20 Lines	bool Parser::HandlePragmaLoopHint(LoopHint &Hint) {
} else {		} else {
// Enter constant expression including eof terminator into token stream.		// Enter constant expression including eof terminator into token stream.
PP.EnterTokenStream(Toks, /*DisableMacroExpansion=*/false,		PP.EnterTokenStream(Toks, /*DisableMacroExpansion=*/false,
/*IsReinject=*/false);		/*IsReinject=*/false);
ConsumeAnnotationToken();		ConsumeAnnotationToken();

ExprResult R = ParseConstantExpression();		ExprResult R = ParseConstantExpression();

		if (OptionInfo && OptionInfo->getName() == "vectorize_width" &&
		Tok.is(tok::comma)) {
		PP.Lex(Tok); // ,

		SourceLocation StateLoc = Tok.getLocation();
		IdentifierInfo *StateInfo = Tok.getIdentifierInfo();
		StringRef IsScalableStr = StateInfo->getName();

		if (IsScalableStr != "scalable" && IsScalableStr != "fixed") {
		Diag(Tok.getLocation(), diag::err_pragma_loop_invalid_vectorize_option);
		return false;
		}
		PP.Lex(Tok); // Identifier

		Hint.StateLoc =
		IdentifierLoc::create(Actions.Context, StateLoc, StateInfo);
		}

// Tokens following an error in an ill-formed constant expression will		// Tokens following an error in an ill-formed constant expression will
// remain in the token stream and must be removed.		// remain in the token stream and must be removed.
if (Tok.isNot(tok::eof)) {		if (Tok.isNot(tok::eof)) {
Diag(Tok.getLocation(), diag::warn_pragma_extra_tokens_at_eol)		Diag(Tok.getLocation(), diag::warn_pragma_extra_tokens_at_eol)
<< PragmaLoopHintString(Info->PragmaName, Info->Option);		<< PragmaLoopHintString(Info->PragmaName, Info->Option);
while (Tok.isNot(tok::eof))		while (Tok.isNot(tok::eof))
ConsumeAnyToken();		ConsumeAnyToken();
}		}
▲ Show 20 Lines • Show All 2,423 Lines • Show Last 20 Lines
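
The parser hunk above accepts an optional `, fixed` or `, scalable` after the width expression. Below is a minimal standalone sketch of that grammar over pre-split string tokens, assuming only the `X[, fixed|scalable]` form handled in this hunk. `ParsedWidth` and `parseVectorizeWidthArgs` are hypothetical names; the sketch does not use Clang's `Preprocessor` or `Token` APIs, and it only accepts a decimal literal where the real parser accepts a constant expression.

```cpp
#include <optional>
#include <string>
#include <vector>

enum class WidthKind { Fixed, Scalable };

struct ParsedWidth {
  unsigned Width;
  WidthKind Kind;
};

// Tokens is the already-split argument list of vectorize_width(...), for
// example {"4", ",", "scalable"}. Returns std::nullopt on malformed input.
std::optional<ParsedWidth>
parseVectorizeWidthArgs(const std::vector<std::string> &Tokens) {
  if (Tokens.empty() || Tokens[0].empty())
    return std::nullopt;

  // Accept a decimal literal only (no overflow check in this sketch).
  unsigned Width = 0;
  for (char C : Tokens[0]) {
    if (C < '0' || C > '9')
      return std::nullopt;
    Width = Width * 10 + static_cast<unsigned>(C - '0');
  }

  // No second argument: default to fixed width for compatibility.
  if (Tokens.size() == 1)
    return ParsedWidth{Width, WidthKind::Fixed};

  // Otherwise require exactly: ',' followed by 'fixed' or 'scalable'.
  if (Tokens.size() != 3 || Tokens[1] != ",")
    return std::nullopt;
  if (Tokens[2] == "scalable")
    return ParsedWidth{Width, WidthKind::Scalable};
  if (Tokens[2] == "fixed")
    return ParsedWidth{Width, WidthKind::Fixed};
  return std::nullopt;
}
```

The sketch rejects a non-keyword second argument before inspecting it further. In the hunk above, the result of `Tok.getIdentifierInfo()` appears to be dereferenced without a null check, so input such as `vectorize_width(4, 8)` may be worth covering with a test.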

clang/lib/Sema/SemaStmtAttr.cpp

//===--- SemaStmtAttr.cpp - Statement Attribute Handling ------------------===//		//===--- SemaStmtAttr.cpp - Statement Attribute Handling ------------------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
// This file implements stmt-related attribute processing.		// This file implements stmt-related attribute processing.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "clang/AST/EvaluatedExprVisitor.h"		#include "clang/AST/EvaluatedExprVisitor.h"
#include "clang/Sema/SemaInternal.h"		#include "clang/Sema/SemaInternal.h"
#include "clang/AST/ASTContext.h"		#include "clang/AST/ASTContext.h"
#include "clang/Basic/SourceManager.h"		#include "clang/Basic/SourceManager.h"
		#include "clang/Basic/TargetInfo.h"
#include "clang/Sema/DelayedDiagnostic.h"		#include "clang/Sema/DelayedDiagnostic.h"
#include "clang/Sema/Lookup.h"		#include "clang/Sema/Lookup.h"
#include "clang/Sema/ScopeInfo.h"		#include "clang/Sema/ScopeInfo.h"
#include "llvm/ADT/StringExtras.h"		#include "llvm/ADT/StringExtras.h"

using namespace clang;		using namespace clang;
using namespace sema;		using namespace sema;

▲ Show 20 Lines • Show All 109 Lines • ▼ Show 20 Lines	Option = llvm::StringSwitch<LoopHintAttr::OptionType>(
.Case("interleave_count", LoopHintAttr::InterleaveCount)		.Case("interleave_count", LoopHintAttr::InterleaveCount)
.Case("unroll", LoopHintAttr::Unroll)		.Case("unroll", LoopHintAttr::Unroll)
.Case("unroll_count", LoopHintAttr::UnrollCount)		.Case("unroll_count", LoopHintAttr::UnrollCount)
.Case("pipeline", LoopHintAttr::PipelineDisabled)		.Case("pipeline", LoopHintAttr::PipelineDisabled)
.Case("pipeline_initiation_interval",		.Case("pipeline_initiation_interval",
LoopHintAttr::PipelineInitiationInterval)		LoopHintAttr::PipelineInitiationInterval)
.Case("distribute", LoopHintAttr::Distribute)		.Case("distribute", LoopHintAttr::Distribute)
.Default(LoopHintAttr::Vectorize);		.Default(LoopHintAttr::Vectorize);
if (Option == LoopHintAttr::VectorizeWidth ||		if (Option == LoopHintAttr::VectorizeWidth) {
Option == LoopHintAttr::InterleaveCount ||		assert(ValueExpr && "Attribute must have a valid value expression.");
		if (S.CheckLoopHintExpr(ValueExpr, St->getBeginLoc()))
		fhahn (Done): Is there a way to only accept `fixed_width/scalable` for targets that support it? Not sure if we have enough information here, but we might be able to reject it, e.g. on a per-target basis or something.
		david-arm (Author, Done): Hi @fhahn, I think if possible we'd prefer not to reject scalable vectors at this point. Theoretically there is no reason why we can't perform scalable vectorisation for targets that don't have hardware support for scalable vectors. In this case it simply means that vscale is 1. If you want, we could add some kind of opt-remark in the vectoriser that says something like "target does not support scalable vectors, vectorising for vscale=1"?
		sdesmalen (Not Done): I agree with @david-arm that we shouldn't prevent this in the front-end. Even if the architecture may not support scalable vectors natively, there may still be reasons to want to create scalable vectors in IR, for example to have more portable IR.
		fhahn (Not Done): Hm, I am just a bit worried that it might be a bit confusing to users that do not know what scalable vectors are (it is obvious when knowing all about SVE, but I would assume most people don't necessarily know what this means). I guess that is not the biggest deal, as long as `vectorize_width(X, scalable)` works for every target.
		> Even if the architecture may not support scalable vectors natively, there may still be reasons to want to create scalable vectors in IR, for example to have more portable IR.
		Sure, but there are so many other target-specific things encoded that make the IR really un-portable between targets. Granted, it is not impossible to convert IR between some architectures (as in arm64_32).
		sdesmalen (Not Done): Sorry, forgot to reply to this.
		> Hm, I am just a bit worried that it might be a bit confusing to users that do not know what scalable vectors are (it is obvious when knowing all about SVE, but I would assume most people don't necessarily know what this means). I guess that is not the biggest deal, as long as vectorize_width(X, scalable) works for every target.
		At the moment this feature is still experimental, so I don't think any target would be able to return `true` to the question of whether this is supported :) That said, I agree that the compiler shouldn't crash for other targets after support in the loop-vectorizer stops being experimental. So I'm changing my mind here, and am happy to go with your suggestion to ignore the flag for other targets. When some default mechanism is added to lower scalable vectors to fixed-width vectors (for targets that don't natively support them), this check can probably be removed. @david-arm can you add some target hook to ignore the hint?
		> Sure, but there are so many other target-specific things encoded that make the IR really un-portable between targets. Granted, it is not impossible to convert IR between some architectures (as in arm64_32)
		I didn't mean portable between targets, but rather keeping the length of the vector agnostic in IR and leaving it until code generation to pick a suitable/available vector extension, so that the same IR could be used to generate code for Neon or 256-bit SVE, for example. This is more of a hypothetical use case at the moment though.
		return nullptr;
		if (StateLoc && StateLoc->Ident && StateLoc->Ident->isStr("scalable")) {
		if (!S.Context.getTargetInfo().supportsScalableVectors()) {
		sdesmalen: If the target does not support scalable vectors, it currently assumes `"fixed"`. If we want to stick with that approach, the diagnostic message should be changed (see my other comment). The alternative is dropping the hint entirely by returning `nullptr` and changing the diagnostic message to say the hint is ignored. I could live with both options. @fhahn do you have a preference here?
		nit: to reduce nesting, can you hoist this out one level, e.g.
		  if (StateLoc && StateLoc->Ident & ...)
		    State = LoopHintAttr::ScalableNumeric;
		  else
		    State = LoopHintAttr::Numeric;
		  if (State == LoopHintAttr::ScalableNumeric && !S.Context.getTargetInfo().supportsScalableVectors()) {
		    S.Diag(....);
		    State = LoopHintAttr::Numeric;
		  }
		S.Diag(St->getBeginLoc(), diag::warn_pragma_attribute_scalable_unused);
		State = LoopHintAttr::Numeric;
		} else
		State = LoopHintAttr::ScalableNumeric;
		} else
		State = LoopHintAttr::Numeric;
		} else if (Option == LoopHintAttr::InterleaveCount ||
Option == LoopHintAttr::UnrollCount ||		Option == LoopHintAttr::UnrollCount ||
Option == LoopHintAttr::PipelineInitiationInterval) {		Option == LoopHintAttr::PipelineInitiationInterval) {
assert(ValueExpr && "Attribute must have a valid value expression.");		assert(ValueExpr && "Attribute must have a valid value expression.");
if (S.CheckLoopHintExpr(ValueExpr, St->getBeginLoc()))		if (S.CheckLoopHintExpr(ValueExpr, St->getBeginLoc()))
return nullptr;		return nullptr;
State = LoopHintAttr::Numeric;		State = LoopHintAttr::Numeric;
} else if (Option == LoopHintAttr::Vectorize ||		} else if (Option == LoopHintAttr::Vectorize ||
Option == LoopHintAttr::Interleave ||		Option == LoopHintAttr::Interleave ||
Option == LoopHintAttr::VectorizePredicate ||		Option == LoopHintAttr::VectorizePredicate ||
Option == LoopHintAttr::Unroll ||		Option == LoopHintAttr::Unroll ||
[299 lines not shown]
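
The hoisting suggested in the inline comment above can be illustrated outside of Clang with a minimal, self-contained sketch. Everything here is hypothetical and only stands in for the real code: `HintState` mirrors `LoopHintAttr::Numeric` / `LoopHintAttr::ScalableNumeric`, the `TargetSupportsScalable` flag stands in for `S.Context.getTargetInfo().supportsScalableVectors()`, and the `Warned` field stands in for emitting the diagnostic. It is not the code that was committed.

```cpp
#include <string>

// Hypothetical stand-in for LoopHintAttr::Numeric / ScalableNumeric.
enum class HintState { Numeric, ScalableNumeric };

// Result of resolving the optional second pragma argument.
struct Resolved {
  HintState State; // state that would be stored on the attribute
  bool Warned;     // true when the "scalable" flag had to be ignored
};

// Flattened form of the nested check: first classify the keyword,
// then fall back to fixed width if the target cannot honour it.
Resolved resolveWidthState(const std::string &Keyword,
                           bool TargetSupportsScalable) {
  HintState State = (Keyword == "scalable") ? HintState::ScalableNumeric
                                            : HintState::Numeric;
  bool Warned = false;
  if (State == HintState::ScalableNumeric && !TargetSupportsScalable) {
    Warned = true;              // the real code would call S.Diag(...) here
    State = HintState::Numeric; // treat the hint as fixed width
  }
  return {State, Warned};
}
```

With this shape, the target check appears once after classification rather than inside the "scalable" branch, which is the nesting reduction the reviewer asked for.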

clang/test/CodeGenCXX/pragma-loop.cpp

	// RUN: %clang_cc1 -triple x86_64-apple-darwin -std=c++11 -emit-llvm -o - %s | FileCheck %s			// RUN: %clang_cc1 -triple x86_64-apple-darwin -std=c++11 -emit-llvm -o - %s 2>%t | FileCheck %s
				// RUN: FileCheck --check-prefix=CHECK-SCALABLE %s < %t

	// Verify while loop is recognized after sequence of pragma clang loop directives.			// Verify while loop is recognized after sequence of pragma clang loop directives.
	void while_test(int *List, int Length) {			void while_test(int *List, int Length) {
	// CHECK: define {{.*}} @_Z10while_test			// CHECK: define {{.*}} @_Z10while_test
	int i = 0;			int i = 0;

	#pragma clang loop vectorize(enable)			#pragma clang loop vectorize(enable)
	#pragma clang loop interleave_count(4)			#pragma clang loop interleave_count(4)
	[143 lines not shown]
	void template_test(double *List, int Length) {			void template_test(double *List, int Length) {
	double Value = 10;			double Value = 10;

	for_template_test<double>(List, Length, Value);			for_template_test<double>(List, Length, Value);
	for_template_define_test<double, int>(List, Length, Value);			for_template_define_test<double, int>(List, Length, Value);
	for_template_constant_expression_test<double, 2, 4, 8>(List, Length);			for_template_constant_expression_test<double, 2, 4, 8>(List, Length);
	}			}

				// Verify for loop is performing fixed width vectorization
				void for_test_fixed(int *List, int Length) {
				#pragma clang loop vectorize_width(16, fixed) interleave_count(4) unroll(disable) distribute(disable)
				for (int i = 0; i < Length; i++) {
				// CHECK: br label {{.}}, !llvm.loop ![[LOOP_15:.]]
				List[i] = i * 2;
				sdesmalen: Out of curiosity, is there a particular reason you're testing it with a do-while loop instead of a shorter for-loop like the tests in `for_template_constant_expression_test`?
				}
				}

				// Verify for loop rejects scalable vectorization due to lack of target support
				// CHECK-SCALABLE: ignoring scalable vectorize_width flag due to lack of target support
				void for_test_scalable(int *List, int Length) {
				#pragma clang loop vectorize_width(16, scalable) interleave_count(4) unroll(disable) distribute(disable)
				for (int i = 0; i < Length; i++) {
				// CHECK: br label {{.}}, !llvm.loop ![[LOOP_16:.]]
				// CHECK-SVE: br label {{.}}, !llvm.loop ![[LOOP_16_SVE:.]]
				List[i] = i * 2;
				}
				}

	// CHECK: ![[LOOP_1]] = distinct !{![[LOOP_1]], ![[UNROLL_FULL:.*]]}			// CHECK: ![[LOOP_1]] = distinct !{![[LOOP_1]], ![[UNROLL_FULL:.*]]}
	// CHECK: ![[UNROLL_FULL]] = !{!"llvm.loop.unroll.full"}			// CHECK: ![[UNROLL_FULL]] = !{!"llvm.loop.unroll.full"}

	// CHECK: ![[LOOP_2]] = distinct !{![[LOOP_2]], ![[UNROLL_DISABLE:.]], ![[DISTRIBUTE_DISABLE:.]], ![[WIDTH_8:.]], ![[INTERLEAVE_4:.]], ![[VECTORIZE_ENABLE:.*]]}			// CHECK: ![[LOOP_2]] = distinct !{![[LOOP_2]], ![[UNROLL_DISABLE:.]], ![[DISTRIBUTE_DISABLE:.]], ![[WIDTH_8:.]], ![[INTERLEAVE_4:.]], ![[VECTORIZE_ENABLE:.*]]}
	// CHECK: ![[UNROLL_DISABLE]] = !{!"llvm.loop.unroll.disable"}			// CHECK: ![[UNROLL_DISABLE]] = !{!"llvm.loop.unroll.disable"}
	// CHECK: ![[DISTRIBUTE_DISABLE]] = !{!"llvm.loop.distribute.enable", i1 false}			// CHECK: ![[DISTRIBUTE_DISABLE]] = !{!"llvm.loop.distribute.enable", i1 false}
	// CHECK: ![[WIDTH_8]] = !{!"llvm.loop.vectorize.width", i32 8}			// CHECK: ![[WIDTH_8]] = !{!"llvm.loop.vectorize.width", i32 8}
	// CHECK: ![[INTERLEAVE_4]] = !{!"llvm.loop.interleave.count", i32 4}			// CHECK: ![[INTERLEAVE_4]] = !{!"llvm.loop.interleave.count", i32 4}
	[41 lines not shown]
	// CHECK: ![[LOOP_13]] = distinct !{![[LOOP_13]], ![[WIDTH_8:.]], ![[INTERLEAVE_16:.]], ![[VECTORIZE_ENABLE]], ![[FOLLOWUP_VECTOR_13:.*]]}			// CHECK: ![[LOOP_13]] = distinct !{![[LOOP_13]], ![[WIDTH_8:.]], ![[INTERLEAVE_16:.]], ![[VECTORIZE_ENABLE]], ![[FOLLOWUP_VECTOR_13:.*]]}
	// CHECK: ![[INTERLEAVE_16]] = !{!"llvm.loop.interleave.count", i32 16}			// CHECK: ![[INTERLEAVE_16]] = !{!"llvm.loop.interleave.count", i32 16}
	// CHECK: ![[FOLLOWUP_VECTOR_13]] = !{!"llvm.loop.vectorize.followup_all", ![[AFTER_VECTOR_13:.*]]}			// CHECK: ![[FOLLOWUP_VECTOR_13]] = !{!"llvm.loop.vectorize.followup_all", ![[AFTER_VECTOR_13:.*]]}
	// CHECK: ![[AFTER_VECTOR_13]] = distinct !{![[AFTER_VECTOR_13]], ![[ISVECTORIZED:.]], ![[UNROLL_32:.]]}			// CHECK: ![[AFTER_VECTOR_13]] = distinct !{![[AFTER_VECTOR_13]], ![[ISVECTORIZED:.]], ![[UNROLL_32:.]]}
	// CHECK: ![[UNROLL_32]] = !{!"llvm.loop.unroll.count", i32 32}			// CHECK: ![[UNROLL_32]] = !{!"llvm.loop.unroll.count", i32 32}

	// CHECK: ![[LOOP_14]] = distinct !{![[LOOP_14]], ![[WIDTH_10:.*]], ![[VECTORIZE_ENABLE]]}			// CHECK: ![[LOOP_14]] = distinct !{![[LOOP_14]], ![[WIDTH_10:.*]], ![[VECTORIZE_ENABLE]]}
	// CHECK: ![[WIDTH_10]] = !{!"llvm.loop.vectorize.width", i32 10}			// CHECK: ![[WIDTH_10]] = !{!"llvm.loop.vectorize.width", i32 10}

				// CHECK: ![[LOOP_15]] = distinct !{![[LOOP_15]], ![[UNROLL_DISABLE:.]], ![[DISTRIBUTE_DISABLE:.]], ![[WIDTH_16_FIXED:.]], ![[INTERLEAVE_4:.]], ![[VECTORIZE_ENABLE:.*]]}
				// CHECK: ![[WIDTH_16_FIXED]] = !{!"llvm.loop.vectorize.width", i32 16}

				// CHECK: ![[LOOP_16]] = distinct !{![[LOOP_16]], ![[UNROLL_DISABLE:.]], ![[DISTRIBUTE_DISABLE:.]], ![[WIDTH_16_FIXED:.]], ![[INTERLEAVE_4:.]], ![[VECTORIZE_ENABLE:.*]]}

clang/test/CodeGenCXX/pragma-scalable-loop.cpp

This file was added.

				// RUN: %clang_cc1 -triple aarch64-linux-gnu -target-feature +sve -std=c++11 -emit-llvm -o - %s | FileCheck %s

				// Verify do loop is performing scalable vectorization
				void for_test_scalable(int *List, int Length) {
				#pragma clang loop vectorize_width(16, scalable) interleave_count(4) unroll(disable) distribute(disable)
				for (int i = 0; i < Length; i++) {
				// CHECK: br label {{.}}, !llvm.loop ![[LOOP_1:.]]
				List[i] = i * 2;
				}
				}

				// CHECK: ![[LOOP_1]] = distinct !{![[LOOP_1]], ![[UNROLL_DISABLE:.]], ![[DISTRIBUTE_DISABLE:.]], ![[WIDTH_16_SCALABLE:.]], ![[INTERLEAVE_4:.]], ![[VECTORIZE_ENABLE:.*]]}
				// CHECK: ![[UNROLL_DISABLE]] = !{!"llvm.loop.unroll.disable"}
				// CHECK: ![[DISTRIBUTE_DISABLE]] = !{!"llvm.loop.distribute.enable", i1 false}
				// CHECK: ![[WIDTH_16_SCALABLE]] = !{!"llvm.loop.vectorize.width", ![[ELEMENT_COUNT_16_SCALABLE:.*]]}
				// CHECK: ![[ELEMENT_COUNT_16_SCALABLE]] = !{i32 16, i1 true}
				// CHECK: ![[INTERLEAVE_4]] = !{!"llvm.loop.interleave.count", i32 4}
				// CHECK: ![[VECTORIZE_ENABLE]] = !{!"llvm.loop.vectorize.enable", i1 true}