This is an archive of the discontinued LLVM Phabricator instance.

Differential D94883

[CodeGen][SelectionDAG]Add new intrinsic experimental.vector.reverse
ClosedPublic

Authored by CarolineConcatto on Jan 17 2021, 11:45 AM.

Details

Reviewers

sdesmalen
david-arm
paulwalker-arm
kmclaughlin
rogfer01
cameron.mcinally
fhahn

Commits

rG2d728bbff5c6: [CodeGen][SelectionDAG]Add new intrinsic experimental.vector.reverse

Summary

This patch adds a new intrinsic experimental.vector.reverse that takes a single
vector and returns a vector of matching type but with the original lane order
reversed. For example:

vector.reverse(<A,B,C,D>) ==> <D,C,B,A>

The new intrinsic supports both fixed-width and scalable vector types.
Fixed-width vectors rely on shufflevector to maintain existing behaviour.
Scalable vectors use the new ISD node VECTOR_REVERSE.

This new intrinsic is one of the named shufflevector intrinsics proposed on the
mailing-list in the RFC at [1].

Patch by Paul Walker (@paulwalker-arm).

[1] https://lists.llvm.org/pipermail/llvm-dev/2020-November/146864.html
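
To make the fixed-width path concrete, here is a minimal standalone C++ sketch. It is not code from the patch. It models the constant shuffle mask <N-1, ..., 1, 0> that reversing an N-element fixed-width vector corresponds to, and applies that mask to a plain std::vector. The names makeReverseMask and applyShuffle are hypothetical helpers invented for this illustration; they are not LLVM APIs.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical helper: build the single-source shuffle mask <N-1, ..., 1, 0>.
// The element count is read once and reused in the loop body.
std::vector<int> makeReverseMask(std::size_t NumElts) {
  std::vector<int> Mask;
  Mask.reserve(NumElts);
  for (std::size_t I = 0; I != NumElts; ++I)
    Mask.push_back(static_cast<int>(NumElts - 1 - I));
  return Mask;
}

// Hypothetical helper: apply a single-source mask, Result[i] = Src[Mask[i]].
template <typename T>
std::vector<T> applyShuffle(const std::vector<T> &Src,
                            const std::vector<int> &Mask) {
  std::vector<T> Result;
  Result.reserve(Mask.size());
  for (int Idx : Mask)
    Result.push_back(Src[static_cast<std::size_t>(Idx)]);
  return Result;
}
```

For the four-lane example in the summary, makeReverseMask(4) yields <3,2,1,0>, and applying it to <A,B,C,D> gives <D,C,B,A>. A scalable vector has no compile-time element count, so a constant mask like this cannot be written for it, which is consistent with the summary's choice of a dedicated node for that case.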

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

CarolineConcatto created this revision. Jan 17 2021, 11:45 AM

Herald added subscribers: ecnelises, jdoerfert, hiraditya. Jan 17 2021, 11:45 AM

CarolineConcatto requested review of this revision. Jan 17 2021, 11:45 AM

Herald added a project: Restricted Project. Jan 17 2021, 11:45 AM

Herald added a subscriber: llvm-commits.

CarolineConcatto added reviewers: sdesmalen, david-arm, paulwalker-arm, kmclaughlin. Jan 17 2021, 11:56 AM

Harbormaster completed remote builds in B85533: Diff 317237. Jan 17 2021, 12:24 PM

craig.topper added a subscriber: craig.topper. Jan 17 2021, 9:29 PM

craig.topper added inline comments.

llvm/include/llvm/IR/Intrinsics.td
1627	Should this use DefaultAttrsIntrinsic?
llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
99	Can we keep consistent formatting with the rest of this switch?
llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
10827	Put VT.getVectorNumElements() in a variable since it's used here and in the loop body.
llvm/lib/Target/AArch64/AArch64SVEInstrInfo.td
253	Why not put this in TargetSelectionDAG.td?

david-arm added inline comments. Jan 18 2021, 1:21 AM

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
10817	I think we can drop the "experimental" here now?
10819	Perhaps worth explaining that we're using VECTOR_SHUFFLE to implement the VECTOR_REVERSE operation for fixed length vectors? It confused me a bit at first trying to understand why we jump suddenly into VECTOR_SHUFFLE that's all.
10825	VECTOR_REVERSE?
llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp
293	I think this can just be "vector_reverse" because it's a generic opcode and not experimental?
llvm/lib/Target/AArch64/AArch64ISelLowering.cpp
5888	Just for reference, if all the lowering operation does is create the ISD::VECTOR_REVERSE node with the same arguments, then you can just mark the operation as Legal and avoid creating the LowerVECTOR_REVERSE function, as it should happen automatically. If we have plans to use SVE for fixed length vectors in future then it might be useful to keep this custom function anyway.
llvm/lib/Target/AArch64/AArch64SVEInstrInfo.td
253	Yeah, I think you need to add something to llvm/include/llvm/Target/TargetSelectionDAG.td along the lines of def vector_reverse : SDNode<"ISD::VECTOR_REVERSE", SDTypeProfile<1, 1, [SDTCisVec<0>, SDTCisSameAs<0,1>]>,[]>; because you've added a generic ISD opcode in this patch. In places where you've got patterns using "AArch64rev" you can just then use "vector_reverse" instead.

david-arm added inline comments. Jan 18 2021, 1:23 AM

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
10825	Sorry, ignore the comment above!
llvm/lib/Target/AArch64/AArch64SVEInstrInfo.td
252	nit: whitespace

CarolineConcatto added reviewers: rogfer01, cameron.mcinally. Jan 18 2021, 6:28 AM

CarolineConcatto edited the summary of this revision. Jan 19 2021, 7:56 AM

Herald added a subscriber: kristof.beyls. Jan 19 2021, 7:56 AM

-remove AArch64 custom lowering
-move SDNode vector_reverse to TargetSelectionDAG.td
-fix style on LegalizeIntegerTypes.cpp
-fix TableGen style for TargetSelectionDAG.td

CarolineConcatto added inline comments. Jan 19 2021, 8:39 AM

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
10819	@david-arm I've updated the commit message and the comments as well. Is that good?
llvm/lib/Target/AArch64/AArch64ISelLowering.cpp
5888	Thank you @david-arm for pointing that out. It is true that we can remove the custom lowering for AArch64 and mark the operation as Legal. This simplifies the code.

Matt added a subscriber: Matt. Jan 19 2021, 9:14 AM

Harbormaster completed remote builds in B85725: Diff 317582. Jan 19 2021, 9:54 AM

Harbormaster completed remote builds in B85724: Diff 317580. Jan 19 2021, 10:01 AM

Harbormaster completed remote builds in B85726: Diff 317586. Jan 19 2021, 10:18 AM

LGTM!

This revision is now accepted and ready to land. Jan 21 2021, 8:23 AM

Mirroring my comments for D94708: if the intrinsics need to support fixed vectors, it would be good to have some tests for platforms other than AArch64 and also support in GlobalISel, which is the default on AArch64 with -O0 IIRC (or do the transform to shuffles as an IR transform).

timsmith78 added a subscriber: timsmith78. Jan 21 2021, 9:23 AM

In D94883#2512711, @fhahn wrote:

Mirroring my comments for D94708: if the intrinsics need to support fixed vectors, it would be good to have some tests for platforms other than AArch64 and also support in GlobalISel, which is the default on AArch64 with -O0 IIRC (or do the transform to shuffles as an IR transform).

Hi @fhahn, I agree with your point about -O0, but I'm not sure why we need tests for other platforms? Carol has an extensive set of tests for both fixed width and scalable vectors. The lowering is identical for fixed width vectors regardless of the target so all it would be testing is the different codegen of vector shuffles.

In D94883#2512892, @david-arm wrote:

Hi @fhahn, I agree with your point about -O0, but I'm not sure why we need tests for other platforms? Carol has an extensive set of tests for both fixed width and scalable vectors. The lowering is identical for fixed width vectors regardless of the target so all it would be testing is the different codegen of vector shuffles.

I agree, such extra tests seem wasteful as I'd expect ISD::VECTOR_SHUFFLE to already be sufficiently tested for all targets.

In D94883#2515074, @paulwalker-arm wrote:

I agree, such extra tests seem wasteful as I'd expect ISD::VECTOR_SHUFFLE to already be sufficiently tested for all targets.

Given the current implementation details that's true, but the implementation could change and having a few tests for other targets guards against the implementation changing in a way that makes it only work on a certain target for example. I'm not saying that is very likely, but something like that could get missed as backends transition to GlobalISel at different times, for example.

I'm not saying we need extensive tests for all targets, just suggesting to add a small sanity test for a different commonly used backend, like X86. Adding such a test seems cheap, but please feel free to ignore the suggestion.

Taking a step back, are there plans to use the intrinsics for fixed vectors?

In D94883#2515136, @fhahn wrote:

Taking a step back, are there plans to use the intrinsics for fixed vectors?

I'd like to say yes but at this stage it is too early to say. What we definitely need is a unified interface so that transforms can be written without needing to worry about the type of vector. If a pass uses an IRBuilder then great, we can hide the "do I create a shufflevector or intrinsic" code behind suitably named functions (e.g. createVectorSplice()...). For passes that want to create a node directly then we'd recommend just creating an intrinsic call with the expectation that those working with fixed length vectors are transformed to shufflevector sufficiently early to maintain existing code quality.

Once we're at a stage where the expected shuffle optimisations apply equally well to the intrinsic variants, there's consensus on how LLVM will represent step vector like constants and ultimately the final decision on if/how shufflevector can operate on scalable vectors, we should be in a position to determine whether it is worth unifying shuffle paths (i.e. minimise the use of shufflevector).

Just an extra note to say that whilst the above is true for the intrinsics, it doesn't necessarily hold for the ISD nodes, where I believe the common case will be to rely on VECTOR_SHUFFLE to implement the vector intrinsics that operate on fixed length vectors.
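
The "do I create a shufflevector or intrinsic" decision described above can be sketched in standalone C++. This is only an illustration under stated assumptions, not the IRBuilder interface: VecType, ReversePlan and planVectorReverse are hypothetical names invented here, and the sketch models the choice as data rather than emitting any IR.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical description of a vector type: a minimum element count plus a
// flag saying whether the real count is a runtime multiple of it (scalable).
struct VecType {
  std::size_t MinElts;
  bool Scalable;
};

// Hypothetical result of a unified "create reverse" helper.
struct ReversePlan {
  bool UseIntrinsic;            // true: emit a call to the reverse intrinsic
  std::vector<int> ShuffleMask; // constant mask; empty when UseIntrinsic is set
};

ReversePlan planVectorReverse(const VecType &Ty) {
  ReversePlan Plan{Ty.Scalable, {}};
  if (!Ty.Scalable) {
    // Fixed-width: the element count is known, so a constant mask exists.
    for (std::size_t I = 0; I != Ty.MinElts; ++I)
      Plan.ShuffleMask.push_back(static_cast<int>(Ty.MinElts - 1 - I));
  }
  return Plan;
}
```

A caller of such a helper would not need to know which form was chosen, which is the point of the unified interface: the fixed-width case keeps producing a shuffle, and only the scalable case falls back to the intrinsic.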

CarolineConcatto added a child revision: D95363: [SVE][LoopVectorize] Add support for scalable vectorization of loops with vector reverse. Jan 25 2021, 7:52 AM

-add -O0 and X86 test for vector.reverse

Herald added a subscriber: pengfei. Jan 26 2021, 9:03 AM

Harbormaster completed remote builds in B86727: Diff 319320. Jan 26 2021, 9:57 AM

Thank you all for the comments.
I've added tests for -O0 on AArch64 and a fixed-width test for X86.
At the moment the X86 test has no specific target feature.

craig.topper added inline comments. Jan 27 2021, 12:49 AM

llvm/test/CodeGen/X86/named-vector-shuffles.ll
65 ↗	(On Diff #319320)	You can probably drop this. Vectors of f16 aren't legal on X86.

In D94883#2518090, @paulwalker-arm wrote:

I'd like to say yes but at this stage it is too early to say. What we definitely need is a unified interface so that transforms can be written without needing to worry about the type of vector. If a pass uses an IRBuilder then great, we can hide the "do I create a shufflevector or intrinsic" code behind suitably named functions (e.g. createVectorSplice()...). For passes that want to create a node directly then we'd recommend just creating an intrinsic call with the expectation that those working with fixed length vectors are transformed to shufflevector sufficiently early to maintain existing code quality.

Hm, making things slightly easier for passes not using IRBuilder doesn't seem like the strongest motivation to me, especially if it also comes with new pass ordering constraints (doing the conversion during instruction selection seems like it would mean we potentially miss existing folds in InstCombine & co). Also, even for passes not using IRBuilder, creating an intrinsic call without it is probably more work than just instantiating IRBuilder and using it directly?

It seems to me like only allowing them for scalable vectors initially would define away a couple of potential problems (like introducing new pass-ordering constraints, uncertainty whether to use shufflevector or the intrinsics). Personally I'd prefer one way of doing things rather than having to choose whether to use an intrinsic or a shuffle for fixed vectors. If it is uncertain whether the intrinsic will actually be used for fixed vectors, having it first restricted to the actual use case (scalable vectors) and extending it to fixed vectors once it is needed seems like a slightly more incremental/conservative approach.

But if others think having them for fixed-width vectors is useful, that's fine by me, especially because they are still experimental and we can remove fixed vector support again if it is not used.

CarolineConcatto added a child revision: D95603: [CostModel]Add cost model for experimental.vector.reverse. Jan 28 2021, 2:02 AM

-remove tests for f16 on X86 as they are not legal.

CarolineConcatto marked an inline comment as done. Jan 28 2021, 2:49 AM

CarolineConcatto added inline comments.

llvm/test/CodeGen/X86/named-vector-shuffles.ll
65 ↗	(On Diff #319320)	Thank you @craig.topper. I've removed the f16 tests on X86.

Harbormaster completed remote builds in B86993: Diff 319807. Jan 28 2021, 3:46 AM

rscottmanley added a subscriber: rscottmanley. Jan 31 2021, 11:16 AM

In D94883#2524665, @fhahn wrote:

It seems to me like only allowing them for scalable vectors initially would define away a couple of potential problems (like introducing new pass-ordering constraints, uncertainty whether to use shufflevector or the intrinsics). Personally I'd prefer one way of doing things rather than having to choose whether to use an intrinsic or a shuffle for fixed vectors. If it is uncertain whether the intrinsic will actually be used for fixed vectors, having it first restricted to the actual use case (scalable vectors) and extending it to fixed vectors once it is needed seems like a slightly more incremental/conservative approach.

But if others think having them for fixed-width vectors is useful, that's fine by me, especially because they are still experimental and we can remove fixed vector support again if it is not used.

While these intrinsics are marked experimental they probably shouldn't be created for fixed-width vectors by passes or front-ends until we have a better understanding what direction to take for shuffles. At the same time, the name doesn't suggest the intrinsic is limited to scalable-vectors only, and it is quite trivial to make this work for fixed-width vectors, so I don't see a reason not to support this. Perhaps the LangRef description can mention this consideration?

llvm/docs/LangRef.rst
16222	`nxv4i32`
16230	Maybe add: These intrinsics work for both fixed and scalable vectors. While this intrinsic is marked as `experimental` the recommended way to express reverse operations for fixed-width vectors is still to use a `shufflevector`, as that may allow for more optimization opportunities.
llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
10819–10821	nit: remove comment.
10828	nit: s/behaviours/behavior/
llvm/test/CodeGen/AArch64/named-vector-shuffles-sve.ll
2	nit: can you rename this file (and the one for neon and x86) to: `named-vector-shuffle-reverse-sve.ll`, `named-vector-shuffle-reverse-neon.ll` and `named-vector-shuffle-reverse.ll` respectively?

-change the names of the tests to named-vector-shuffle-reverse
-add text in LangRef about the use of experimental.vector.reverse

CarolineConcatto marked 2 inline comments as done. Feb 11 2021, 4:44 AM

Cheers, it looks good to me now!

Harbormaster completed remote builds in B88783: Diff 322947. Feb 11 2021, 6:04 AM

In D94883#2553696, @sdesmalen wrote:

While these intrinsics are marked experimental they probably shouldn't be created for fixed-width vectors by passes or front-ends until we have a better understanding what direction to take for shuffles. At the same time, the name doesn't suggest the intrinsic is limited to scalable-vectors only, and it is quite trivial to make this work for fixed-width vectors, so I don't see a reason not to support this. Perhaps the LangRef description can mention this consideration?

Making it explicit in the langref sounds good to me, thanks!

LGTM.

One minor issue that can be fixed when merging.

llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp
293	Stylistically this should be on the same line as the case to match all the other entries, even those with longer names.

This revision was landed with ongoing or failed builds. Feb 15 2021, 5:41 AM

Closed by commit rG2d728bbff5c6: [CodeGen][SelectionDAG]Add new intrinsic experimental.vector.reverse (authored by CarolineConcatto).

This revision was automatically updated to reflect the committed changes.

CarolineConcatto added a commit: rG2d728bbff5c6: [CodeGen][SelectionDAG]Add new intrinsic experimental.vector.reverse.

c-rhodes mentioned this in D94708: [IR] Introduce llvm.experimental.vector.splice intrinsic. Feb 18 2021, 7:13 AM

Revision Contents

Path	Size
llvm/docs/LangRef.rst	24 lines
llvm/include/llvm/CodeGen/ISDOpcodes.h	5 lines
llvm/include/llvm/IR/Intrinsics.td	6 lines
llvm/lib/Analysis/InstructionSimplify.cpp	6 lines
llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp	11 lines
llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h	2 lines
llvm/lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp	13 lines
llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.h	1 line
llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp	25 lines
llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp	2 lines
llvm/lib/CodeGen/TargetLoweringBase.cpp	3 lines
llvm/lib/Target/AArch64/AArch64ISelLowering.h	2 lines
llvm/lib/Target/AArch64/AArch64ISelLowering.cpp	14 lines
llvm/lib/Target/AArch64/AArch64SVEInstrInfo.td	4 lines
llvm/test/CodeGen/AArch64/named-vector-shuffles-neon.ll	184 lines
llvm/test/CodeGen/AArch64/named-vector-shuffles-sve.ll	180 lines
llvm/test/Transforms/InstSimplify/named-vector-shuffles.ll	12 lines

Diff 317237

llvm/docs/LangRef.rst

...
  vector length of the result type. If the result type is a scalable vector,
  ``idx`` is first scaled by the result type's runtime scaling factor. Elements
  ``idx`` through (``idx`` + num_elements(result_type) - 1) must be valid vector
  indices. If this condition cannot be determined statically but is false at
  runtime, then the result vector is undefined. The ``idx`` parameter must be a
  vector index constant type (for most targets this will be an integer pointer
  type).

+ '``llvm.experimental.vector.reverse``' Intrinsic
+ ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

+ Syntax:
+ """""""
+ This is an overloaded intrinsic.

+ ::

+       declare <2 x i8> @llvm.experimental.vector.reverse.v2i8(<2 x i8> %a)
+       declare <vscale x 4 x i32> @llvm.experimental.vector.reverse.v4i32(<vscale x 4 x i32> %a)

+ Overview:
+ """""""""

+ The '``llvm.experimental.vector.reverse.*``' intrinsics reverse a vector.
+ The intrinsic takes a single vector and returns a vector of matching type but
+ with the original lane order reversed.

+ Arguments:
+ """"""""""

+ The argument to this intrinsic must be a vector.

  Matrix Intrinsics
  -----------------

  Operations on matrixes requiring shape information (like number of rows/columns
  or the memory layout) can be expressed using the matrix intrinsics. These
  intrinsics require matrix dimensions to be passed as immediate arguments, and
  matrixes are passed and returned as vectors. This means that for a ``R`` x
  ``C`` matrix, element ``i`` of column ``j`` is at index ``j * R + i`` in the
...

llvm/include/llvm/CodeGen/ISDOpcodes.h

  //===-- llvm/CodeGen/ISDOpcodes.h - CodeGen opcodes -------------*- C++ -*-===//
  //
  // Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
  // See https://llvm.org/LICENSE.txt for license information.
  // SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
  //
  //===----------------------------------------------------------------------===//
  //
  // This file declares codegen opcodes and related utilities.
... enum NodeType {
    /// condition cannot be determined statically but is false at runtime, then
    /// the result vector is undefined. The IDX parameter must be a vector index
    /// constant type, which for most targets will be an integer pointer type.
    ///
    /// This operation supports extracting a fixed-width vector from a scalable
    /// vector, but not the other way around.
    EXTRACT_SUBVECTOR,

+   /// VECTOR_REVERSE(VECTOR) - Returns a vector, of the same type as VECTOR,
+   /// whose elements are shuffled using the following algorithm:
+   /// RESULT[i] = VECTOR[VECTOR.ElementCount - 1 - i]
+   VECTOR_REVERSE,

    /// VECTOR_SHUFFLE(VEC1, VEC2) - Returns a vector, of the same type as
    /// VEC1/VEC2. A VECTOR_SHUFFLE node also contains an array of constant int
    /// values that indicate which value (or undef) each result element will
    /// get. These constant ints are accessible through the
    /// ShuffleVectorSDNode class. This is quite similar to the Altivec
    /// 'vperm' instruction, except that the indices must be constants and are
    /// in terms of the element size of VEC1/VEC2, not in terms of bytes.
    VECTOR_SHUFFLE,
... enum NodeType {
    ADJUST_TRAMPOLINE,

    /// TRAP - Trapping instruction
    TRAP,

    /// DEBUGTRAP - Trap intended to get the attention of a debugger.
    DEBUGTRAP,

    /// UBSANTRAP - Trap with an immediate describing the kind of sanitizer failure.
    UBSANTRAP,

    /// PREFETCH - This corresponds to a prefetch intrinsic. The first operand
    /// is the chain. The other operands are the address to prefetch,
    /// read / write specifier, locality specifier and instruction / data cache
    /// specifier.
    PREFETCH,

...

llvm/include/llvm/IR/Intrinsics.td

... def int_preserve_union_access_index : DefaultAttrsIntrinsic<[llvm_anyptr_ty],
      ImmArg<ArgIndex<1>>]>;
  def int_preserve_struct_access_index : DefaultAttrsIntrinsic<[llvm_anyptr_ty],
      [llvm_anyptr_ty, llvm_i32_ty,
      llvm_i32_ty],
      [IntrNoMem,
      ImmArg<ArgIndex<1>>,
      ImmArg<ArgIndex<2>>]>;

+ //===------------ Intrinsics to perform common vector shuffles ------------===//

+ def int_experimental_vector_reverse : Intrinsic<[llvm_anyvector_ty],
+     [LLVMMatchType<0>],
+     [IntrNoMem]>;

  //===---------- Intrinsics to query properties of scalable vectors --------===//
  def int_vscale : DefaultAttrsIntrinsic<[llvm_anyint_ty], [], [IntrNoMem]>;

  //===---------- Intrinsics to perform subvector insertion/extraction ------===//
  def int_experimental_vector_insert : DefaultAttrsIntrinsic<[llvm_anyvector_ty],
      [LLVMMatchType<0>, llvm_anyvector_ty, llvm_i64_ty],
      [IntrNoMem, ImmArg<ArgIndex<2>>]>;

...

llvm/lib/Analysis/InstructionSimplify.cpp

... case Intrinsic::rint: {
      //
      // Converting from int always results in a finite integral number or
      // infinity. For either of those inputs, these rounding functions always
      // return the same value, so the rounding can be eliminated.
      if (match(Op0, m_SIToFP(m_Value())) || match(Op0, m_UIToFP(m_Value())))
        return Op0;
      break;
    }
+   case Intrinsic::experimental_vector_reverse:
+     // experimental.vector.reverse(experimental.vector.reverse(x)) -> x
+     if (match(Op0,
+               m_Intrinsic<Intrinsic::experimental_vector_reverse>(m_Value(X))))
+       return X;
+     break;
    default:
      break;
    }

    return nullptr;
  }

  static Intrinsic::ID getMaxMinOpposite(Intrinsic::ID IID) {
...
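
The fold added to InstructionSimplify.cpp above relies on reverse being its own inverse. The following standalone C++ sketch illustrates the same pattern match on a toy expression tree. It does not use LLVM's PatternMatch; Expr and simplifyDoubleReverse are hypothetical names invented for this example.

```cpp
// Toy expression node: either an opaque input or a reverse of another node.
struct Expr {
  enum Kind { Input, Reverse };
  Kind K;
  const Expr *Op; // operand of a Reverse node; nullptr for an Input
};

// Returns X when E is reverse(reverse(X)); otherwise returns nullptr to
// signal "no simplification", mirroring the shape of the hunk above.
const Expr *simplifyDoubleReverse(const Expr *E) {
  if (E->K != Expr::Reverse)
    return nullptr;
  const Expr *Inner = E->Op;
  if (Inner && Inner->K == Expr::Reverse)
    return Inner->Op;
  return nullptr;
}
```

A single reverse is left alone, since there is nothing to cancel; only the nested pair collapses to the original operand.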

llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp

Show First 20 Lines • Show All 90 Lines • ▼ Show 20 Lines	#endif
case ISD::SRL: Res = PromoteIntRes_SRL(N); break;		case ISD::SRL: Res = PromoteIntRes_SRL(N); break;
case ISD::TRUNCATE: Res = PromoteIntRes_TRUNCATE(N); break;		case ISD::TRUNCATE: Res = PromoteIntRes_TRUNCATE(N); break;
case ISD::UNDEF: Res = PromoteIntRes_UNDEF(N); break;		case ISD::UNDEF: Res = PromoteIntRes_UNDEF(N); break;
case ISD::VAARG: Res = PromoteIntRes_VAARG(N); break;		case ISD::VAARG: Res = PromoteIntRes_VAARG(N); break;
case ISD::VSCALE: Res = PromoteIntRes_VSCALE(N); break;		case ISD::VSCALE: Res = PromoteIntRes_VSCALE(N); break;

case ISD::EXTRACT_SUBVECTOR:		case ISD::EXTRACT_SUBVECTOR:
Res = PromoteIntRes_EXTRACT_SUBVECTOR(N); break;		Res = PromoteIntRes_EXTRACT_SUBVECTOR(N); break;
		case ISD::VECTOR_REVERSE:
		craig.topperUnsubmitted Done Reply Inline Actions Can we keep consistent formatting with the rest of this switch? craig.topper: Can we keep consistent formatting with the rest of this switch?
		Res = PromoteIntRes_VECTOR_REVERSE(N);
		break;
case ISD::VECTOR_SHUFFLE:		case ISD::VECTOR_SHUFFLE:
Res = PromoteIntRes_VECTOR_SHUFFLE(N); break;		Res = PromoteIntRes_VECTOR_SHUFFLE(N); break;
case ISD::INSERT_VECTOR_ELT:		case ISD::INSERT_VECTOR_ELT:
Res = PromoteIntRes_INSERT_VECTOR_ELT(N); break;		Res = PromoteIntRes_INSERT_VECTOR_ELT(N); break;
case ISD::BUILD_VECTOR:		case ISD::BUILD_VECTOR:
Res = PromoteIntRes_BUILD_VECTOR(N); break;		Res = PromoteIntRes_BUILD_VECTOR(N); break;
case ISD::SCALAR_TO_VECTOR:		case ISD::SCALAR_TO_VECTOR:
Res = PromoteIntRes_SCALAR_TO_VECTOR(N); break;		Res = PromoteIntRes_SCALAR_TO_VECTOR(N); break;
▲ Show 20 Lines • Show All 4,536 Lines • ▼ Show 20 Lines	for (unsigned i = 0; i != OutNumElems; ++i) {
SDValue Op = DAG.getAnyExtOrTrunc(Ext, dl, NOutVTElem);		SDValue Op = DAG.getAnyExtOrTrunc(Ext, dl, NOutVTElem);
// Insert the converted element to the new vector.		// Insert the converted element to the new vector.
Ops.push_back(Op);		Ops.push_back(Op);
}		}

return DAG.getBuildVector(NOutVT, dl, Ops);		return DAG.getBuildVector(NOutVT, dl, Ops);
}		}

		SDValue DAGTypeLegalizer::PromoteIntRes_VECTOR_REVERSE(SDNode *N) {
		Lint (pre-merge checks): clang-tidy: warning: invalid case style for function 'PromoteIntRes_VECTOR_REVERSE' [readability-identifier-naming]
		SDLoc dl(N);
		Lint (pre-merge checks): clang-tidy: warning: invalid case style for variable 'dl' [readability-identifier-naming]

		SDValue V0 = GetPromotedInteger(N->getOperand(0));
		EVT OutVT = V0.getValueType();

		return DAG.getNode(ISD::VECTOR_REVERSE, dl, OutVT, V0);
		}
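
Editorial sketch (not part of the patch): the promotion above looks sound because reversing lanes does not depend on element width, so reversing the promoted vector yields the same lanes as promoting the reversed vector. The standalone model below uses std::vector in place of SDValue. promoteLanes and reverseLanes are hypothetical helpers, and zero-extension is a simplifying assumption (the legalizer's promoted values leave the extra high bits unspecified).

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for integer promotion: widen every i8 lane to i32.
// Assumption: zero-extension; the real legalizer does not define the high bits.
std::vector<std::uint32_t> promoteLanes(const std::vector<std::uint8_t> &V) {
  return std::vector<std::uint32_t>(V.begin(), V.end());
}

// Reverse the lane order of a vector of any element type.
template <typename T> std::vector<T> reverseLanes(std::vector<T> V) {
  std::reverse(V.begin(), V.end());
  return V;
}
```

In this model, reverseLanes(promoteLanes(V)) and promoteLanes(reverseLanes(V)) compare equal for any input V, which is the property the one-line promotion relies on.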

SDValue DAGTypeLegalizer::PromoteIntRes_VECTOR_SHUFFLE(SDNode *N) {		SDValue DAGTypeLegalizer::PromoteIntRes_VECTOR_SHUFFLE(SDNode *N) {
ShuffleVectorSDNode *SV = cast<ShuffleVectorSDNode>(N);		ShuffleVectorSDNode *SV = cast<ShuffleVectorSDNode>(N);
EVT VT = N->getValueType(0);		EVT VT = N->getValueType(0);
SDLoc dl(N);		SDLoc dl(N);

ArrayRef<int> NewMask = SV->getMask().slice(0, VT.getVectorNumElements());		ArrayRef<int> NewMask = SV->getMask().slice(0, VT.getVectorNumElements());

llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h

Show First 20 Lines • Show All 292 Lines • ▼ Show 20 Lines	private:
void PromoteIntegerResult(SDNode *N, unsigned ResNo);		void PromoteIntegerResult(SDNode *N, unsigned ResNo);
SDValue PromoteIntRes_MERGE_VALUES(SDNode *N, unsigned ResNo);		SDValue PromoteIntRes_MERGE_VALUES(SDNode *N, unsigned ResNo);
SDValue PromoteIntRes_AssertSext(SDNode *N);		SDValue PromoteIntRes_AssertSext(SDNode *N);
SDValue PromoteIntRes_AssertZext(SDNode *N);		SDValue PromoteIntRes_AssertZext(SDNode *N);
SDValue PromoteIntRes_Atomic0(AtomicSDNode *N);		SDValue PromoteIntRes_Atomic0(AtomicSDNode *N);
SDValue PromoteIntRes_Atomic1(AtomicSDNode *N);		SDValue PromoteIntRes_Atomic1(AtomicSDNode *N);
SDValue PromoteIntRes_AtomicCmpSwap(AtomicSDNode *N, unsigned ResNo);		SDValue PromoteIntRes_AtomicCmpSwap(AtomicSDNode *N, unsigned ResNo);
SDValue PromoteIntRes_EXTRACT_SUBVECTOR(SDNode *N);		SDValue PromoteIntRes_EXTRACT_SUBVECTOR(SDNode *N);
		SDValue PromoteIntRes_VECTOR_REVERSE(SDNode *N);
		Lint (pre-merge checks): clang-tidy: warning: invalid case style for function 'PromoteIntRes_VECTOR_REVERSE' [readability-identifier-naming]
SDValue PromoteIntRes_VECTOR_SHUFFLE(SDNode *N);		SDValue PromoteIntRes_VECTOR_SHUFFLE(SDNode *N);
SDValue PromoteIntRes_BUILD_VECTOR(SDNode *N);		SDValue PromoteIntRes_BUILD_VECTOR(SDNode *N);
SDValue PromoteIntRes_SCALAR_TO_VECTOR(SDNode *N);		SDValue PromoteIntRes_SCALAR_TO_VECTOR(SDNode *N);
SDValue PromoteIntRes_SPLAT_VECTOR(SDNode *N);		SDValue PromoteIntRes_SPLAT_VECTOR(SDNode *N);
SDValue PromoteIntRes_EXTEND_VECTOR_INREG(SDNode *N);		SDValue PromoteIntRes_EXTEND_VECTOR_INREG(SDNode *N);
SDValue PromoteIntRes_INSERT_VECTOR_ELT(SDNode *N);		SDValue PromoteIntRes_INSERT_VECTOR_ELT(SDNode *N);
SDValue PromoteIntRes_CONCAT_VECTORS(SDNode *N);		SDValue PromoteIntRes_CONCAT_VECTORS(SDNode *N);
SDValue PromoteIntRes_BITCAST(SDNode *N);		SDValue PromoteIntRes_BITCAST(SDNode *N);
▲ Show 20 Lines • Show All 519 Lines • ▼ Show 20 Lines	private:
void SplitVecRes_FPOWI(SDNode *N, SDValue &Lo, SDValue &Hi);		void SplitVecRes_FPOWI(SDNode *N, SDValue &Lo, SDValue &Hi);
void SplitVecRes_FCOPYSIGN(SDNode *N, SDValue &Lo, SDValue &Hi);		void SplitVecRes_FCOPYSIGN(SDNode *N, SDValue &Lo, SDValue &Hi);
void SplitVecRes_INSERT_VECTOR_ELT(SDNode *N, SDValue &Lo, SDValue &Hi);		void SplitVecRes_INSERT_VECTOR_ELT(SDNode *N, SDValue &Lo, SDValue &Hi);
void SplitVecRes_LOAD(LoadSDNode *LD, SDValue &Lo, SDValue &Hi);		void SplitVecRes_LOAD(LoadSDNode *LD, SDValue &Lo, SDValue &Hi);
void SplitVecRes_MLOAD(MaskedLoadSDNode *MLD, SDValue &Lo, SDValue &Hi);		void SplitVecRes_MLOAD(MaskedLoadSDNode *MLD, SDValue &Lo, SDValue &Hi);
void SplitVecRes_MGATHER(MaskedGatherSDNode *MGT, SDValue &Lo, SDValue &Hi);		void SplitVecRes_MGATHER(MaskedGatherSDNode *MGT, SDValue &Lo, SDValue &Hi);
void SplitVecRes_ScalarOp(SDNode *N, SDValue &Lo, SDValue &Hi);		void SplitVecRes_ScalarOp(SDNode *N, SDValue &Lo, SDValue &Hi);
void SplitVecRes_SETCC(SDNode *N, SDValue &Lo, SDValue &Hi);		void SplitVecRes_SETCC(SDNode *N, SDValue &Lo, SDValue &Hi);
		void SplitVecRes_VECTOR_REVERSE(SDNode *N, SDValue &Lo, SDValue &Hi);
		Lint (pre-merge checks): clang-tidy: warning: invalid case style for function 'SplitVecRes_VECTOR_REVERSE' [readability-identifier-naming]
void SplitVecRes_VECTOR_SHUFFLE(ShuffleVectorSDNode *N, SDValue &Lo,		void SplitVecRes_VECTOR_SHUFFLE(ShuffleVectorSDNode *N, SDValue &Lo,
SDValue &Hi);		SDValue &Hi);
void SplitVecRes_VAARG(SDNode *N, SDValue &Lo, SDValue &Hi);		void SplitVecRes_VAARG(SDNode *N, SDValue &Lo, SDValue &Hi);
void SplitVecRes_FP_TO_XINT_SAT(SDNode *N, SDValue &Lo, SDValue &Hi);		void SplitVecRes_FP_TO_XINT_SAT(SDNode *N, SDValue &Lo, SDValue &Hi);

// Vector Operand Splitting: <128 x ty> -> 2 x <64 x ty>.		// Vector Operand Splitting: <128 x ty> -> 2 x <64 x ty>.
bool SplitVectorOperand(SDNode *N, unsigned OpNo);		bool SplitVectorOperand(SDNode *N, unsigned OpNo);
SDValue SplitVecOp_VSELECT(SDNode *N, unsigned OpNo);		SDValue SplitVecOp_VSELECT(SDNode *N, unsigned OpNo);
llvm/lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp

Show First 20 Lines • Show All 924 Lines • ▼ Show 20 Lines	case ISD::MLOAD:
SplitVecRes_MLOAD(cast<MaskedLoadSDNode>(N), Lo, Hi);		SplitVecRes_MLOAD(cast<MaskedLoadSDNode>(N), Lo, Hi);
break;		break;
case ISD::MGATHER:		case ISD::MGATHER:
SplitVecRes_MGATHER(cast<MaskedGatherSDNode>(N), Lo, Hi);		SplitVecRes_MGATHER(cast<MaskedGatherSDNode>(N), Lo, Hi);
break;		break;
case ISD::SETCC:		case ISD::SETCC:
SplitVecRes_SETCC(N, Lo, Hi);		SplitVecRes_SETCC(N, Lo, Hi);
break;		break;
		case ISD::VECTOR_REVERSE:
		SplitVecRes_VECTOR_REVERSE(N, Lo, Hi);
		break;
case ISD::VECTOR_SHUFFLE:		case ISD::VECTOR_SHUFFLE:
SplitVecRes_VECTOR_SHUFFLE(cast<ShuffleVectorSDNode>(N), Lo, Hi);		SplitVecRes_VECTOR_SHUFFLE(cast<ShuffleVectorSDNode>(N), Lo, Hi);
break;		break;
case ISD::VAARG:		case ISD::VAARG:
SplitVecRes_VAARG(N, Lo, Hi);		SplitVecRes_VAARG(N, Lo, Hi);
break;		break;

case ISD::ANY_EXTEND_VECTOR_INREG:		case ISD::ANY_EXTEND_VECTOR_INREG:
▲ Show 20 Lines • Show All 4,546 Lines • ▼ Show 20 Lines	Ops[Idx] = DAG.getNode(ISD::EXTRACT_VECTOR_ELT, dl, EltVT, InOp,
DAG.getVectorIdxConstant(Idx, dl));		DAG.getVectorIdxConstant(Idx, dl));

SDValue FillVal = FillWithZeroes ? DAG.getConstant(0, dl, EltVT) :		SDValue FillVal = FillWithZeroes ? DAG.getConstant(0, dl, EltVT) :
DAG.getUNDEF(EltVT);		DAG.getUNDEF(EltVT);
for ( ; Idx < WidenNumElts; ++Idx)		for ( ; Idx < WidenNumElts; ++Idx)
Ops[Idx] = FillVal;		Ops[Idx] = FillVal;
return DAG.getBuildVector(NVT, dl, Ops);		return DAG.getBuildVector(NVT, dl, Ops);
}		}

		void DAGTypeLegalizer::SplitVecRes_VECTOR_REVERSE(SDNode *N, SDValue &Lo,
		Lint (pre-merge checks): clang-tidy: warning: invalid case style for function 'SplitVecRes_VECTOR_REVERSE' [readability-identifier-naming]
		SDValue &Hi) {
		SDValue InLo, InHi;
		GetSplitVector(N->getOperand(0), InLo, InHi);
		SDLoc DL(N);

		Lo = DAG.getNode(ISD::VECTOR_REVERSE, DL, InHi.getValueType(), InHi);
		Hi = DAG.getNode(ISD::VECTOR_REVERSE, DL, InLo.getValueType(), InLo);
		}
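
Editorial sketch (not part of the patch): the split rule above swaps the halves and reverses each one, so the low half of the result is the reversed high half of the input and vice versa. The standalone model below uses std::vector<int> in place of SDValue; reverseLanes and splitReverse are hypothetical names chosen for illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <utility>
#include <vector>

// Reverse the lane order of one vector half.
std::vector<int> reverseLanes(std::vector<int> V) {
  std::reverse(V.begin(), V.end());
  return V;
}

// Given the low and high halves of the input, return the (Lo, Hi) halves of
// the reversed result: Lo = reverse(InHi), Hi = reverse(InLo).
std::pair<std::vector<int>, std::vector<int>>
splitReverse(const std::vector<int> &InLo, const std::vector<int> &InHi) {
  return {reverseLanes(InHi), reverseLanes(InLo)};
}
```

For an input split as {0,1,2,3} and {4,5,6,7}, the model returns {7,6,5,4} as the low half and {3,2,1,0} as the high half, which concatenates to the fully reversed vector.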

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.h

Show First 20 Lines • Show All 767 Lines • ▼ Show 20 Lines	private:
void visitStackmap(const CallInst &I);		void visitStackmap(const CallInst &I);
void visitPatchpoint(const CallBase &CB, const BasicBlock *EHPadBB = nullptr);		void visitPatchpoint(const CallBase &CB, const BasicBlock *EHPadBB = nullptr);

// These two are implemented in StatepointLowering.cpp		// These two are implemented in StatepointLowering.cpp
void visitGCRelocate(const GCRelocateInst &Relocate);		void visitGCRelocate(const GCRelocateInst &Relocate);
void visitGCResult(const GCResultInst &I);		void visitGCResult(const GCResultInst &I);

void visitVectorReduce(const CallInst &I, unsigned Intrinsic);		void visitVectorReduce(const CallInst &I, unsigned Intrinsic);
		void visitVectorReverse(const CallInst &I);

void visitUserOp1(const Instruction &I) {		void visitUserOp1(const Instruction &I) {
llvm_unreachable("UserOp1 should not exist at instruction selection time!");		llvm_unreachable("UserOp1 should not exist at instruction selection time!");
}		}
void visitUserOp2(const Instruction &I) {		void visitUserOp2(const Instruction &I) {
llvm_unreachable("UserOp2 should not exist at instruction selection time!");		llvm_unreachable("UserOp2 should not exist at instruction selection time!");
}		}

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

Show First 20 Lines • Show All 6,992 Lines • ▼ Show 20 Lines	case Intrinsic::experimental_vector_extract: {

SDValue Vec = getValue(I.getOperand(0));		SDValue Vec = getValue(I.getOperand(0));
SDValue Index = getValue(I.getOperand(1));		SDValue Index = getValue(I.getOperand(1));
EVT ResultVT = TLI.getValueType(DAG.getDataLayout(), I.getType());		EVT ResultVT = TLI.getValueType(DAG.getDataLayout(), I.getType());

setValue(&I, DAG.getNode(ISD::EXTRACT_SUBVECTOR, DL, ResultVT, Vec, Index));		setValue(&I, DAG.getNode(ISD::EXTRACT_SUBVECTOR, DL, ResultVT, Vec, Index));
return;		return;
}		}
		case Intrinsic::experimental_vector_reverse:
		visitVectorReverse(I);
		return;
}		}
}		}

void SelectionDAGBuilder::visitConstrainedFPIntrinsic(		void SelectionDAGBuilder::visitConstrainedFPIntrinsic(
const ConstrainedFPIntrinsic &FPI) {		const ConstrainedFPIntrinsic &FPI) {
SDLoc sdl = getCurSDLoc();		SDLoc sdl = getCurSDLoc();

const TargetLowering &TLI = DAG.getTargetLoweringInfo();		const TargetLowering &TLI = DAG.getTargetLoweringInfo();
▲ Show 20 Lines • Show All 3,791 Lines • ▼ Show 20 Lines	if (NumClusters > 3 && TM.getOptLevel() != CodeGenOpt::None &&
splitWorkItem(WorkList, W, SI.getCondition(), SwitchMBB);		splitWorkItem(WorkList, W, SI.getCondition(), SwitchMBB);
continue;		continue;
}		}

lowerWorkItem(W, SI.getCondition(), SwitchMBB, DefaultMBB);		lowerWorkItem(W, SI.getCondition(), SwitchMBB, DefaultMBB);
}		}
}		}

		void SelectionDAGBuilder::visitVectorReverse(const CallInst &I) {
		const TargetLowering &TLI = DAG.getTargetLoweringInfo();
		EVT VT = TLI.getValueType(DAG.getDataLayout(), I.getType());

		SDLoc DL = getCurSDLoc();
		SDValue V = getValue(I.getOperand(0));
		assert(VT == V.getValueType() && "Malformed experimental.vector.reverse!");
		david-arm (Done): I think we can drop the "experimental" here now?

		// VECTOR_SHUFFLE doesn't support a scalable mask so use a dedicated node.
		david-arm (Not Done): Perhaps worth explaining that we're using VECTOR_SHUFFLE to implement the VECTOR_REVERSE operation for fixed length vectors? It confused me a bit at first trying to understand why we jump suddenly into VECTOR_SHUFFLE that's all.
		CarolineConcatto (Author, Done): @david-arm I've updated the commit message and the comments as well. Is that good?
		if (VT.isScalableVector()) {
		setValue(&I, DAG.getNode(ISD::VECTOR_REVERSE, DL, VT, V));
		sdesmalen (Not Done): nit: remove comment.
		return;
		}

		// Use VECTOR_SHUFFLE to maintain original behaviour for fixed-length vectors.
		david-arm (Not Done): VECTOR_REVERSE?
		david-arm (Not Done): Sorry, ignore the comment above!
		SmallVector<int, 8> Mask;
		for (unsigned i = 0; i != VT.getVectorNumElements(); ++i)
		Lint (pre-merge checks): clang-tidy: warning: invalid case style for variable 'i' [readability-identifier-naming]
		craig.topper (Done): Put VT.getVectorNumElements() in a variable since it's used here and in the loop body.
		Mask.push_back(VT.getVectorNumElements() - 1 - i);
		sdesmalen (Not Done): nit: s/behaviours/behavior/

		setValue(&I, DAG.getVectorShuffle(VT, DL, V, DAG.getUNDEF(VT), Mask));
		}
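
Editorial sketch (not part of the patch): for fixed-length vectors the function above builds a shuffle mask in which result lane i reads source lane NumElts - 1 - i. The standalone model below shows that mask construction with the element count hoisted into a variable, as craig.topper suggests. buildReverseMask and applyShuffle are hypothetical helpers over std::vector, not LLVM APIs.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Build the single-source shuffle mask that reverses NumElts lanes:
// result lane I takes source lane NumElts - 1 - I.
std::vector<int> buildReverseMask(unsigned NumElts) {
  std::vector<int> Mask;
  Mask.reserve(NumElts);
  for (unsigned I = 0; I != NumElts; ++I)
    Mask.push_back(static_cast<int>(NumElts - 1 - I));
  return Mask;
}

// Apply a single-source shuffle mask: Out[I] = Src[Mask[I]].
template <typename T>
std::vector<T> applyShuffle(const std::vector<T> &Src,
                            const std::vector<int> &Mask) {
  std::vector<T> Out;
  Out.reserve(Mask.size());
  for (int Idx : Mask) {
    assert(Idx >= 0 && static_cast<std::size_t>(Idx) < Src.size() &&
           "mask index out of range");
    Out.push_back(Src[static_cast<std::size_t>(Idx)]);
  }
  return Out;
}
```

Applying buildReverseMask(4) to <A,B,C,D> in this model gives <D,C,B,A>, matching the example in the patch summary.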

void SelectionDAGBuilder::visitFreeze(const FreezeInst &I) {		void SelectionDAGBuilder::visitFreeze(const FreezeInst &I) {
SmallVector<EVT, 4> ValueVTs;		SmallVector<EVT, 4> ValueVTs;
ComputeValueVTs(DAG.getTargetLoweringInfo(), DAG.getDataLayout(), I.getType(),		ComputeValueVTs(DAG.getTargetLoweringInfo(), DAG.getDataLayout(), I.getType(),
ValueVTs);		ValueVTs);
unsigned NumValues = ValueVTs.size();		unsigned NumValues = ValueVTs.size();
if (NumValues == 0) return;		if (NumValues == 0) return;

SmallVector<SDValue, 4> Values(NumValues);		SmallVector<SDValue, 4> Values(NumValues);
llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp

Show First 20 Lines • Show All 283 Lines • ▼ Show 20 Lines	#endif
case ISD::INSERT_VECTOR_ELT: return "insert_vector_elt";		case ISD::INSERT_VECTOR_ELT: return "insert_vector_elt";
case ISD::EXTRACT_VECTOR_ELT: return "extract_vector_elt";		case ISD::EXTRACT_VECTOR_ELT: return "extract_vector_elt";
case ISD::CONCAT_VECTORS: return "concat_vectors";		case ISD::CONCAT_VECTORS: return "concat_vectors";
case ISD::INSERT_SUBVECTOR: return "insert_subvector";		case ISD::INSERT_SUBVECTOR: return "insert_subvector";
case ISD::EXTRACT_SUBVECTOR: return "extract_subvector";		case ISD::EXTRACT_SUBVECTOR: return "extract_subvector";
case ISD::SCALAR_TO_VECTOR: return "scalar_to_vector";		case ISD::SCALAR_TO_VECTOR: return "scalar_to_vector";
case ISD::VECTOR_SHUFFLE: return "vector_shuffle";		case ISD::VECTOR_SHUFFLE: return "vector_shuffle";
case ISD::SPLAT_VECTOR: return "splat_vector";		case ISD::SPLAT_VECTOR: return "splat_vector";
		case ISD::VECTOR_REVERSE:
		return "experimental_vector_reverse";
		david-arm (Done): I think this can just be "vector_reverse" because it's a generic opcode and not experimental?
		paulwalker-arm (Not Done): Stylistically this should be on the same line as the case to match all the other entries, even those with longer names.
case ISD::CARRY_FALSE: return "carry_false";		case ISD::CARRY_FALSE: return "carry_false";
case ISD::ADDC: return "addc";		case ISD::ADDC: return "addc";
case ISD::ADDE: return "adde";		case ISD::ADDE: return "adde";
case ISD::ADDCARRY: return "addcarry";		case ISD::ADDCARRY: return "addcarry";
case ISD::SADDO_CARRY: return "saddo_carry";		case ISD::SADDO_CARRY: return "saddo_carry";
case ISD::SADDO: return "saddo";		case ISD::SADDO: return "saddo";
case ISD::UADDO: return "uaddo";		case ISD::UADDO: return "uaddo";
case ISD::SSUBO: return "ssubo";		case ISD::SSUBO: return "ssubo";
llvm/lib/CodeGen/TargetLoweringBase.cpp

Show First 20 Lines • Show All 843 Lines • ▼ Show 20 Lines	#include "llvm/IR/ConstrainedOps.def"
setOperationAction(ISD::VECREDUCE_SMAX, VT, Expand);		setOperationAction(ISD::VECREDUCE_SMAX, VT, Expand);
setOperationAction(ISD::VECREDUCE_SMIN, VT, Expand);		setOperationAction(ISD::VECREDUCE_SMIN, VT, Expand);
setOperationAction(ISD::VECREDUCE_UMAX, VT, Expand);		setOperationAction(ISD::VECREDUCE_UMAX, VT, Expand);
setOperationAction(ISD::VECREDUCE_UMIN, VT, Expand);		setOperationAction(ISD::VECREDUCE_UMIN, VT, Expand);
setOperationAction(ISD::VECREDUCE_FMAX, VT, Expand);		setOperationAction(ISD::VECREDUCE_FMAX, VT, Expand);
setOperationAction(ISD::VECREDUCE_FMIN, VT, Expand);		setOperationAction(ISD::VECREDUCE_FMIN, VT, Expand);
setOperationAction(ISD::VECREDUCE_SEQ_FADD, VT, Expand);		setOperationAction(ISD::VECREDUCE_SEQ_FADD, VT, Expand);
setOperationAction(ISD::VECREDUCE_SEQ_FMUL, VT, Expand);		setOperationAction(ISD::VECREDUCE_SEQ_FMUL, VT, Expand);

		// Named vector shuffles default to expand.
		setOperationAction(ISD::VECTOR_REVERSE, VT, Expand);
}		}

// Most targets ignore the @llvm.prefetch intrinsic.		// Most targets ignore the @llvm.prefetch intrinsic.
setOperationAction(ISD::PREFETCH, MVT::Other, Expand);		setOperationAction(ISD::PREFETCH, MVT::Other, Expand);

// Most targets also ignore the @llvm.readcyclecounter intrinsic.		// Most targets also ignore the @llvm.readcyclecounter intrinsic.
setOperationAction(ISD::READCYCLECOUNTER, MVT::i64, Expand);		setOperationAction(ISD::READCYCLECOUNTER, MVT::i64, Expand);

llvm/lib/Target/AArch64/AArch64ISelLowering.h

Show First 20 Lines • Show All 286 Lines • ▼ Show 20 Lines	enum NodeType : unsigned {
SUNPKLO,		SUNPKLO,
UUNPKHI,		UUNPKHI,
UUNPKLO,		UUNPKLO,

CLASTA_N,		CLASTA_N,
CLASTB_N,		CLASTB_N,
LASTA,		LASTA,
LASTB,		LASTB,
REV,
TBL,		TBL,

// Floating-point reductions.		// Floating-point reductions.
FADDA_PRED,		FADDA_PRED,
FADDV_PRED,		FADDV_PRED,
FMAXV_PRED,		FMAXV_PRED,
FMAXNMV_PRED,		FMAXNMV_PRED,
FMINV_PRED,		FMINV_PRED,
▲ Show 20 Lines • Show All 592 Lines • ▼ Show 20 Lines	private:
SDValue LowerSPONENTRY(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerSPONENTRY(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerRETURNADDR(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerRETURNADDR(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerFLT_ROUNDS_(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerFLT_ROUNDS_(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerINSERT_VECTOR_ELT(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerINSERT_VECTOR_ELT(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerEXTRACT_VECTOR_ELT(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerEXTRACT_VECTOR_ELT(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerSCALAR_TO_VECTOR(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerSCALAR_TO_VECTOR(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerBUILD_VECTOR(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerBUILD_VECTOR(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerVECTOR_SHUFFLE(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerVECTOR_SHUFFLE(SDValue Op, SelectionDAG &DAG) const;
		SDValue LowerVECTOR_REVERSE(SDValue Op, SelectionDAG &DAG) const;
		Lint (pre-merge checks): clang-tidy: warning: invalid case style for function 'LowerVECTOR_REVERSE' [readability-identifier-naming]
SDValue LowerSPLAT_VECTOR(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerSPLAT_VECTOR(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerDUPQLane(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerDUPQLane(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerToPredicatedOp(SDValue Op, SelectionDAG &DAG, unsigned NewOp,		SDValue LowerToPredicatedOp(SDValue Op, SelectionDAG &DAG, unsigned NewOp,
bool OverrideNEON = false) const;		bool OverrideNEON = false) const;
SDValue LowerToScalableOp(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerToScalableOp(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerEXTRACT_SUBVECTOR(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerEXTRACT_SUBVECTOR(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerINSERT_SUBVECTOR(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerINSERT_SUBVECTOR(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerDIV(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerDIV(SDValue Op, SelectionDAG &DAG) const;
llvm/lib/Target/AArch64/AArch64ISelLowering.cpp

Show First 20 Lines • Show All 1,094 Lines • ▼ Show 20 Lines	if (Subtarget->hasSVE()) {
// D68877 for more details.		// D68877 for more details.
for (auto VT : {MVT::nxv16i8, MVT::nxv8i16, MVT::nxv4i32, MVT::nxv2i64}) {		for (auto VT : {MVT::nxv16i8, MVT::nxv8i16, MVT::nxv4i32, MVT::nxv2i64}) {
setOperationAction(ISD::BITREVERSE, VT, Custom);		setOperationAction(ISD::BITREVERSE, VT, Custom);
setOperationAction(ISD::BSWAP, VT, Custom);		setOperationAction(ISD::BSWAP, VT, Custom);
setOperationAction(ISD::CTLZ, VT, Custom);		setOperationAction(ISD::CTLZ, VT, Custom);
setOperationAction(ISD::CTPOP, VT, Custom);		setOperationAction(ISD::CTPOP, VT, Custom);
setOperationAction(ISD::CTTZ, VT, Custom);		setOperationAction(ISD::CTTZ, VT, Custom);
setOperationAction(ISD::INSERT_SUBVECTOR, VT, Custom);		setOperationAction(ISD::INSERT_SUBVECTOR, VT, Custom);
		setOperationAction(ISD::VECTOR_REVERSE, VT, Custom);
setOperationAction(ISD::UINT_TO_FP, VT, Custom);		setOperationAction(ISD::UINT_TO_FP, VT, Custom);
setOperationAction(ISD::SINT_TO_FP, VT, Custom);		setOperationAction(ISD::SINT_TO_FP, VT, Custom);
setOperationAction(ISD::FP_TO_UINT, VT, Custom);		setOperationAction(ISD::FP_TO_UINT, VT, Custom);
setOperationAction(ISD::FP_TO_SINT, VT, Custom);		setOperationAction(ISD::FP_TO_SINT, VT, Custom);
setOperationAction(ISD::MGATHER, VT, Custom);		setOperationAction(ISD::MGATHER, VT, Custom);
setOperationAction(ISD::MSCATTER, VT, Custom);		setOperationAction(ISD::MSCATTER, VT, Custom);
setOperationAction(ISD::MUL, VT, Custom);		setOperationAction(ISD::MUL, VT, Custom);
setOperationAction(ISD::SPLAT_VECTOR, VT, Custom);		setOperationAction(ISD::SPLAT_VECTOR, VT, Custom);
Show All 23 Lines	for (auto VT : {MVT::nxv8i8, MVT::nxv4i16, MVT::nxv2i32}) {
setOperationAction(ISD::EXTRACT_SUBVECTOR, VT, Custom);		setOperationAction(ISD::EXTRACT_SUBVECTOR, VT, Custom);
setOperationAction(ISD::INSERT_SUBVECTOR, VT, Custom);		setOperationAction(ISD::INSERT_SUBVECTOR, VT, Custom);
}		}

for (auto VT : {MVT::nxv16i1, MVT::nxv8i1, MVT::nxv4i1, MVT::nxv2i1}) {		for (auto VT : {MVT::nxv16i1, MVT::nxv8i1, MVT::nxv4i1, MVT::nxv2i1}) {
setOperationAction(ISD::CONCAT_VECTORS, VT, Custom);		setOperationAction(ISD::CONCAT_VECTORS, VT, Custom);
setOperationAction(ISD::SELECT, VT, Custom);		setOperationAction(ISD::SELECT, VT, Custom);
setOperationAction(ISD::SETCC, VT, Custom);		setOperationAction(ISD::SETCC, VT, Custom);
		setOperationAction(ISD::VECTOR_REVERSE, VT, Custom);
setOperationAction(ISD::SPLAT_VECTOR, VT, Custom);		setOperationAction(ISD::SPLAT_VECTOR, VT, Custom);
setOperationAction(ISD::TRUNCATE, VT, Custom);		setOperationAction(ISD::TRUNCATE, VT, Custom);
setOperationAction(ISD::VECREDUCE_AND, VT, Custom);		setOperationAction(ISD::VECREDUCE_AND, VT, Custom);
setOperationAction(ISD::VECREDUCE_OR, VT, Custom);		setOperationAction(ISD::VECREDUCE_OR, VT, Custom);
setOperationAction(ISD::VECREDUCE_XOR, VT, Custom);		setOperationAction(ISD::VECREDUCE_XOR, VT, Custom);

// There are no legal MVT::nxv16f## based types.		// There are no legal MVT::nxv16f## based types.
if (VT != MVT::nxv16i1) {		if (VT != MVT::nxv16i1) {
Show All 28 Lines	for (auto VT : {MVT::nxv2f16, MVT::nxv4f16, MVT::nxv8f16, MVT::nxv2f32,
setOperationAction(ISD::FSQRT, VT, Custom);		setOperationAction(ISD::FSQRT, VT, Custom);
setOperationAction(ISD::FABS, VT, Custom);		setOperationAction(ISD::FABS, VT, Custom);
setOperationAction(ISD::FP_EXTEND, VT, Custom);		setOperationAction(ISD::FP_EXTEND, VT, Custom);
setOperationAction(ISD::FP_ROUND, VT, Custom);		setOperationAction(ISD::FP_ROUND, VT, Custom);
setOperationAction(ISD::VECREDUCE_FADD, VT, Custom);		setOperationAction(ISD::VECREDUCE_FADD, VT, Custom);
setOperationAction(ISD::VECREDUCE_FMAX, VT, Custom);		setOperationAction(ISD::VECREDUCE_FMAX, VT, Custom);
setOperationAction(ISD::VECREDUCE_FMIN, VT, Custom);		setOperationAction(ISD::VECREDUCE_FMIN, VT, Custom);
setOperationAction(ISD::VECREDUCE_SEQ_FADD, VT, Custom);		setOperationAction(ISD::VECREDUCE_SEQ_FADD, VT, Custom);
		setOperationAction(ISD::VECTOR_REVERSE, VT, Custom);
}		}

for (auto VT : {MVT::nxv2bf16, MVT::nxv4bf16, MVT::nxv8bf16}) {		for (auto VT : {MVT::nxv2bf16, MVT::nxv4bf16, MVT::nxv8bf16}) {
setOperationAction(ISD::CONCAT_VECTORS, VT, Custom);		setOperationAction(ISD::CONCAT_VECTORS, VT, Custom);
setOperationAction(ISD::MGATHER, VT, Custom);		setOperationAction(ISD::MGATHER, VT, Custom);
setOperationAction(ISD::MSCATTER, VT, Custom);		setOperationAction(ISD::MSCATTER, VT, Custom);
}		}

▲ Show 20 Lines • Show All 657 Lines • ▼ Show 20 Lines	case AArch64ISD::FIRST_NUMBER:
MAKE_CASE(AArch64ISD::UMINV_PRED)		MAKE_CASE(AArch64ISD::UMINV_PRED)
MAKE_CASE(AArch64ISD::ORV_PRED)		MAKE_CASE(AArch64ISD::ORV_PRED)
MAKE_CASE(AArch64ISD::EORV_PRED)		MAKE_CASE(AArch64ISD::EORV_PRED)
MAKE_CASE(AArch64ISD::ANDV_PRED)		MAKE_CASE(AArch64ISD::ANDV_PRED)
MAKE_CASE(AArch64ISD::CLASTA_N)		MAKE_CASE(AArch64ISD::CLASTA_N)
MAKE_CASE(AArch64ISD::CLASTB_N)		MAKE_CASE(AArch64ISD::CLASTB_N)
MAKE_CASE(AArch64ISD::LASTA)		MAKE_CASE(AArch64ISD::LASTA)
MAKE_CASE(AArch64ISD::LASTB)		MAKE_CASE(AArch64ISD::LASTB)
MAKE_CASE(AArch64ISD::REV)
MAKE_CASE(AArch64ISD::REINTERPRET_CAST)		MAKE_CASE(AArch64ISD::REINTERPRET_CAST)
MAKE_CASE(AArch64ISD::TBL)		MAKE_CASE(AArch64ISD::TBL)
MAKE_CASE(AArch64ISD::FADD_PRED)		MAKE_CASE(AArch64ISD::FADD_PRED)
MAKE_CASE(AArch64ISD::FADDA_PRED)		MAKE_CASE(AArch64ISD::FADDA_PRED)
MAKE_CASE(AArch64ISD::FADDV_PRED)		MAKE_CASE(AArch64ISD::FADDV_PRED)
MAKE_CASE(AArch64ISD::FDIV_PRED)		MAKE_CASE(AArch64ISD::FDIV_PRED)
MAKE_CASE(AArch64ISD::FMA_PRED)		MAKE_CASE(AArch64ISD::FMA_PRED)
MAKE_CASE(AArch64ISD::FMAXV_PRED)		MAKE_CASE(AArch64ISD::FMAXV_PRED)
▲ Show 20 Lines • Show All 1,723 Lines • ▼ Show 20 Lines	return DAG.getNode(AArch64ISD::CLASTB_N, dl, Op.getValueType(),
Op.getOperand(1), Op.getOperand(2), Op.getOperand(3));		Op.getOperand(1), Op.getOperand(2), Op.getOperand(3));
case Intrinsic::aarch64_sve_lasta:		case Intrinsic::aarch64_sve_lasta:
return DAG.getNode(AArch64ISD::LASTA, dl, Op.getValueType(),		return DAG.getNode(AArch64ISD::LASTA, dl, Op.getValueType(),
Op.getOperand(1), Op.getOperand(2));		Op.getOperand(1), Op.getOperand(2));
case Intrinsic::aarch64_sve_lastb:		case Intrinsic::aarch64_sve_lastb:
return DAG.getNode(AArch64ISD::LASTB, dl, Op.getValueType(),		return DAG.getNode(AArch64ISD::LASTB, dl, Op.getValueType(),
Op.getOperand(1), Op.getOperand(2));		Op.getOperand(1), Op.getOperand(2));
case Intrinsic::aarch64_sve_rev:		case Intrinsic::aarch64_sve_rev:
return DAG.getNode(AArch64ISD::REV, dl, Op.getValueType(),		return DAG.getNode(ISD::VECTOR_REVERSE, dl, Op.getValueType(),
Op.getOperand(1));		Op.getOperand(1));
case Intrinsic::aarch64_sve_tbl:		case Intrinsic::aarch64_sve_tbl:
return DAG.getNode(AArch64ISD::TBL, dl, Op.getValueType(),		return DAG.getNode(AArch64ISD::TBL, dl, Op.getValueType(),
Op.getOperand(1), Op.getOperand(2));		Op.getOperand(1), Op.getOperand(2));
case Intrinsic::aarch64_sve_trn1:		case Intrinsic::aarch64_sve_trn1:
return DAG.getNode(AArch64ISD::TRN1, dl, Op.getValueType(),		return DAG.getNode(AArch64ISD::TRN1, dl, Op.getValueType(),
Op.getOperand(1), Op.getOperand(2));		Op.getOperand(1), Op.getOperand(2));
case Intrinsic::aarch64_sve_trn2:		case Intrinsic::aarch64_sve_trn2:
▲ Show 20 Lines • Show All 784 Lines • ▼ Show 20 Lines	SDValue AArch64TargetLowering::LowerOperation(SDValue Op,
case ISD::VECREDUCE_FMIN:		case ISD::VECREDUCE_FMIN:
return LowerVECREDUCE(Op, DAG);		return LowerVECREDUCE(Op, DAG);
case ISD::ATOMIC_LOAD_SUB:		case ISD::ATOMIC_LOAD_SUB:
return LowerATOMIC_LOAD_SUB(Op, DAG);		return LowerATOMIC_LOAD_SUB(Op, DAG);
case ISD::ATOMIC_LOAD_AND:		case ISD::ATOMIC_LOAD_AND:
return LowerATOMIC_LOAD_AND(Op, DAG);		return LowerATOMIC_LOAD_AND(Op, DAG);
case ISD::DYNAMIC_STACKALLOC:		case ISD::DYNAMIC_STACKALLOC:
return LowerDYNAMIC_STACKALLOC(Op, DAG);		return LowerDYNAMIC_STACKALLOC(Op, DAG);
		case ISD::VECTOR_REVERSE:
		return LowerVECTOR_REVERSE(Op, DAG);
case ISD::VSCALE:		case ISD::VSCALE:
return LowerVSCALE(Op, DAG);		return LowerVSCALE(Op, DAG);
case ISD::ANY_EXTEND:		case ISD::ANY_EXTEND:
case ISD::SIGN_EXTEND:		case ISD::SIGN_EXTEND:
case ISD::ZERO_EXTEND:		case ISD::ZERO_EXTEND:
return LowerFixedLengthVectorIntExtendToSVE(Op, DAG);		return LowerFixedLengthVectorIntExtendToSVE(Op, DAG);
case ISD::SIGN_EXTEND_INREG: {		case ISD::SIGN_EXTEND_INREG: {
// Only custom lower when ExtraVT has a legal byte based element type.		// Only custom lower when ExtraVT has a legal byte based element type.
▲ Show 20 Lines • Show All 1,468 Lines • ▼ Show 20 Lines	SDValue AArch64TargetLowering::getAddrTiny(NodeTy *N, SelectionDAG &DAG,
unsigned Flags) const {		unsigned Flags) const {
LLVM_DEBUG(dbgs() << "AArch64TargetLowering::getAddrTiny\n");		LLVM_DEBUG(dbgs() << "AArch64TargetLowering::getAddrTiny\n");
SDLoc DL(N);		SDLoc DL(N);
EVT Ty = getPointerTy(DAG.getDataLayout());		EVT Ty = getPointerTy(DAG.getDataLayout());
SDValue Sym = getTargetNode(N, Ty, DAG, Flags);		SDValue Sym = getTargetNode(N, Ty, DAG, Flags);
return DAG.getNode(AArch64ISD::ADR, DL, Ty, Sym);		return DAG.getNode(AArch64ISD::ADR, DL, Ty, Sym);
}		}

		SDValue AArch64TargetLowering::LowerVECTOR_REVERSE(SDValue Op,
		Lint (pre-merge checks): clang-tidy: warning: invalid case style for function 'LowerVECTOR_REVERSE' [readability-identifier-naming]
		david-arm (Done): Just for reference, if all the lowering operation does is create the ISD::VECTOR_REVERSE with the same arguments then you can actually just mark the operation as Legal and avoid creating the LowerVECTOR_REVERSE function as it should happen automatically. If we have plans to use SVE for fixed length vectors in future then it might be useful to keep this custom function anyway.
		CarolineConcatto (Author, Done): Thank you @david-arm for pointing that out. It is true that we can remove the custom lowering for aarch64 and have it as legal. This simplifies the code.
		SelectionDAG &DAG) const {
		return DAG.getNode(ISD::VECTOR_REVERSE, SDLoc(Op), Op.getValueType(),
		Op.getOperand(0));
		}
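
Editorial sketch (not part of the patch): david-arm's point is that a Custom action routes the node through the target's lowering hook, while a Legal action passes it straight to instruction selection, so a hook that only rebuilds the same node is redundant. The toy table below illustrates that distinction. ToyAction and ToyActionTable are hypothetical stand-ins, not the LLVM TargetLowering API, and the Expand fallback only mirrors the default this patch sets for VECTOR_REVERSE.

```cpp
#include <cassert>
#include <map>
#include <string>

// Toy stand-ins for the legalizer's per-operation actions (not the LLVM API).
enum class ToyAction { Legal, Expand, Custom };

class ToyActionTable {
  std::map<std::string, ToyAction> Actions;

public:
  // Record how a hypothetical target wants an operation handled.
  void setAction(const std::string &Op, ToyAction A) { Actions[Op] = A; }

  // Operations with no entry fall back to Expand in this toy model.
  ToyAction getAction(const std::string &Op) const {
    auto It = Actions.find(Op);
    return It == Actions.end() ? ToyAction::Expand : It->second;
  }

  // Only Custom operations are routed through a target lowering hook.
  bool callsLoweringHook(const std::string &Op) const {
    return getAction(Op) == ToyAction::Custom;
  }
};
```

In this model, switching "vector_reverse" from Custom to Legal makes callsLoweringHook return false, which corresponds to dropping the LowerVECTOR_REVERSE function.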

SDValue AArch64TargetLowering::LowerGlobalAddress(SDValue Op,		SDValue AArch64TargetLowering::LowerGlobalAddress(SDValue Op,
SelectionDAG &DAG) const {		SelectionDAG &DAG) const {
GlobalAddressSDNode *GN = cast<GlobalAddressSDNode>(Op);		GlobalAddressSDNode *GN = cast<GlobalAddressSDNode>(Op);
const GlobalValue *GV = GN->getGlobal();		const GlobalValue *GV = GN->getGlobal();
unsigned OpFlags = Subtarget->ClassifyGlobalReference(GV, getTargetMachine());		unsigned OpFlags = Subtarget->ClassifyGlobalReference(GV, getTargetMachine());

if (OpFlags != AArch64II::MO_NO_FLAG)		if (OpFlags != AArch64II::MO_NO_FLAG)
assert(cast<GlobalAddressSDNode>(Op)->getOffset() == 0 &&		assert(cast<GlobalAddressSDNode>(Op)->getOffset() == 0 &&

llvm/lib/Target/AArch64/AArch64SVEInstrInfo.td

	def AArch64fcvtzu_mt : SDNode<"AArch64ISD::FCVTZU_MERGE_PASSTHRU", SDT_AArch64FCVT>;			def AArch64fcvtzu_mt : SDNode<"AArch64ISD::FCVTZU_MERGE_PASSTHRU", SDT_AArch64FCVT>;
	def AArch64fcvtzs_mt : SDNode<"AArch64ISD::FCVTZS_MERGE_PASSTHRU", SDT_AArch64FCVT>;			def AArch64fcvtzs_mt : SDNode<"AArch64ISD::FCVTZS_MERGE_PASSTHRU", SDT_AArch64FCVT>;

	def SDT_AArch64ReduceWithInit : SDTypeProfile<1, 3, [SDTCisVec<1>, SDTCisVec<3>]>;			def SDT_AArch64ReduceWithInit : SDTypeProfile<1, 3, [SDTCisVec<1>, SDTCisVec<3>]>;
	def AArch64clasta_n : SDNode<"AArch64ISD::CLASTA_N", SDT_AArch64ReduceWithInit>;			def AArch64clasta_n : SDNode<"AArch64ISD::CLASTA_N", SDT_AArch64ReduceWithInit>;
	def AArch64clastb_n : SDNode<"AArch64ISD::CLASTB_N", SDT_AArch64ReduceWithInit>;			def AArch64clastb_n : SDNode<"AArch64ISD::CLASTB_N", SDT_AArch64ReduceWithInit>;
	def AArch64fadda_p : SDNode<"AArch64ISD::FADDA_PRED", SDT_AArch64ReduceWithInit>;			def AArch64fadda_p : SDNode<"AArch64ISD::FADDA_PRED", SDT_AArch64ReduceWithInit>;


				david-arm (Done): nit: whitespace
	def SDT_AArch64Rev : SDTypeProfile<1, 1, [SDTCisVec<0>, SDTCisSameAs<0,1>]>;			def SDT_AArch64Rev : SDTypeProfile<1, 1, [SDTCisVec<0>, SDTCisSameAs<0,1>]>;
				craig.topper (Done): Why not put this in TargetSelectionDAG.td?
				david-arm (Done): Yeah, I think you need to add something to llvm/include/llvm/Target/TargetSelectionDAG.td along the lines of `def vector_reverse : SDNode<"ISD::VECTOR_REVERSE", SDTypeProfile<1, 1, [SDTCisVec<0>, SDTCisSameAs<0,1>]>,[]>;` because you've added a generic ISD opcode in this patch. In places where you've got patterns using "AArch64rev", you can then just use "vector_reverse" instead.
	def AArch64rev : SDNode<"AArch64ISD::REV", SDT_AArch64Rev>;			def AArch64rev : SDNode<"ISD::VECTOR_REVERSE", SDT_AArch64Rev>;


	def SDT_AArch64PTest : SDTypeProfile<0, 2, [SDTCisVec<0>, SDTCisSameAs<0,1>]>;			def SDT_AArch64PTest : SDTypeProfile<0, 2, [SDTCisVec<0>, SDTCisSameAs<0,1>]>;
	def AArch64ptest : SDNode<"AArch64ISD::PTEST", SDT_AArch64PTest>;			def AArch64ptest : SDNode<"AArch64ISD::PTEST", SDT_AArch64PTest>;

	def SDT_AArch64DUP_PRED : SDTypeProfile<1, 3, [SDTCisVec<0>, SDTCisSameAs<0, 3>, SDTCisVec<1>, SDTCVecEltisVT<1,i1>]>;			def SDT_AArch64DUP_PRED : SDTypeProfile<1, 3, [SDTCisVec<0>, SDTCisSameAs<0, 3>, SDTCisVec<1>, SDTCVecEltisVT<1,i1>]>;
	def AArch64dup_mt : SDNode<"AArch64ISD::DUP_MERGE_PASSTHRU", SDT_AArch64DUP_PRED>;			def AArch64dup_mt : SDNode<"AArch64ISD::DUP_MERGE_PASSTHRU", SDT_AArch64DUP_PRED>;

	def SDT_IndexVector : SDTypeProfile<1, 2, [SDTCisVec<0>, SDTCisSameAs<1, 2>, SDTCisInt<2>]>;			def SDT_IndexVector : SDTypeProfile<1, 2, [SDTCisVec<0>, SDTCisSameAs<1, 2>, SDTCisInt<2>]>;

llvm/test/CodeGen/AArch64/named-vector-shuffles-neon.ll

This file was added.

				; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
				; RUN: llc -verify-machineinstrs < %s | FileCheck %s

				target triple = "aarch64-unknown-linux-gnu"

				;
				; VECTOR_REVERSE
				;

				define <16 x i8> @reverse_v16i8(<16 x i8> %a) #0 {
				; CHECK-LABEL: .LCPI0_0:
				; CHECK: .byte 15 // 0xf
				; CHECK-NEXT: .byte 14 // 0xe
				; CHECK-NEXT: .byte 13 // 0xd
				; CHECK-NEXT: .byte 12 // 0xc
				; CHECK-NEXT: .byte 11 // 0xb
				; CHECK-NEXT: .byte 10 // 0xa
				; CHECK-NEXT: .byte 9 // 0x9
				; CHECK-NEXT: .byte 8 // 0x8
				; CHECK-NEXT: .byte 7 // 0x7
				; CHECK-NEXT: .byte 6 // 0x6
				; CHECK-NEXT: .byte 5 // 0x5
				; CHECK-NEXT: .byte 4 // 0x4
				; CHECK-NEXT: .byte 3 // 0x3
				; CHECK-NEXT: .byte 2 // 0x2
				; CHECK-NEXT: .byte 1 // 0x1
				; CHECK-NEXT: .byte 0 // 0x0
				; CHECK-LABEL: reverse_v16i8:
				; CHECK: // %bb.0:
				; CHECK-NEXT: adrp x8, .LCPI0_0
				; CHECK-NEXT: ldr q1, [x8, :lo12:.LCPI0_0]
				; CHECK-NEXT: tbl v0.16b, { v0.16b }, v1.16b
				; CHECK-NEXT: ret
				%res = call <16 x i8> @llvm.experimental.vector.reverse.v16i8(<16 x i8> %a)
				ret <16 x i8> %res
				}

				define <8 x i16> @reverse_v8i16(<8 x i16> %a) #0 {
				; CHECK-LABEL: .LCPI1_0:
				; CHECK: .byte 14 // 0xe
				; CHECK-NEXT: .byte 15 // 0xf
				; CHECK-NEXT: .byte 12 // 0xc
				; CHECK-NEXT: .byte 13 // 0xd
				; CHECK-NEXT: .byte 10 // 0xa
				; CHECK-NEXT: .byte 11 // 0xb
				; CHECK-NEXT: .byte 8 // 0x8
				; CHECK-NEXT: .byte 9 // 0x9
				; CHECK-NEXT: .byte 6 // 0x6
				; CHECK-NEXT: .byte 7 // 0x7
				; CHECK-NEXT: .byte 4 // 0x4
				; CHECK-NEXT: .byte 5 // 0x5
				; CHECK-NEXT: .byte 2 // 0x2
				; CHECK-NEXT: .byte 3 // 0x3
				; CHECK-NEXT: .byte 0 // 0x0
				; CHECK-NEXT: .byte 1 // 0x1
				; CHECK-LABEL: reverse_v8i16:
				; CHECK: // %bb.0:
				; CHECK-NEXT: adrp x8, .LCPI1_0
				; CHECK-NEXT: ldr q1, [x8, :lo12:.LCPI1_0]
				; CHECK-NEXT: tbl v0.16b, { v0.16b }, v1.16b
				; CHECK-NEXT: ret
				%res = call <8 x i16> @llvm.experimental.vector.reverse.v8i16(<8 x i16> %a)
				ret <8 x i16> %res
				}
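Note on the `.LCPI` tables above: the `tbl` instruction selects bytes by index, so reversing lanes wider than one byte needs a table that flips lane order while keeping the byte order inside each lane. The sketch below shows one way such a table could be computed; `reverseTblMask` is a hypothetical helper for illustration, not the code LLVM uses to build the constant.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Build the byte-index table that reverses `numElts` lanes of `eltBytes`
// bytes each. Byte order inside a lane is preserved; only lane order flips.
std::vector<std::uint8_t> reverseTblMask(unsigned numElts, unsigned eltBytes) {
  assert(numElts > 0 && eltBytes > 0 && numElts * eltBytes <= 256);
  std::vector<std::uint8_t> mask;
  mask.reserve(numElts * eltBytes);
  for (unsigned lane = 0; lane < numElts; ++lane) {
    const unsigned srcLane = numElts - 1 - lane; // lane read for this output lane
    for (unsigned b = 0; b < eltBytes; ++b)
      mask.push_back(static_cast<std::uint8_t>(srcLane * eltBytes + b));
  }
  return mask;
}
```

`reverseTblMask(16, 1)` gives 15, 14, ..., 0 as in `.LCPI0_0`, and `reverseTblMask(8, 2)` gives 14, 15, 12, 13, ..., 0, 1 as in `.LCPI1_0`.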

				define <4 x i32> @reverse_v4i32(<4 x i32> %a) #0 {
				; CHECK-LABEL: reverse_v4i32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: rev64 v0.4s, v0.4s
				; CHECK-NEXT: ext v0.16b, v0.16b, v0.16b, #8
				; CHECK-NEXT: ret
				%res = call <4 x i32> @llvm.experimental.vector.reverse.v4i32(<4 x i32> %a)
				ret <4 x i32> %res
				}
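Note on the `rev64` + `ext` pair above: `rev64 v0.4s` reverses the 32-bit elements within each 64-bit half, and `ext ..., #8` with both sources equal swaps the two halves, which together give a full lane reversal. The following is a minimal scalar model of that sequence; the function names are made up for illustration and the model ignores everything except lane movement.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

using V4 = std::array<std::uint32_t, 4>;

// Models `rev64 Vd.4s, Vn.4s`: reverse 32-bit elements inside each 64-bit half.
V4 rev64_4s(const V4 &v) { return {v[1], v[0], v[3], v[2]}; }

// Models `ext Vd.16b, Vn.16b, Vn.16b, #8` with identical sources:
// the result starts at byte 8, so the two 64-bit halves trade places.
V4 ext8_same(const V4 &v) { return {v[2], v[3], v[0], v[1]}; }

// The two-instruction sequence checked by the test above.
V4 reverse_v4i32(const V4 &v) { return ext8_same(rev64_4s(v)); }
```

For `<2 x i64>` the first step is unnecessary because each 64-bit half holds a single element, which is consistent with the lone `ext` in `reverse_v2i64`.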

				define <2 x i64> @reverse_v2i64(<2 x i64> %a) #0 {
				; CHECK-LABEL: reverse_v2i64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: ext v0.16b, v0.16b, v0.16b, #8
				; CHECK-NEXT: ret
				%res = call <2 x i64> @llvm.experimental.vector.reverse.v2i64(<2 x i64> %a)
				ret <2 x i64> %res
				}

				define <8 x half> @reverse_v8f16(<8 x half> %a) #0 {
				; CHECK-LABEL: .LCPI4_0:
				; CHECK: .byte 14 // 0xe
				; CHECK-NEXT: .byte 15 // 0xf
				; CHECK-NEXT: .byte 12 // 0xc
				; CHECK-NEXT: .byte 13 // 0xd
				; CHECK-NEXT: .byte 10 // 0xa
				; CHECK-NEXT: .byte 11 // 0xb
				; CHECK-NEXT: .byte 8 // 0x8
				; CHECK-NEXT: .byte 9 // 0x9
				; CHECK-NEXT: .byte 6 // 0x6
				; CHECK-NEXT: .byte 7 // 0x7
				; CHECK-NEXT: .byte 4 // 0x4
				; CHECK-NEXT: .byte 5 // 0x5
				; CHECK-NEXT: .byte 2 // 0x2
				; CHECK-NEXT: .byte 3 // 0x3
				; CHECK-NEXT: .byte 0 // 0x0
				; CHECK-NEXT: .byte 1 // 0x1
				; CHECK-LABEL: reverse_v8f16:
				; CHECK: // %bb.0:
				; CHECK-NEXT: adrp x8, .LCPI4_0
				; CHECK-NEXT: ldr q1, [x8, :lo12:.LCPI4_0]
				; CHECK-NEXT: tbl v0.16b, { v0.16b }, v1.16b
				; CHECK-NEXT: ret
				%res = call <8 x half> @llvm.experimental.vector.reverse.v8f16(<8 x half> %a)
				ret <8 x half> %res
				}

				define <4 x float> @reverse_v4f32(<4 x float> %a) #0 {
				; CHECK-LABEL: reverse_v4f32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: rev64 v0.4s, v0.4s
				; CHECK-NEXT: ext v0.16b, v0.16b, v0.16b, #8
				; CHECK-NEXT: ret
				%res = call <4 x float> @llvm.experimental.vector.reverse.v4f32(<4 x float> %a)
				ret <4 x float> %res
				}

				define <2 x double> @reverse_v2f64(<2 x double> %a) #0 {
				; CHECK-LABEL: reverse_v2f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: ext v0.16b, v0.16b, v0.16b, #8
				; CHECK-NEXT: ret
				%res = call <2 x double> @llvm.experimental.vector.reverse.v2f64(<2 x double> %a)
				ret <2 x double> %res
				}

				; Verify promote type legalisation works as expected.
				define <2 x i8> @reverse_v2i8(<2 x i8> %a) #0 {
				; CHECK-LABEL: reverse_v2i8:
				; CHECK: // %bb.0:
				; CHECK-NEXT: rev64 v0.2s, v0.2s
				; CHECK-NEXT: ret
				%res = call <2 x i8> @llvm.experimental.vector.reverse.v2i8(<2 x i8> %a)
				ret <2 x i8> %res
				}

				; Verify splitvec type legalisation works as expected.
				define <8 x i32> @reverse_v8i32(<8 x i32> %a) #0 {
				; CHECK-LABEL: reverse_v8i32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: rev64 v1.4s, v1.4s
				; CHECK-NEXT: rev64 v2.4s, v0.4s
				; CHECK-NEXT: ext v0.16b, v1.16b, v1.16b, #8
				; CHECK-NEXT: ext v1.16b, v2.16b, v2.16b, #8
				; CHECK-NEXT: ret
				%res = call <8 x i32> @llvm.experimental.vector.reverse.v8i32(<8 x i32> %a)
				ret <8 x i32> %res
				}
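Note on the splitvec case above: when the vector is wider than a register, the CHECK lines show each half being reversed separately, with the reversed high half (`v1`) ending up in the low result register (`v0`). A minimal sketch of that idea on plain containers, assuming an even element count; `reverseBySplitting` is a hypothetical name and not the legalizer's actual routine.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Reverse a vector by splitting it into low/high halves, reversing each half
// independently, and emitting the reversed high half first.
std::vector<int> reverseBySplitting(const std::vector<int> &v) {
  assert(v.size() % 2 == 0 && "sketch assumes an even element count");
  const auto mid = v.begin() + static_cast<std::ptrdiff_t>(v.size() / 2);
  std::vector<int> lo(v.begin(), mid), hi(mid, v.end());
  std::reverse(lo.begin(), lo.end());
  std::reverse(hi.begin(), hi.end());
  std::vector<int> result(hi); // reversed high half becomes the new low half
  result.insert(result.end(), lo.begin(), lo.end());
  return result;
}
```

Applying the same split recursively would cover the four-register `reverse_v16f32` case that follows.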

				; Verify splitvec type legalisation works as expected.
				define <16 x float> @reverse_v16f32(<16 x float> %a) #0 {
				; CHECK-LABEL: reverse_v16f32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: rev64 v3.4s, v3.4s
				; CHECK-NEXT: rev64 v2.4s, v2.4s
				; CHECK-NEXT: rev64 v4.4s, v1.4s
				; CHECK-NEXT: rev64 v5.4s, v0.4s
				; CHECK-NEXT: ext v0.16b, v3.16b, v3.16b, #8
				; CHECK-NEXT: ext v1.16b, v2.16b, v2.16b, #8
				; CHECK-NEXT: ext v2.16b, v4.16b, v4.16b, #8
				; CHECK-NEXT: ext v3.16b, v5.16b, v5.16b, #8
				; CHECK-NEXT: ret
				%res = call <16 x float> @llvm.experimental.vector.reverse.v16f32(<16 x float> %a)
				ret <16 x float> %res
				}


				declare <2 x i8> @llvm.experimental.vector.reverse.v2i8(<2 x i8>)
				declare <16 x i8> @llvm.experimental.vector.reverse.v16i8(<16 x i8>)
				declare <8 x i16> @llvm.experimental.vector.reverse.v8i16(<8 x i16>)
				declare <4 x i32> @llvm.experimental.vector.reverse.v4i32(<4 x i32>)
				declare <8 x i32> @llvm.experimental.vector.reverse.v8i32(<8 x i32>)
				declare <2 x i64> @llvm.experimental.vector.reverse.v2i64(<2 x i64>)
				declare <8 x half> @llvm.experimental.vector.reverse.v8f16(<8 x half>)
				declare <4 x float> @llvm.experimental.vector.reverse.v4f32(<4 x float>)
				declare <16 x float> @llvm.experimental.vector.reverse.v16f32(<16 x float>)
				declare <2 x double> @llvm.experimental.vector.reverse.v2f64(<2 x double>)

				attributes #0 = { nounwind "target-features"="+neon" }

llvm/test/CodeGen/AArch64/named-vector-shuffles-sve.ll

This file was added.

				; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
				; RUN: llc -verify-machineinstrs < %s | FileCheck %s
				sdesmalen (Not Done): nit: can you rename this file (and the one for neon and x86) to: `named-vector-shuffle-reverse-sve.ll`, `named-vector-shuffle-reverse-neon.ll` and `named-vector-shuffle-reverse.ll` respectively?

				target triple = "aarch64-unknown-linux-gnu"

				;
				; VECTOR_REVERSE - PPR
				;

				define <vscale x 2 x i1> @reverse_nxv2i1(<vscale x 2 x i1> %a) #0 {
				; CHECK-LABEL: reverse_nxv2i1:
				; CHECK: // %bb.0:
				; CHECK-NEXT: rev p0.d, p0.d
				; CHECK-NEXT: ret
				%res = call <vscale x 2 x i1> @llvm.experimental.vector.reverse.nxv2i1(<vscale x 2 x i1> %a)
				ret <vscale x 2 x i1> %res
				}

				define <vscale x 4 x i1> @reverse_nxv4i1(<vscale x 4 x i1> %a) #0 {
				; CHECK-LABEL: reverse_nxv4i1:
				; CHECK: // %bb.0:
				; CHECK-NEXT: rev p0.s, p0.s
				; CHECK-NEXT: ret
				%res = call <vscale x 4 x i1> @llvm.experimental.vector.reverse.nxv4i1(<vscale x 4 x i1> %a)
				ret <vscale x 4 x i1> %res
				}

				define <vscale x 8 x i1> @reverse_nxv8i1(<vscale x 8 x i1> %a) #0 {
				; CHECK-LABEL: reverse_nxv8i1:
				; CHECK: // %bb.0:
				; CHECK-NEXT: rev p0.h, p0.h
				; CHECK-NEXT: ret
				%res = call <vscale x 8 x i1> @llvm.experimental.vector.reverse.nxv8i1(<vscale x 8 x i1> %a)
				ret <vscale x 8 x i1> %res
				}

				define <vscale x 16 x i1> @reverse_nxv16i1(<vscale x 16 x i1> %a) #0 {
				; CHECK-LABEL: reverse_nxv16i1:
				; CHECK: // %bb.0:
				; CHECK-NEXT: rev p0.b, p0.b
				; CHECK-NEXT: ret
				%res = call <vscale x 16 x i1> @llvm.experimental.vector.reverse.nxv16i1(<vscale x 16 x i1> %a)
				ret <vscale x 16 x i1> %res
				}

				; Verify splitvec type legalisation works as expected.
				define <vscale x 32 x i1> @reverse_nxv32i1(<vscale x 32 x i1> %a) #0 {
				; CHECK-LABEL: reverse_nxv32i1:
				; CHECK: // %bb.0:
				; CHECK-NEXT: rev p2.b, p1.b
				; CHECK-NEXT: rev p1.b, p0.b
				; CHECK-NEXT: mov p0.b, p2.b
				; CHECK-NEXT: ret
				%res = call <vscale x 32 x i1> @llvm.experimental.vector.reverse.nxv32i1(<vscale x 32 x i1> %a)
				ret <vscale x 32 x i1> %res
				}

				;
				; VECTOR_REVERSE - ZPR
				;

				define <vscale x 16 x i8> @reverse_nxv16i8(<vscale x 16 x i8> %a) #0 {
				; CHECK-LABEL: reverse_nxv16i8:
				; CHECK: // %bb.0:
				; CHECK-NEXT: rev z0.b, z0.b
				; CHECK-NEXT: ret
				%res = call <vscale x 16 x i8> @llvm.experimental.vector.reverse.nxv16i8(<vscale x 16 x i8> %a)
				ret <vscale x 16 x i8> %res
				}

				define <vscale x 8 x i16> @reverse_nxv8i16(<vscale x 8 x i16> %a) #0 {
				; CHECK-LABEL: reverse_nxv8i16:
				; CHECK: // %bb.0:
				; CHECK-NEXT: rev z0.h, z0.h
				; CHECK-NEXT: ret
				%res = call <vscale x 8 x i16> @llvm.experimental.vector.reverse.nxv8i16(<vscale x 8 x i16> %a)
				ret <vscale x 8 x i16> %res
				}

				define <vscale x 4 x i32> @reverse_nxv4i32(<vscale x 4 x i32> %a) #0 {
				; CHECK-LABEL: reverse_nxv4i32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: rev z0.s, z0.s
				; CHECK-NEXT: ret
				%res = call <vscale x 4 x i32> @llvm.experimental.vector.reverse.nxv4i32(<vscale x 4 x i32> %a)
				ret <vscale x 4 x i32> %res
				}

				define <vscale x 2 x i64> @reverse_nxv2i64(<vscale x 2 x i64> %a) #0 {
				; CHECK-LABEL: reverse_nxv2i64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: rev z0.d, z0.d
				; CHECK-NEXT: ret
				%res = call <vscale x 2 x i64> @llvm.experimental.vector.reverse.nxv2i64(<vscale x 2 x i64> %a)
				ret <vscale x 2 x i64> %res
				}

				define <vscale x 8 x half> @reverse_nxv8f16(<vscale x 8 x half> %a) #0 {
				; CHECK-LABEL: reverse_nxv8f16:
				; CHECK: // %bb.0:
				; CHECK-NEXT: rev z0.h, z0.h
				; CHECK-NEXT: ret
				%res = call <vscale x 8 x half> @llvm.experimental.vector.reverse.nxv8f16(<vscale x 8 x half> %a)
				ret <vscale x 8 x half> %res
				}

				define <vscale x 4 x float> @reverse_nxv4f32(<vscale x 4 x float> %a) #0 {
				; CHECK-LABEL: reverse_nxv4f32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: rev z0.s, z0.s
				; CHECK-NEXT: ret
				%res = call <vscale x 4 x float> @llvm.experimental.vector.reverse.nxv4f32(<vscale x 4 x float> %a)
				ret <vscale x 4 x float> %res
				}

				define <vscale x 2 x double> @reverse_nxv2f64(<vscale x 2 x double> %a) #0 {
				; CHECK-LABEL: reverse_nxv2f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: rev z0.d, z0.d
				; CHECK-NEXT: ret
				%res = call <vscale x 2 x double> @llvm.experimental.vector.reverse.nxv2f64(<vscale x 2 x double> %a)
				ret <vscale x 2 x double> %res
				}

				; Verify promote type legalisation works as expected.
				define <vscale x 2 x i8> @reverse_nxv2i8(<vscale x 2 x i8> %a) #0 {
				; CHECK-LABEL: reverse_nxv2i8:
				; CHECK: // %bb.0:
				; CHECK-NEXT: rev z0.d, z0.d
				; CHECK-NEXT: ret
				%res = call <vscale x 2 x i8> @llvm.experimental.vector.reverse.nxv2i8(<vscale x 2 x i8> %a)
				ret <vscale x 2 x i8> %res
				}

				; Verify splitvec type legalisation works as expected.
				define <vscale x 8 x i32> @reverse_nxv8i32(<vscale x 8 x i32> %a) #0 {
				; CHECK-LABEL: reverse_nxv8i32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: rev z2.s, z1.s
				; CHECK-NEXT: rev z1.s, z0.s
				; CHECK-NEXT: mov z0.d, z2.d
				; CHECK-NEXT: ret
				%res = call <vscale x 8 x i32> @llvm.experimental.vector.reverse.nxv8i32(<vscale x 8 x i32> %a)
				ret <vscale x 8 x i32> %res
				}

				; Verify splitvec type legalisation works as expected.
				define <vscale x 16 x float> @reverse_nxv16f32(<vscale x 16 x float> %a) #0 {
				; CHECK-LABEL: reverse_nxv16f32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: rev z5.s, z3.s
				; CHECK-NEXT: rev z4.s, z2.s
				; CHECK-NEXT: rev z2.s, z1.s
				; CHECK-NEXT: rev z3.s, z0.s
				; CHECK-NEXT: mov z0.d, z5.d
				; CHECK-NEXT: mov z1.d, z4.d
				; CHECK-NEXT: ret
				%res = call <vscale x 16 x float> @llvm.experimental.vector.reverse.nxv16f32(<vscale x 16 x float> %a)
				ret <vscale x 16 x float> %res
				}


				declare <vscale x 2 x i1> @llvm.experimental.vector.reverse.nxv2i1(<vscale x 2 x i1>)
				declare <vscale x 4 x i1> @llvm.experimental.vector.reverse.nxv4i1(<vscale x 4 x i1>)
				declare <vscale x 8 x i1> @llvm.experimental.vector.reverse.nxv8i1(<vscale x 8 x i1>)
				declare <vscale x 16 x i1> @llvm.experimental.vector.reverse.nxv16i1(<vscale x 16 x i1>)
				declare <vscale x 32 x i1> @llvm.experimental.vector.reverse.nxv32i1(<vscale x 32 x i1>)
				declare <vscale x 2 x i8> @llvm.experimental.vector.reverse.nxv2i8(<vscale x 2 x i8>)
				declare <vscale x 16 x i8> @llvm.experimental.vector.reverse.nxv16i8(<vscale x 16 x i8>)
				declare <vscale x 8 x i16> @llvm.experimental.vector.reverse.nxv8i16(<vscale x 8 x i16>)
				declare <vscale x 4 x i32> @llvm.experimental.vector.reverse.nxv4i32(<vscale x 4 x i32>)
				declare <vscale x 8 x i32> @llvm.experimental.vector.reverse.nxv8i32(<vscale x 8 x i32>)
				declare <vscale x 2 x i64> @llvm.experimental.vector.reverse.nxv2i64(<vscale x 2 x i64>)
				declare <vscale x 8 x half> @llvm.experimental.vector.reverse.nxv8f16(<vscale x 8 x half>)
				declare <vscale x 4 x float> @llvm.experimental.vector.reverse.nxv4f32(<vscale x 4 x float>)
				declare <vscale x 16 x float> @llvm.experimental.vector.reverse.nxv16f32(<vscale x 16 x float>)
				declare <vscale x 2 x double> @llvm.experimental.vector.reverse.nxv2f64(<vscale x 2 x double>)


				attributes #0 = { nounwind "target-features"="+sve" }

llvm/test/Transforms/InstSimplify/named-vector-shuffles.ll

This file was added.

				; RUN: opt < %s -instsimplify -S | FileCheck %s

				; Test back to back reverse shuffles are eliminated.
				define <vscale x 4 x i32> @shuffle_b2b_reverse(<vscale x 4 x i32> %a) {
				; CHECK-LABEL: @shuffle_b2b_reverse(
				; CHECK: ret <vscale x 4 x i32> %a
				%rev = tail call <vscale x 4 x i32> @llvm.experimental.vector.reverse.nxv4i32(<vscale x 4 x i32> %a)
				%rev.rev = tail call <vscale x 4 x i32> @llvm.experimental.vector.reverse.nxv4i32(<vscale x 4 x i32> %rev)
				ret <vscale x 4 x i32> %rev.rev
				}

				declare <vscale x 4 x i32> @llvm.experimental.vector.reverse.nxv4i32(<vscale x 4 x i32>)
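Note on the test above: it relies on the fact that reversing twice restores the original lane order, so `reverse(reverse(%a))` can be replaced by `%a`. The sketch below models that fold on a tiny hypothetical expression node; `Node` and `simplify` are illustrative only and do not reflect the actual code in InstructionSimplify.cpp.

```cpp
#include <cassert>

// Hypothetical mini IR: a node is either a leaf value or a reverse of a node.
struct Node {
  bool isReverse;
  const Node *operand; // null for leaves
};

// reverse(reverse(X)) folds to X; any other node is returned unchanged.
const Node *simplify(const Node *n) {
  assert(n && "null node");
  if (n->isReverse && n->operand && n->operand->isReverse)
    return n->operand->operand;
  return n;
}
```

A single reverse is left alone in this model, matching the expectation that only back-to-back reverses are eliminated.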

Diff 317237