This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/
-
include/clang/Basic/
-
clang/
-
Basic/
1/1
BuiltinsPPC.def
-
lib/
-
CodeGen/
14/14
CGBuiltin.cpp
-
Headers/
-
altivec.h
-
test/CodeGen/
-
CodeGen/
1/1
builtins-ppc-p10vector.c

Differential D83500

[PowerPC][Power10] Implement custom codegen for the vec_replace_elt and vec_replace_unaligned builtins.
ClosedPublic

Authored by amyk on Jul 9 2020, 11:55 AM.

Download Raw Diff

Details

Reviewers

power-llvm-team
nemanjai
lei
kamaub

Group Reviewers

Restricted Project

Commits

rG6b136b19cbe4: [Power10] Implement custom codegen for the vec_replace_elt and…

Summary

This patch implements custom codegen for the vec_replace_elt and vec_replace_unaligned builtins.

These builtins map to the @llvm.ppc.altivec.vinsw and @llvm.ppc.altivec.vinsd intrinsics depending on the arguments.
The main motivation for doing custom codegen for these intrinsics is because there are float and double versions of the
builtin. Normally, the converting the float to an integer would be done via fptoui in the IR. This is incorrect as fptoui
truncates the value and we must ensure the value is not truncated. Therefore, we provide custom codegen to utilize
bitcast instead as bitcasts do not truncate.

The original patch that implemented the front end done this adding unions to altivec.h (https://reviews.llvm.org/D82359) but
this patch uses custom codegen to use bitcast instead for the float conversion instead.

Depends on D83497.

Diff Detail

Repository: rG LLVM Github Monorepo

Unit TestsFailed

	Time	Test
	880 ms	linux > Clang.CodeGen::Unknown Unit Message ("")
	450 ms	linux > MemorySanitizer-X86_64.MemorySanitizer-X86_64::Unknown Unit Message ("")
	510 ms	linux > MemorySanitizer-X86_64.MemorySanitizer-X86_64::Unknown Unit Message ("")
	270 ms	linux > MemorySanitizer-lld-X86_64.MemorySanitizer-lld-X86_64::Unknown Unit Message ("")
	250 ms	linux > MemorySanitizer-lld-X86_64.MemorySanitizer-lld-X86_64::Unknown Unit Message ("")
		View Full Test Results (17 Failed)

Event Timeline

amyk created this revision.Jul 9 2020, 11:55 AM

Herald added a subscriber: shchenz. · View Herald TranscriptJul 9 2020, 11:55 AM

amyk marked 2 inline comments as done.Jul 9 2020, 12:03 PM

amyk added inline comments.

clang/include/clang/Basic/BuiltinsPPC.def
339	I originally intended to implement this like the `xxpermdi` builtin: BUILTIN(__builtin_vsx_xxpermdi, "v.", "t") to use `v.` but I am not able to declare these builtins as void. For now, they're more or less an arbitrary signature that would match `vinsw`.
clang/test/CodeGen/builtins-ppc-p10vector.c
606	I've utilized tests that were from Biplob's original patch (https://reviews.llvm.org/D82359), but added the `bitcasts` to the float/double cases.

Updated for clang format changes.

Harbormaster failed remote builds in B63621: Diff 276794!Jul 9 2020, 12:20 PM

lei added inline comments.Jul 9 2020, 12:40 PM

clang/lib/CodeGen/CGBuiltin.cpp
14273	Do you mean? // The third argument of vec_replace_elt must be a compile time constant and will be emitted either // to the vinsw or vinsd instruction.
14289	ConstArg *= 4; // Fix the constant according to endianess. if (getTarget().isLittleEndian()) ConstArg = 12 - ConstArg;
14320	What are the chances of reaching to the end of this if/else-if section and `Call` is null? ie `getPrimitiveSizeInBits() != [32\|64]` I feel like it would be better if we can structure it so that we are not doing all these nesting of `if`s and just do returns within the diff if-conditions. Have you tried to pull out the diff handling of 32/64bit arg and consolidating the code a bit?

amyk marked 3 inline comments as done.Jul 9 2020, 2:41 PM

amyk added inline comments.

clang/lib/CodeGen/CGBuiltin.cpp
14273	Yes. Thank you - I will update the wording here and in the other builtin.
14320	Thanks - I realize that I should probably pull the `Call` out. I'll update this. I've actually consolidated the code quite a bit already, but I'll see if I can make any further improvements on this.

Address review comments

update comments
pull out common code

amyk marked 2 inline comments as done.Jul 9 2020, 2:42 PM

Harbormaster failed remote builds in B63649: Diff 276844!Jul 9 2020, 3:18 PM

Fix assignment of variable.

Harbormaster failed remote builds in B63655: Diff 276853!Jul 9 2020, 4:03 PM

Corrected the patch as it previously caused errors to the clang test case.

Harbormaster failed remote builds in B64259: Diff 278019!Jul 14 2020, 5:18 PM

LGTM

This revision is now accepted and ready to land.Jul 16 2020, 9:24 AM

The description includes ... however it is more preferable to use bitcast. It is not a question of preference but of correctness. The fp to int conversions truncate while bitcasts don't. The semantics of the builtins require that no truncation happen.

Also, please include checks in SemaChecking for:

Third argument being constant
Third argument being within range
Second argument having the same type as the element type of the first

clang/lib/CodeGen/CGBuiltin.cpp
14275	Where is the code that ensures this? There does not appear to be a Sema check to emit a meaningful message for this. We also need a test with a non-constant argument to show the message.
14278	I don't think we should be creating the declaration if we may not use it. Just initialize this to `nullptr` here and set it for each case.
14307	Please change this to a negative condition (i.e. if the type is not `i64`). Similarly in other similar conditions.
14319	Can we reorganize this as something like: case PPC::BI__builtin_altivec_vec_replace_elt: case PPC::BI__builtin_altivec_vec_replace_unaligned: { // Define variables that are needed unsigned ArgWidth = Ops[1]->getType()->getPrimitiveSizeInBits(); if (BuiltinID == PPC::BI__builtin_altivec_vec_replace_elt) ConstArg *= ArgWidth / 8; assert((ArgWidth == 32 \|\| ArgWidth == 64) && "Invalid argument width"); if (ArgWidth == 32) { // set up what is needed for vinsw } else { // set up what is needed for vinsd } // Emit the call if (BuiltinID == PPC::BI__builtin_altivec_vec_replace_elt) // add the bitcast of the result }

This revision now requires changes to proceed.Jul 16 2020, 11:01 AM

amyk edited the summary of this revision. (Show Details)Jul 17 2020, 3:15 PM

Address review comments:

Further consolidate the custom codegen of the two builtins
Add SemaChecking for if the third argument is a constant, if the third argument is in range and if the second argument is the same type as the element type of the first argument
Add extra test to test the semantic checks that were added

Harbormaster completed remote builds in B69201: Diff 287142.Aug 21 2020, 7:16 PM

Update to address clang-format.

Harbormaster completed remote builds in B69348: Diff 287444.Aug 24 2020, 11:17 AM

@nemanjai Could you please take another look to see if I have addressed your comments?

nemanjai added inline comments.Sep 17 2020, 7:12 PM

clang/lib/CodeGen/CGBuiltin.cpp
14285	`// The input to vec_replace_elt is an element index, not a byte index.`
14295	This is too vague. `// If the input vector is a float type, bitcast the inputs to integers.`
14296	This seems to be duplicated in both blocks. Can we not just do something like `if (!Ops[1]->getType()->isIntegerTy(ArgWidth))`? Then inside we can use the ternary operator to select between `Int32Ty` and `Int64Ty` if necessary. Then we only need one of these bitcast blocks just before we emit the call.
14309	More specific comment please - just as above.
14318	s/resultant/result
clang/lib/Sema/SemaChecking.cpp
3196 ↗	(On Diff #287444)	I am very surprised that this doesn't exist already but it seems more useful to have a `static` function in this file along the lines of: `static bool isEltOfVectorTy(QualType VectorTy, QualType EltTy)` That would do the obvious check.
3245 ↗	(On Diff #287444)	I don't think the `if` statements add to readability. I think this should just be a single return statement and the range should be selected by a ternary op. Something like: unsigned Width = Context.getIntWidth(TheCall->getArg(1)->getType()); QualType VecTy = TheCall->getArg(0)->getType(); QualType EltTy = TheCall->getArg(1)->getType(); return SemaBuiltinConstantArgRange(TheCall, 2, 0, Width == 32 ? 12 : 8) \|\| isEltOfVectorTy(VecTy, EltTy);

Just marking this not ready to keep my queue clean until the comments are addressed.

This revision now requires changes to proceed.Sep 17 2020, 7:32 PM

Address Nemanja's review comments:

More specific comments when bitcasting the inputs
Pull out conditions to bitcast the input, use ternary op depending if the input is 32 or 64-bits
Create new static function to check if a given type is the same type as a vector element

amyk marked 10 inline comments as done.Sep 21 2020, 7:11 PM

Harbormaster completed remote builds in B72458: Diff 293313.Sep 21 2020, 7:44 PM

@nemanjai Would you please take another look to see if I have addressed your comments when you get a chance? Thanks.

nemanjai added inline comments.Sep 23 2020, 9:10 AM

clang/lib/Sema/SemaChecking.cpp
3165 ↗	(On Diff #293313)	I think this should actually take a vector type and a scalar type. Then check that the scalar type is the same as the element type of the vector. The way this is implemented, a more apt name would be something like `checkSameTypes()`.

Updated the isEltOfVectorTy() to the correct semantics; making it take in a vector type and then getting the element type within the function.

Harbormaster completed remote builds in B72683: Diff 293779.Sep 23 2020, 9:57 AM

amyk marked 2 inline comments as done.Sep 23 2020, 10:34 AM

LGTM. Thanks for your patience and for addressing all the comments.

This revision is now accepted and ready to land.Sep 23 2020, 11:14 AM

This revision was landed with ongoing or failed builds.Sep 23 2020, 8:55 PM

Closed by commit rG6b136b19cbe4: [Power10] Implement custom codegen for the vec_replace_elt and… (authored by amyk). · Explain Why

This revision was automatically updated to reflect the committed changes.

amyk added a commit: rG6b136b19cbe4: [Power10] Implement custom codegen for the vec_replace_elt and….

Revision Contents

Path

Size

clang/

include/

clang/

Basic/

BuiltinsPPC.def

2 lines

lib/

CodeGen/

CGBuiltin.cpp

92 lines

Headers/

altivec.h

8 lines

test/

CodeGen/

builtins-ppc-p10vector.c

124 lines

Diff 276844

clang/include/clang/Basic/BuiltinsPPC.def

	Show First 20 Lines • Show All 329 Lines • ▼ Show 20 Lines
	BUILTIN(__builtin_altivec_vinsdlx, "V2ULLiV2ULLiULLiULLi", "")			BUILTIN(__builtin_altivec_vinsdlx, "V2ULLiV2ULLiULLiULLi", "")
	BUILTIN(__builtin_altivec_vinsdrx, "V2ULLiV2ULLiULLiULLi", "")			BUILTIN(__builtin_altivec_vinsdrx, "V2ULLiV2ULLiULLiULLi", "")
	BUILTIN(__builtin_altivec_vinsbvlx, "V16UcV16UcULLiV16Uc", "")			BUILTIN(__builtin_altivec_vinsbvlx, "V16UcV16UcULLiV16Uc", "")
	BUILTIN(__builtin_altivec_vinsbvrx, "V16UcV16UcULLiV16Uc", "")			BUILTIN(__builtin_altivec_vinsbvrx, "V16UcV16UcULLiV16Uc", "")
	BUILTIN(__builtin_altivec_vinshvlx, "V8UsV8UsULLiV8Us", "")			BUILTIN(__builtin_altivec_vinshvlx, "V8UsV8UsULLiV8Us", "")
	BUILTIN(__builtin_altivec_vinshvrx, "V8UsV8UsULLiV8Us", "")			BUILTIN(__builtin_altivec_vinshvrx, "V8UsV8UsULLiV8Us", "")
	BUILTIN(__builtin_altivec_vinswvlx, "V4UiV4UiULLiV4Ui", "")			BUILTIN(__builtin_altivec_vinswvlx, "V4UiV4UiULLiV4Ui", "")
	BUILTIN(__builtin_altivec_vinswvrx, "V4UiV4UiULLiV4Ui", "")			BUILTIN(__builtin_altivec_vinswvrx, "V4UiV4UiULLiV4Ui", "")
				BUILTIN(__builtin_altivec_vec_replace_elt, "V4UiV4UiUiIi", "t")
				BUILTIN(__builtin_altivec_vec_replace_unaligned, "V4UiV4UiUiIi", "t")
				amykAuthorUnsubmitted Done Reply Inline Actions I originally intended to implement this like the `xxpermdi` builtin: BUILTIN(__builtin_vsx_xxpermdi, "v.", "t") to use `v.` but I am not able to declare these builtins as void. For now, they're more or less an arbitrary signature that would match `vinsw`. amyk: I originally intended to implement this like the `xxpermdi` builtin: ``` BUILTIN…

	// VSX built-ins.			// VSX built-ins.

	BUILTIN(__builtin_vsx_lxvd2x, "V2divC*", "")			BUILTIN(__builtin_vsx_lxvd2x, "V2divC*", "")
	BUILTIN(__builtin_vsx_lxvw4x, "V4iivC*", "")			BUILTIN(__builtin_vsx_lxvw4x, "V4iivC*", "")
	BUILTIN(__builtin_vsx_lxvd2x_be, "V2dSLLivC*", "")			BUILTIN(__builtin_vsx_lxvd2x_be, "V2dSLLivC*", "")
	BUILTIN(__builtin_vsx_lxvw4x_be, "V4iSLLivC*", "")			BUILTIN(__builtin_vsx_lxvw4x_be, "V4iSLLivC*", "")

	▲ Show 20 Lines • Show All 200 Lines • Show Last 20 Lines

clang/lib/CodeGen/CGBuiltin.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 14,262 Lines • ▼ Show 20 Lines	Value *CodeGenFunction::EmitPPCBuiltinExpr(unsigned BuiltinID,
case PPC::BI__builtin_altivec_vctzw:		case PPC::BI__builtin_altivec_vctzw:
case PPC::BI__builtin_altivec_vctzd: {		case PPC::BI__builtin_altivec_vctzd: {
llvm::Type *ResultType = ConvertType(E->getType());		llvm::Type *ResultType = ConvertType(E->getType());
Value *X = EmitScalarExpr(E->getArg(0));		Value *X = EmitScalarExpr(E->getArg(0));
Value *Undef = ConstantInt::get(Builder.getInt1Ty(), false);		Value *Undef = ConstantInt::get(Builder.getInt1Ty(), false);
Function *F = CGM.getIntrinsic(Intrinsic::cttz, ResultType);		Function *F = CGM.getIntrinsic(Intrinsic::cttz, ResultType);
return Builder.CreateCall(F, {X, Undef});		return Builder.CreateCall(F, {X, Undef});
}		}
		case PPC::BI__builtin_altivec_vec_replace_elt: {
		// The third argument of vec_replace_elt must be a compile time constant
		// and will be emitted either to the vinsw or vinsd instruction.
		leiUnsubmitted Done Reply Inline Actions Do you mean? // The third argument of vec_replace_elt must be a compile time constant and will be emitted either // to the vinsw or vinsd instruction. lei: Do you mean? ``` // The third argument of vec_replace_elt must be a compile time constant…
		amykAuthorUnsubmitted Done Reply Inline Actions Yes. Thank you - I will update the wording here and in the other builtin. amyk: Yes. Thank you - I will update the wording here and in the other builtin.
		ConstantInt *ArgCI = dyn_cast<ConstantInt>(Ops[2]);
		assert(ArgCI &&
		nemanjaiUnsubmitted Done Reply Inline Actions Where is the code that ensures this? There does not appear to be a Sema check to emit a meaningful message for this. We also need a test with a non-constant argument to show the message. nemanjai: Where is the code that ensures this? There does not appear to be a Sema check to emit a…
		"Third Arg to vinsw/vinsd intrinsic must be a constant integer!");
		llvm::Type *ResultType = ConvertType(E->getType());
		llvm::Function *F;
		nemanjaiUnsubmitted Done Reply Inline Actions I don't think we should be creating the declaration if we may not use it. Just initialize this to `nullptr` here and set it for each case. nemanjai: I don't think we should be creating the declaration if we may not use it. Just initialize this…
		int64_t ConstArg = ArgCI->getSExtValue();
		Value *Call;
		if (Ops[1]->getType()->getPrimitiveSizeInBits() == 32) {
		// When the second argument is 32 bits, it can either be an integer or
		// a float. The vinsw intrinsic is used in this case.
		F = CGM.getIntrinsic(Intrinsic::ppc_altivec_vinsw);
		ConstArg *= 4;
		nemanjaiUnsubmitted Done Reply Inline Actions `// The input to vec_replace_elt is an element index, not a byte index.` nemanjai: `// The input to vec_replace_elt is an element index, not a byte index.`
		// Fix the constant according to endianess.
		if (getTarget().isLittleEndian())
		ConstArg = 12 - ConstArg;
		Ops[2] = ConstantInt::getSigned(Int32Ty, ConstArg);
		leiUnsubmitted Done Reply Inline Actions ConstArg = 4; // Fix the constant according to endianess. if (getTarget().isLittleEndian()) ConstArg = 12 - ConstArg; lei:* ``` ConstArg *= 4; // Fix the constant according to endianess. if (getTarget().isLittleEndian…
		// Perform additional handling if the second argument is a float.
		if (Ops[1]->getType()->isFloatTy()) {
		Ops[0] = Builder.CreateBitCast(Ops[0],
		llvm::FixedVectorType::get(Int32Ty, 4));
		Ops[1] = Builder.CreateBitCast(Ops[1], Int32Ty);
		Call = Builder.CreateCall(F, Ops);
		nemanjaiUnsubmitted Done Reply Inline Actions This is too vague. `// If the input vector is a float type, bitcast the inputs to integers.` nemanjai: This is too vague. `// If the input vector is a float type, bitcast the inputs to integers.`
		return Builder.CreateBitCast(Call, ResultType);
		nemanjaiUnsubmitted Done Reply Inline Actions This seems to be duplicated in both blocks. Can we not just do something like `if (!Ops[1]->getType()->isIntegerTy(ArgWidth))`? Then inside we can use the ternary operator to select between `Int32Ty` and `Int64Ty` if necessary. Then we only need one of these bitcast blocks just before we emit the call. nemanjai: This seems to be duplicated in both blocks. Can we not just do something like `if (!Ops[1]…
		}
		} else if (Ops[1]->getType()->getPrimitiveSizeInBits() == 64) {
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: variable 'F' is used uninitialized whenever 'if' condition is false [clang-diagnostic-sometimes-uninitialized] not useful Lint: Pre-merge checks: clang-tidy: warning: variable 'F' is used uninitialized whenever 'if' condition is false [clang…
		// When the second argument is 64 bits, it can either be a long long or
		// a double. The vinsd intrinsic is used in this case.
		F = CGM.getIntrinsic(Intrinsic::ppc_altivec_vinsd);
		ConstArg *= 8;
		// Fix the constant according to endianness.
		if (getTarget().isLittleEndian())
		ConstArg = 8 - ConstArg;
		Ops[2] = ConstantInt::getSigned(Int32Ty, ConstArg);
		// Perform additional handling if the second argument is a double.
		nemanjaiUnsubmitted Done Reply Inline Actions Please change this to a negative condition (i.e. if the type is not `i64`). Similarly in other similar conditions. nemanjai: Please change this to a negative condition (i.e. if the type is not `i64`). Similarly in…
		if (Ops[1]->getType()->isDoubleTy()) {
		Ops[0] = Builder.CreateBitCast(Ops[0],
		nemanjaiUnsubmitted Done Reply Inline Actions More specific comment please - just as above. nemanjai: More specific comment please - just as above.
		llvm::FixedVectorType::get(Int64Ty, 2));
		Ops[1] = Builder.CreateBitCast(Ops[1], Int64Ty);
		Call = Builder.CreateCall(F, Ops);
		return Builder.CreateBitCast(Call,
		llvm::FixedVectorType::get(DoubleTy, 2));
		}
		}
		Call = Builder.CreateCall(F, Ops);
		return Call;
		nemanjaiUnsubmitted Done Reply Inline Actions s/resultant/result nemanjai: s/resultant/result
		}
		nemanjaiUnsubmitted Done Reply Inline Actions Can we reorganize this as something like: case PPC::BI__builtin_altivec_vec_replace_elt: case PPC::BI__builtin_altivec_vec_replace_unaligned: { // Define variables that are needed unsigned ArgWidth = Ops[1]->getType()->getPrimitiveSizeInBits(); if (BuiltinID == PPC::BI__builtin_altivec_vec_replace_elt) ConstArg = ArgWidth / 8; assert((ArgWidth == 32 \|\| ArgWidth == 64) && "Invalid argument width"); if (ArgWidth == 32) { // set up what is needed for vinsw } else { // set up what is needed for vinsd } // Emit the call if (BuiltinID == PPC::BI__builtin_altivec_vec_replace_elt) // add the bitcast of the result } nemanjai:* Can we reorganize this as something like: ``` case PPC::BI__builtin_altivec_vec_replace_elt…
		case PPC::BI__builtin_altivec_vec_replace_unaligned: {
		leiUnsubmitted Done Reply Inline Actions What are the chances of reaching to the end of this if/else-if section and `Call` is null? ie `getPrimitiveSizeInBits() != [32\|64]` I feel like it would be better if we can structure it so that we are not doing all these nesting of `if`s and just do returns within the diff if-conditions. Have you tried to pull out the diff handling of 32/64bit arg and consolidating the code a bit? lei: What are the chances of reaching to the end of this if/else-if section and `Call` is null? ie…
		amykAuthorUnsubmitted Done Reply Inline Actions Thanks - I realize that I should probably pull the `Call` out. I'll update this. I've actually consolidated the code quite a bit already, but I'll see if I can make any further improvements on this. amyk: Thanks - I realize that I should probably pull the `Call` out. I'll update this. I've actually…
		// The third argument of vec_replace_unaligned must be a compile time
		// constant and will be emitted either to the vinsw or vinsd instruction.
		ConstantInt *ArgCI = dyn_cast<ConstantInt>(Ops[2]);
		assert(ArgCI &&
		"Third Arg to vinsw/vinsd intrinsic must be a constant integer!");
		llvm::Function *F;
		int64_t ConstArg = ArgCI->getSExtValue();
		Value *Call;
		if (Ops[1]->getType()->getPrimitiveSizeInBits() == 32) {
		// When the second argument is 32 bits, it can either be an integer or
		// a float. The vinsw intrinsic is used in this case.
		F = CGM.getIntrinsic(Intrinsic::ppc_altivec_vinsw);
		// Fix the constant if we are on little endian.
		if (getTarget().isLittleEndian())
		ConstArg = 12 - ConstArg;
		Ops[2] = ConstantInt::getSigned(Int32Ty, ConstArg);
		// Perform additional handling if the second argument is a float.
		if (Ops[1]->getType()->isFloatTy()) {
		Ops[0] = Builder.CreateBitCast(Ops[0],
		llvm::FixedVectorType::get(Int32Ty, 4));
		Ops[1] = Builder.CreateBitCast(Ops[1], Int32Ty);
		}
		} else if (Ops[1]->getType()->getPrimitiveSizeInBits() == 64) {
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: variable 'F' is used uninitialized whenever 'if' condition is false [clang-diagnostic-sometimes-uninitialized] not useful Lint: Pre-merge checks: clang-tidy: warning: variable 'F' is used uninitialized whenever 'if' condition is false [clang…
		// When the second argument is 64 bits, it can either be a long long or
		// a double. The vinsd intrinsic is used in this case.
		F = CGM.getIntrinsic(Intrinsic::ppc_altivec_vinsd);
		// Fix the constant if we are on little endian.
		if (getTarget().isLittleEndian())
		ConstArg = 8 - ConstArg;
		Ops[2] = ConstantInt::getSigned(Int32Ty, ConstArg);
		// Perform additional handling if the second argument is a double.
		if (Ops[1]->getType()->isDoubleTy()) {
		Ops[0] = Builder.CreateBitCast(Ops[0],
		llvm::FixedVectorType::get(Int64Ty, 2));
		Ops[1] = Builder.CreateBitCast(Ops[1], Int64Ty);
		}
		}
		// Emit the call to vinsd, and bitcast the result to a vector of char.
		Call = Builder.CreateCall(F, Ops);
		Call = Builder.CreateBitCast(Call, llvm::FixedVectorType::get(Int8Ty, 16));
		return Call;
		}
case PPC::BI__builtin_altivec_vpopcntb:		case PPC::BI__builtin_altivec_vpopcntb:
case PPC::BI__builtin_altivec_vpopcnth:		case PPC::BI__builtin_altivec_vpopcnth:
case PPC::BI__builtin_altivec_vpopcntw:		case PPC::BI__builtin_altivec_vpopcntw:
case PPC::BI__builtin_altivec_vpopcntd: {		case PPC::BI__builtin_altivec_vpopcntd: {
llvm::Type *ResultType = ConvertType(E->getType());		llvm::Type *ResultType = ConvertType(E->getType());
Value *X = EmitScalarExpr(E->getArg(0));		Value *X = EmitScalarExpr(E->getArg(0));
llvm::Function *F = CGM.getIntrinsic(Intrinsic::ctpop, ResultType);		llvm::Function *F = CGM.getIntrinsic(Intrinsic::ctpop, ResultType);
return Builder.CreateCall(F, X);		return Builder.CreateCall(F, X);
▲ Show 20 Lines • Show All 2,543 Lines • Show Last 20 Lines

clang/lib/Headers/altivec.h

This file is larger than 256 KB, so syntax highlighting is disabled by default.

/===---- altivec.h - Standard header for type generic math ---------------===\		/===---- altivec.h - Standard header for type generic math ---------------===\
*		*
* Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		* Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
* See https://llvm.org/LICENSE.txt for license information.		* See https://llvm.org/LICENSE.txt for license information.
* SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		* SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
*		*
\===----------------------------------------------------------------------===/		\===----------------------------------------------------------------------===/

#ifndef __ALTIVEC_H		#ifndef __ALTIVEC_H
#define __ALTIVEC_H		#define __ALTIVEC_H

#ifndef __ALTIVEC__		#ifndef __ALTIVEC__
#error "AltiVec support not enabled"		#error "AltiVec support not enabled"
		Lint: Pre-merge checks Inline Actions clang-tidy: error: "AltiVec support not enabled" [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: "AltiVec support not enabled" [clang-diagnostic-error] [[https://github.
#endif		#endif

/* Constants for mapping CR6 bits to predicate result. */		/* Constants for mapping CR6 bits to predicate result. */

#define __CR6_EQ 0		#define __CR6_EQ 0
#define __CR6_EQ_REV 1		#define __CR6_EQ_REV 1
#define __CR6_LT 2		#define __CR6_LT 2
#define __CR6_LT_REV 3		#define __CR6_LT_REV 3
Show All 18 Lines	#define __VEC_CLASS_FP_NOT_NORMAL (__VEC_CLASS_FP_NAN \| \
__VEC_CLASS_FP_INFINITY)		__VEC_CLASS_FP_INFINITY)

#define __ATTRS_o_ai __attribute__((__overloadable__, __always_inline__))		#define __ATTRS_o_ai __attribute__((__overloadable__, __always_inline__))

#ifdef __POWER9_VECTOR__		#ifdef __POWER9_VECTOR__
#include <stddef.h>		#include <stddef.h>
#endif		#endif

static __inline__ vector signed char __ATTRS_o_ai vec_perm(		static __inline__ vector signed char __ATTRS_o_ai vec_perm(
		Lint: Pre-merge checks Inline Actions clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] [[https://github.
vector signed char __a, vector signed char __b, vector unsigned char __c);		vector signed char __a, vector signed char __b, vector unsigned char __c);
		Lint: Pre-merge checks Inline Actions clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] not useful clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] not useful clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] [[https://github.

static __inline__ vector unsigned char __ATTRS_o_ai		static __inline__ vector unsigned char __ATTRS_o_ai
		Lint: Pre-merge checks Inline Actions clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] [[https://github.
vec_perm(vector unsigned char __a, vector unsigned char __b,		vec_perm(vector unsigned char __a, vector unsigned char __b,
		Lint: Pre-merge checks Inline Actions clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] not useful clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] [[https://github.
vector unsigned char __c);		vector unsigned char __c);
		Lint: Pre-merge checks Inline Actions clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] [[https://github.

static __inline__ vector bool char __ATTRS_o_ai		static __inline__ vector bool char __ATTRS_o_ai
		Lint: Pre-merge checks Inline Actions clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] [[https://github.
vec_perm(vector bool char __a, vector bool char __b, vector unsigned char __c);		vec_perm(vector bool char __a, vector bool char __b, vector unsigned char __c);
		Lint: Pre-merge checks Inline Actions clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] not useful clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] not useful clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] [[https://github.

static __inline__ vector short __ATTRS_o_ai vec_perm(vector signed short __a,		static __inline__ vector short __ATTRS_o_ai vec_perm(vector signed short __a,
		Lint: Pre-merge checks Inline Actions clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] not useful clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] [[https://github.
vector signed short __b,		vector signed short __b,
		Lint: Pre-merge checks Inline Actions clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] [[https://github.
vector unsigned char __c);		vector unsigned char __c);
		Lint: Pre-merge checks Inline Actions clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] [[https://github.

static __inline__ vector unsigned short __ATTRS_o_ai		static __inline__ vector unsigned short __ATTRS_o_ai
		Lint: Pre-merge checks Inline Actions clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] [[https://github.
vec_perm(vector unsigned short __a, vector unsigned short __b,		vec_perm(vector unsigned short __a, vector unsigned short __b,
		Lint: Pre-merge checks Inline Actions clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: unknown type name 'vector' [clang-diagnostic-error] [[https://github.
vector unsigned char __c);		vector unsigned char __c);

static __inline__ vector bool short __ATTRS_o_ai vec_perm(		static __inline__ vector bool short __ATTRS_o_ai vec_perm(
vector bool short __a, vector bool short __b, vector unsigned char __c);		vector bool short __a, vector bool short __b, vector unsigned char __c);

static __inline__ vector pixel __ATTRS_o_ai vec_perm(vector pixel __a,		static __inline__ vector pixel __ATTRS_o_ai vec_perm(vector pixel __a,
vector pixel __b,		vector pixel __b,
vector unsigned char __c);		vector unsigned char __c);
▲ Show 20 Lines • Show All 17,018 Lines • ▼ Show 20 Lines
}		}

static __inline__ vector double __ATTRS_o_ai		static __inline__ vector double __ATTRS_o_ai
vec_blendv(vector double __a, vector double __b,		vec_blendv(vector double __a, vector double __b,
vector unsigned long long __c) {		vector unsigned long long __c) {
return __builtin_vsx_xxblendvd(__a, __b, __c);		return __builtin_vsx_xxblendvd(__a, __b, __c);
}		}

		/* vec_replace_elt */
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - /* vec_replace_elt / +/ vec_replace_elt / Lint: Pre-merge checks:* clang-format: please reformat the code ``` - /* vec_replace_elt / +/ vec_replace_elt */ ```

		#define vec_replace_elt __builtin_altivec_vec_replace_elt

		/* vec_replace_unaligned */

		#define vec_replace_unaligned __builtin_altivec_vec_replace_unaligned

/* vec_splati */		/* vec_splati */

#define vec_splati(__a) \		#define vec_splati(__a) \
_Generic((__a), signed int \		_Generic((__a), signed int \
: ((vector signed int)__a), unsigned int \		: ((vector signed int)__a), unsigned int \
: ((vector unsigned int)__a), float \		: ((vector unsigned int)__a), float \
: ((vector float)__a))		: ((vector float)__a))

▲ Show 20 Lines • Show All 49 Lines • Show Last 20 Lines

clang/test/CodeGen/builtins-ppc-p10vector.c

Show All 18 Lines
vector unsigned short vusa, vusb, vusc;		vector unsigned short vusa, vusb, vusc;
vector signed int vsia, vsib;		vector signed int vsia, vsib;
vector unsigned int vuia, vuib, vuic;		vector unsigned int vuia, vuib, vuic;
vector signed long long vslla, vsllb;		vector signed long long vslla, vsllb;
vector unsigned long long vulla, vullb, vullc;		vector unsigned long long vulla, vullb, vullc;
vector unsigned __int128 vui128a, vui128b, vui128c;		vector unsigned __int128 vui128a, vui128b, vui128c;
vector float vfa, vfb;		vector float vfa, vfb;
vector double vda, vdb;		vector double vda, vdb;
		signed int sia;
unsigned int uia, uib;		unsigned int uia, uib;
unsigned char uca;		unsigned char uca;
unsigned short usa;		unsigned short usa;
		signed long long slla;
unsigned long long ulla;		unsigned long long ulla;
		float fa;
		double da;

vector unsigned long long test_vpdepd(void) {		vector unsigned long long test_vpdepd(void) {
// CHECK: @llvm.ppc.altivec.vpdepd(<2 x i64>		// CHECK: @llvm.ppc.altivec.vpdepd(<2 x i64>
// CHECK-NEXT: ret <2 x i64>		// CHECK-NEXT: ret <2 x i64>
return vec_pdep(vulla, vullb);		return vec_pdep(vulla, vullb);
}		}

vector unsigned long long test_vpextd(void) {		vector unsigned long long test_vpextd(void) {
▲ Show 20 Lines • Show All 537 Lines • ▼ Show 20 Lines	vector float test_vec_vec_splati_ins_f(void) {
// CHECK-BE: ret <4 x float>		// CHECK-BE: ret <4 x float>
// CHECK: [[T1:%.+]] = sub i32 1, %{{.+}}		// CHECK: [[T1:%.+]] = sub i32 1, %{{.+}}
// CHECK: insertelement <4 x float> %{{.+}}, float %{{.+}}, i32 [[T1]]		// CHECK: insertelement <4 x float> %{{.+}}, float %{{.+}}, i32 [[T1]]
// CHECK: [[T2:%.+]] = sub i32 3, %{{.+}}		// CHECK: [[T2:%.+]] = sub i32 3, %{{.+}}
// CHECK: insertelement <4 x float> %{{.+}}, float %{{.+}}, i32 [[T2]]		// CHECK: insertelement <4 x float> %{{.+}}, float %{{.+}}, i32 [[T2]]
// CHECK: ret <4 x float>		// CHECK: ret <4 x float>
return vec_splati_ins(vfa, 0, 1.0f);		return vec_splati_ins(vfa, 0, 1.0f);
}		}

		vector signed int test_vec_replace_elt_si(void) {
		// CHECK-BE: @llvm.ppc.altivec.vinsw(<4 x i32> %{{.+}}, i32 %{{.+}}, i32 0
		// CHECK-BE-NEXT: ret <4 x i32>
		// CHECK: @llvm.ppc.altivec.vinsw(<4 x i32> %{{.+}}, i32 %{{.+}}, i32 12
		// CHECK-NEXT: ret <4 x i32>
		return vec_replace_elt(vsia, sia, 0);
		}

		vector unsigned int test_vec_replace_elt_ui(void) {
		// CHECK-BE: @llvm.ppc.altivec.vinsw(<4 x i32> %{{.+}}, i32 %{{.+}}, i32 4
		// CHECK-BE-NEXT: ret <4 x i32>
		// CHECK: @llvm.ppc.altivec.vinsw(<4 x i32> %{{.+}}, i32 %{{.+}}, i32 8
		// CHECK-NEXT: ret <4 x i32>
		return vec_replace_elt(vuia, uia, 1);
		}

		vector float test_vec_replace_elt_f(void) {
		// CHECK-BE: bitcast float %{{.+}} to i32
		amykAuthorUnsubmitted Done Reply Inline Actions I've utilized tests that were from Biplob's original patch (https://reviews.llvm.org/D82359), but added the `bitcasts` to the float/double cases. amyk: I've utilized tests that were from Biplob's original patch (https://reviews.llvm.org/D82359)…
		// CHECK-BE-NEXT: @llvm.ppc.altivec.vinsw(<4 x i32> %{{.+}}, i32 %{{.+}}, i32 8
		// CHECK-BE-NEXT: bitcast <4 x i32> %{{.*}} to <4 x float>
		// CHECK-BE-NEXT: ret <4 x float>
		// CHECK: bitcast float %{{.+}} to i32
		// CHECK-NEXT: @llvm.ppc.altivec.vinsw(<4 x i32> %{{.+}}, i32 %{{.+}}, i32 4
		// CHECK-NEXT: bitcast <4 x i32> %{{.*}} to <4 x float>
		// CHECK-NEXT: ret <4 x float>
		return vec_replace_elt(vfa, fa, 2);
		}

		vector signed long long test_vec_replace_elt_sll(void) {
		// CHECK-BE: @llvm.ppc.altivec.vinsd(<2 x i64> %{{.+}}, i64 %{{.+}}, i32 0
		// CHECK-BE-NEXT: ret <2 x i64>
		// CHECK: @llvm.ppc.altivec.vinsd(<2 x i64> %{{.+}}, i64 %{{.+}}, i32 8
		// CHECK-NEXT: ret <2 x i64>
		return vec_replace_elt(vslla, slla, 0);
		}

		vector unsigned long long test_vec_replace_elt_ull(void) {
		// CHECK-BE: @llvm.ppc.altivec.vinsd(<2 x i64> %{{.+}}, i64 %{{.+}}, i32 0
		// CHECK-BE-NEXT: ret <2 x i64>
		// CHECK: @llvm.ppc.altivec.vinsd(<2 x i64> %{{.+}}, i64 %{{.+}}, i32 8
		// CHECK-NEXT: ret <2 x i64>
		return vec_replace_elt(vulla, ulla, 0);
		}

		vector double test_vec_replace_elt_d(void) {
		// CHECK-BE: bitcast double %{{.+}} to i64
		// CHECK-BE-NEXT: @llvm.ppc.altivec.vinsd(<2 x i64> %{{.+}}, i64 %{{.+}}, i32 8
		// CHECK-BE-NEXT: bitcast <2 x i64> %{{.*}} to <2 x double>
		// CHECK-BE-NEXT: ret <2 x double>
		// CHECK: bitcast double %{{.+}} to i64
		// CHECK-NEXT: @llvm.ppc.altivec.vinsd(<2 x i64> %{{.+}}, i64 %{{.+}}, i32 0
		// CHECK-NEXT: bitcast <2 x i64> %{{.*}} to <2 x double>
		// CHECK-NEXT: ret <2 x double>
		return vec_replace_elt(vda, da, 1);
		}

		vector unsigned char test_vec_replace_unaligned_si(void) {
		// CHECK-BE: @llvm.ppc.altivec.vinsw(<4 x i32> %{{.+}}, i32 %{{.+}}, i32 6
		// CHECK-BE-NEXT: bitcast <4 x i32> %{{.*}} to <16 x i8>
		// CHECK-BE-NEXT: ret <16 x i8>
		// CHECK: @llvm.ppc.altivec.vinsw(<4 x i32> %{{.+}}, i32 %{{.+}}, i32 6
		// CHECK-NEXT: bitcast <4 x i32> %{{.*}} to <16 x i8>
		// CHECK-NEXT: ret <16 x i8>
		return vec_replace_unaligned(vsia, sia, 6);
		}

		vector unsigned char test_vec_replace_unaligned_ui(void) {
		// CHECK-BE: @llvm.ppc.altivec.vinsw(<4 x i32> %{{.+}}, i32 %{{.+}}, i32 8
		// CHECK-BE-NEXT: bitcast <4 x i32> %{{.*}} to <16 x i8>
		// CHECK-BE-NEXT: ret <16 x i8>
		// CHECK: @llvm.ppc.altivec.vinsw(<4 x i32> %{{.+}}, i32 %{{.+}}, i32 4
		// CHECK-NEXT: bitcast <4 x i32> %{{.*}} to <16 x i8>
		// CHECK-NEXT: ret <16 x i8>
		return vec_replace_unaligned(vuia, uia, 8);
		}

		vector unsigned char test_vec_replace_unaligned_f(void) {
		// CHECK-BE: bitcast float %{{.+}} to i32
		// CHECK-BE-NEXT: @llvm.ppc.altivec.vinsw(<4 x i32> %{{.+}}, i32 %{{.+}}, i32 12
		// CHECK-BE-NEXT: bitcast <4 x i32> %{{.*}} to <16 x i8>
		// CHECK-BE-NEXT: ret <16 x i8>
		// CHECK: bitcast float %{{.+}} to i32
		// CHECK-NEXT: @llvm.ppc.altivec.vinsw(<4 x i32> %{{.+}}, i32 %{{.+}}, i32 0
		// CHECK-NEXT: bitcast <4 x i32> %{{.*}} to <16 x i8>
		// CHECK-NEXT: ret <16 x i8>
		return vec_replace_unaligned(vfa, fa, 12);
		}

		vector unsigned char test_vec_replace_unaligned_sll(void) {
		// CHECK-BE: @llvm.ppc.altivec.vinsd(<2 x i64> %{{.+}}, i64 %{{.+}}, i32 6
		// CHECK-BE-NEXT: bitcast <2 x i64> %{{.*}} to <16 x i8>
		// CHECK-BE-NEXT: ret <16 x i8>
		// CHECK: @llvm.ppc.altivec.vinsd(<2 x i64> %{{.+}}, i64 %{{.+}}, i32 2
		// CHECK-NEXT: bitcast <2 x i64> %{{.*}} to <16 x i8>
		// CHECK-NEXT: ret <16 x i8>
		return vec_replace_unaligned(vslla, slla, 6);
		}

		vector unsigned char test_vec_replace_unaligned_ull(void) {
		// CHECK-BE: @llvm.ppc.altivec.vinsd(<2 x i64> %{{.+}}, i64 %{{.+}}, i32 7
		// CHECK-BE-NEXT: bitcast <2 x i64> %{{.*}} to <16 x i8>
		// CHECK-BE-NEXT: ret <16 x i8>
		// CHECK: @llvm.ppc.altivec.vinsd(<2 x i64> %{{.+}}, i64 %{{.+}}, i32 1
		// CHECK-NEXT: bitcast <2 x i64> %{{.*}} to <16 x i8>
		// CHECK-NEXT: ret <16 x i8>
		return vec_replace_unaligned(vulla, ulla, 7);
		}

		vector unsigned char test_vec_replace_unaligned_d(void) {
		// CHECK-BE: bitcast double %{{.+}} to i64
		// CHECK-BE-NEXT: @llvm.ppc.altivec.vinsd(<2 x i64> %{{.+}}, i64 %{{.+}}, i32 8
		// CHECK-BE-NEXT: bitcast <2 x i64> %{{.*}} to <16 x i8>
		// CHECK-BE-NEXT: ret <16 x i8>
		// CHECK: bitcast double %{{.+}} to i64
		// CHECK-NEXT: @llvm.ppc.altivec.vinsd(<2 x i64> %{{.+}}, i64 %{{.+}}, i32 0
		// CHECK-NEXT: bitcast <2 x i64> %{{.*}} to <16 x i8>
		// CHECK-NEXT: ret <16 x i8>
		return vec_replace_unaligned(vda, da, 8);
		}