Details

Reviewers

echristo
kbarton
nemanjai
lei
syzaara
inouehrs
hfinkel
seanbruno

Commits

rGbbc48e91643b: [PowerPC] Implement vec_xxpermdi builtin.
rC303760: [PowerPC] Implement vec_xxpermdi builtin.
rL303760: [PowerPC] Implement vec_xxpermdi builtin.

Summary

The vec_xxpermdi builtin is missing from altivec.h. It has been requested by developers working on libvpx for VP9 support for Google (PR: https://bugs.llvm.org/show_bug.cgi?id=32653). Initially, I tried to define a new intrinsic that maps directly to the corresponding PowerPC hardware instruction (XXPERMDI), but feedback from the community was that this can be done without introducing a new intrinsic. This patch re-implements the vec_xxpermdi builtin using the existing shufflevector instruction, entirely in the front end.

We currently won't emit an XXPERMDI when the parameters are not vectors of doubleword elements; subsequent back-end optimization work is needed to identify the correct shuffles so that we can emit XXPERMDI for all vector types.
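The front-end lowering described in the summary can be modeled in plain C++ as a rough sketch: the inputs are reinterpreted as two doublewords (the bitcast), two doublewords are picked from the concatenation of both inputs (the shufflevector), and the result is reinterpreted back to the original element type. The names `V4i32`, `V2i64`, `toDoublewords`, `toWords`, `shuffleDoublewords` and `permuteAsDoublewords` are hypothetical and exist only for this illustration; they are not Clang or LLVM APIs.

```cpp
#include <array>
#include <cassert>
#include <cstdint>
#include <cstring>

using V4i32 = std::array<int32_t, 4>;   // stands in for <4 x i32>
using V2i64 = std::array<uint64_t, 2>;  // stands in for <2 x i64>

// Model of a bitcast between same-sized vectors: the bytes are reused as-is.
inline V2i64 toDoublewords(const V4i32 &V) {
  V2i64 R;
  static_assert(sizeof(R) == sizeof(V), "vectors must have the same size");
  std::memcpy(R.data(), V.data(), sizeof(R));
  return R;
}

inline V4i32 toWords(const V2i64 &V) {
  V4i32 R;
  static_assert(sizeof(R) == sizeof(V), "vectors must have the same size");
  std::memcpy(R.data(), V.data(), sizeof(R));
  return R;
}

// Model of a shufflevector on two <2 x i64> inputs: mask values 0-1 select
// an element of A, mask values 2-3 select an element of B.
inline V2i64 shuffleDoublewords(const V2i64 &A, const V2i64 &B, unsigned M0,
                                unsigned M1) {
  const uint64_t Concat[4] = {A[0], A[1], B[0], B[1]};
  return {Concat[M0 & 3], Concat[M1 & 3]};
}

// Bitcast in, shuffle, bitcast out, so the result has the input element type.
inline V4i32 permuteAsDoublewords(const V4i32 &A, const V4i32 &B, unsigned M0,
                                  unsigned M1) {
  return toWords(
      shuffleDoublewords(toDoublewords(A), toDoublewords(B), M0, M1));
}
```

In this model, `permuteAsDoublewords({0, 1, 2, 3}, {4, 5, 6, 7}, 0, 2)` keeps the first doubleword of each input, so pairs of 32-bit elements always move together. That is why a shuffle on doubleword vectors is enough for every element type, while the final cast back is what keeps the call's result type equal to its argument type.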

Diff Detail

Repository: rL LLVM

Event Timeline

jtony created this revision. May 10 2017, 10:27 AM

Add a test case like the one that currently crashes (see inline comment). Also, please do the following:

Put a note in the description (and the commit message) with a link to the PR this fixes
Put a note in the description that there is subsequent back end work to identify the correct shuffles (i.e. we currently won't emit an XXPERMDI when parameters are not vectors of doubleword elements)

include/clang/Basic/DiagnosticSemaKinds.td
7997 ↗	(On Diff #98484)	It isn't appropriate to remove a target-independent diagnostic and replace it with a target-specific one (even if it's just in the name). I think it makes sense to update both `err_shufflevector_non_vector` and `err_shufflevector_incompatible_vector` to take an argument for the name of the builtin, but not to replace it with a target-specific one. I would recommend renaming it to something like: `err_shufflevector_non_vector` -> `err_vec_builtin_non_vector` `err_shufflevector_incompatible_vector` -> `err_vec_builtin_incompatible_vector` and add the parameter.
8005 ↗	(On Diff #98484)	We shouldn't diagnose out-of-range values.
lib/CodeGen/CGBuiltin.cpp
8407 ↗	(On Diff #98484)	This change is unrelated and inconsequential. Please revert it.
8425 ↗	(On Diff #98484)	We shouldn't clamp this. The correct semantics should be to mask out everything but the low order two bits (i.e. `&0x3`). But I think even that is unnecessary since you should just use the low order two bits from the input - see below.
8433 ↗	(On Diff #98484)	The switch is overkill. You should just implement this in an obvious way (i.e. the same way as described in the ISA). For big endian: `ElemIdx0 = (Index & 2) >> 1;` `ElemIdx1 = 2 + (Index & 1);` For little endian: `ElemIdx0 = (~Index & 1) + 2;` `ElemIdx1 = (~Index & 2) >> 1;` (of course, please verify the expressions).
8482 ↗	(On Diff #98484)	This is not going to work. You can try it with this simple test case, which crashes:
	#include <altivec.h>
	vector int test(vector int a, vector int b) {
	  return vec_xxpermdi(a, b, 0);
	}
	The problem is that the return type of the call expression may be different from the return type of the `shufflevector`. You'll probably need something like this:
	QualType BIRetType = E->getType();
	auto RetTy = ConvertType(BIRetType);
	return Builder.CreateBitCast(ShuffleCall, RetTy);
lib/Sema/SemaChecking.cpp
3900 ↗	(On Diff #98484)	I assume that we won't even need this at all if we're not diagnosing the range of the third argument.
3915 ↗	(On Diff #98484)	No, we don't want to diagnose this. Values larger than 3 should just use the low-order two bits. This matches GCC behaviour.
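
The inline comments above converge on deriving both shuffle indices directly from the low-order two bits of the third argument, with no clamping and no range diagnostic. A minimal sketch of that computation follows; the helper name `xxpermdiMask` is hypothetical, and the expressions are the ones proposed in the review, fully parenthesized because `>>` binds more tightly than `&` in C++.

```cpp
#include <cassert>
#include <utility>

// Hypothetical helper: returns the two shufflevector indices for a given
// third argument. Indices 0-1 refer to the first input vector and indices
// 2-3 to the second, as in a two-input shuffle of <2 x i64> vectors.
inline std::pair<unsigned, unsigned> xxpermdiMask(unsigned Index,
                                                  bool IsLittleEndian) {
  if (IsLittleEndian)
    return {(~Index & 1) + 2, (~Index & 2) >> 1};
  return {(Index & 2) >> 1, 2 + (Index & 1)};
}
```

Only bits 0 and 1 of `Index` take part in either branch, so a value such as 5 yields the same mask as 1 without any explicit `& 0x3`. Without the inner parentheses, `~Index & 2 >> 1` would parse as `~Index & (2 >> 1)` and select the wrong bit.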

This revision now requires changes to proceed. May 13 2017, 6:26 AM

jtony mentioned this in D33236: [PowerPC] Implement vec_xxsldwi builtin. May 16 2017, 6:54 AM

jtony edited the summary of this revision. May 16 2017, 10:52 AM

jtony edited the summary of this revision. May 16 2017, 1:20 PM

jtony marked 6 inline comments as done.

jtony added inline comments.

lib/CodeGen/CGBuiltin.cpp
8433 ↗	(On Diff #98484)	Good call, fixed as suggested.

Address all the comments from Nemanja.

jtony edited the summary of this revision. May 17 2017, 6:00 PM

jtony marked an inline comment as done. May 17 2017, 7:14 PM

Other than the few minor comments, this LGTM.

lib/CodeGen/CGBuiltin.cpp
8458 ↗	(On Diff #99292)	Minor nit: please add a comment explaining the expressions. For example:
	// Element zero comes from the first input vector and element one comes from the
	// second. The element indices within each vector are numbered in big-endian
	// order so the shuffle mask must be adjusted for this on little endian
	// platforms (i.e. index is complemented and source vector reversed).
lib/Sema/SemaChecking.cpp
3944 ↗	(On Diff #99292)	It seems strange that we're comparing argument types above and element types of argument vectors here. Can we not just use `hasSameUnqualifiedType` like the Sema checks on `__builtin_shufflevector` do?
test/CodeGen/builtins-ppc-error.c
27 ↗	(On Diff #99292)	Add a test for non-vector arguments. Perhaps something like: `vec_xxpermdi(1, 2, 3);`
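
The endianness adjustment requested in the comment on CGBuiltin.cpp can be checked with a small model. This is only a sketch under two stated assumptions: a register holds two doublewords numbered 0 and 1 in big-endian order, with doubleword 0 of the result taken from the first input (selected by bit 1 of the index) and doubleword 1 from the second input (selected by bit 0); and on a little-endian target, vector element `i` lives in register doubleword `1 - i`. The names `Reg`, `xxpermdiReference`, `shuffleLittleEndian` and `xxpermdiLittleEndian` are hypothetical.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// A register as two doublewords in big-endian numbering (assumption of this
// sketch): Reg[0] is the leftmost doubleword.
using Reg = std::array<uint64_t, 2>;

// Reference behaviour assumed here: result doubleword 0 comes from A and is
// selected by bit 1 of Index; result doubleword 1 comes from B and is
// selected by bit 0.
inline Reg xxpermdiReference(const Reg &A, const Reg &B, unsigned Index) {
  return {A[(Index >> 1) & 1], B[Index & 1]};
}

// A two-input shuffle as seen on a little-endian target, where vector
// element i is assumed to live in register doubleword 1 - i.
inline Reg shuffleLittleEndian(const Reg &A, const Reg &B, unsigned M0,
                               unsigned M1) {
  const uint64_t Elts[4] = {A[1], A[0], B[1], B[0]}; // LE element order
  const uint64_t E0 = Elts[M0 & 3];
  const uint64_t E1 = Elts[M1 & 3];
  return {E1, E0}; // convert back to big-endian register numbering
}

// The little-endian mask from the review: index complemented, sources swapped.
inline Reg xxpermdiLittleEndian(const Reg &A, const Reg &B, unsigned Index) {
  const unsigned ElemIdx0 = (~Index & 1) + 2;
  const unsigned ElemIdx1 = (~Index & 2) >> 1;
  return shuffleLittleEndian(A, B, ElemIdx0, ElemIdx1);
}
```

Under these assumptions, the little-endian mask produces the same register contents as the reference for all four index values, which is the property the suggested source comment is meant to document.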

inouehrs added inline comments. May 19 2017, 2:16 AM

test/CodeGen/builtins-ppc-error.c
23 ↗	(On Diff #99292)	I am not sure we can guarantee that clang always does constant propagation to resolve `index` as a compile-time constant. But it seems that an existing test case above already assumes clang does. IMO, `const unsigned index = 5;` is a little better.

Address more comments from Nemanja and Hiroshi.

lib/Sema/SemaChecking.cpp
3900 ↗	(On Diff #98484)	Changed to just check that it is a compile-time constant, without checking the range.
test/CodeGen/builtins-ppc-error.c
23 ↗	(On Diff #99292)	Hi Hiroshi, the index is used as a non-constant variable test input to test the diagnostic message. We want it to be a variable here. But I guess I can leave it uninitialized to be clear.

jtony marked 2 inline comments as done. May 23 2017, 11:19 AM

inouehrs added inline comments. May 23 2017, 11:37 AM

test/CodeGen/builtins-ppc-error.c
23 ↗	(On Diff #99292)	Thank you for the clarification. Comment says expected error. Sorry for confusing you.

Couple of small nits and a request to make some of the change separately, but otherwise LGTM. For the split part please don't actually submit another patch, just go ahead and do it :)

Thanks!

-eric

include/clang/Basic/DiagnosticSemaKinds.td
8007 ↗	(On Diff #99292)	Spaces after commas please.
8014–8017 ↗	(On Diff #99955)	The name change can be done separately.

jtony updated this revision to Diff 99966. May 23 2017, 11:59 AM

seanbruno resigned from this revision. May 23 2017, 12:04 PM

jtony marked 2 inline comments as done. May 23 2017, 12:52 PM

Much like Eric's comments, mine shouldn't hold up approval. Feel free to address them on the commit.

LGTM.

lib/Sema/SemaChecking.cpp
3900 ↗	(On Diff #99966)	This statement doesn't belong in this comment. The Sema check is just that the third argument is a compile-time constant. There's no need to mention `0-3` or `last two bits` here. Just state what you're checking, not the details of what you're currently using it for.
3905 ↗	(On Diff #99966)	All the statements in the comments as well as the code itself only handle functions that have exactly 3 parameters, the first two of which are vectors and the last an integral constant expression. I really don't see the need for the `NumArgs` parameter here. It would indeed be weird if someone down the line called it with something like: `SemaBuiltinVSX(TheCall, 1);` or `SemaBuiltinVSX(TheCall, 5);`

This revision is now accepted and ready to land. May 23 2017, 2:57 PM

Closed by commit rL303760: [PowerPC] Implement vec_xxpermdi builtin. (authored by jtony). May 24 2017, 8:13 AM

This revision was automatically updated to reflect the committed changes.

jtony marked 2 inline comments as done.

Diff 100093

cfe/trunk/include/clang/Basic/BuiltinsPPC.def

 // Vector Test Data Class builtins
 BUILTIN(__builtin_vsx_xvtstdcdp, "V2ULLiV2dIi", "")
 BUILTIN(__builtin_vsx_xvtstdcsp, "V4UiV4fIi", "")

 BUILTIN(__builtin_vsx_insertword, "V16UcV4UiV16UcIi", "")
 BUILTIN(__builtin_vsx_extractuword, "V2ULLiV16UcIi", "")

+BUILTIN(__builtin_vsx_xxpermdi, "v.", "t")

 // HTM builtins
 BUILTIN(__builtin_tbegin, "UiUIi", "")
 BUILTIN(__builtin_tend, "UiUIi", "")

 BUILTIN(__builtin_tabort, "UiUi", "")
 BUILTIN(__builtin_tabortdc, "UiUiUiUi", "")
 BUILTIN(__builtin_tabortdci, "UiUiUii", "")
 BUILTIN(__builtin_tabortwc, "UiUiUiUi", "")

cfe/trunk/include/clang/Basic/DiagnosticSemaKinds.td

 def err_block_on_nonlocal : Error<
   "__block attribute not allowed, only allowed on local variables">;
 def err_block_on_vm : Error<
   "__block attribute not allowed on declaration with a variably modified type">;

 def err_vec_builtin_non_vector : Error<
   "first two arguments to %0 must be vectors">;
 def err_vec_builtin_incompatible_vector : Error<
   "first two arguments to %0 must have the same type">;
+def err_vsx_builtin_nonconstant_argument : Error<
+  "argument %0 to %1 must be a 2-bit unsigned literal (i.e. 0, 1, 2 or 3)">;

 def err_shufflevector_nonconstant_argument : Error<
   "index for __builtin_shufflevector must be a constant integer">;
 def err_shufflevector_argument_too_large : Error<
   "index for __builtin_shufflevector must be less than the total number "
   "of vector elements">;

 def err_convertvector_non_vector : Error<
   "first argument to __builtin_convertvector must be a vector">;

cfe/trunk/include/clang/Sema/Sema.h

 private:
   bool CheckX86BuiltinGatherScatterScale(unsigned BuiltinID, CallExpr *TheCall);
   bool CheckX86BuiltinFunctionCall(unsigned BuiltinID, CallExpr *TheCall);
   bool CheckPPCBuiltinFunctionCall(unsigned BuiltinID, CallExpr *TheCall);

   bool SemaBuiltinVAStart(unsigned BuiltinID, CallExpr *TheCall);
   bool SemaBuiltinVAStartARM(CallExpr *Call);
   bool SemaBuiltinUnorderedCompare(CallExpr *TheCall);
   bool SemaBuiltinFPClassification(CallExpr *TheCall, unsigned NumArgs);
+  bool SemaBuiltinVSX(CallExpr *TheCall);
   bool SemaBuiltinOSLogFormat(CallExpr *TheCall);

 public:
   // Used by C++ template instantiation.
   ExprResult SemaBuiltinShuffleVector(CallExpr *TheCall);
   ExprResult SemaConvertVectorExpr(Expr *E, TypeSourceInfo *TInfo,
                                    SourceLocation BuiltinLoc,
                                    SourceLocation RParenLoc);

cfe/trunk/lib/CodeGen/CGBuiltin.cpp

     if (getTarget().isLittleEndian()) {
 ...
       Value *ShuffleCall = Builder.CreateShuffleVector(Call, Call, ShuffleMask);
       return ShuffleCall;
     } else {
       Ops[1] = ConstantInt::getSigned(Int32Ty, Index);
       return Builder.CreateCall(F, Ops);
     }
   }

+  case PPC::BI__builtin_vsx_xxpermdi: {
+    ConstantInt *ArgCI = dyn_cast<ConstantInt>(Ops[2]);
+    assert(ArgCI && "Third arg must be constant integer!");
+
+    unsigned Index = ArgCI->getZExtValue();
+    Ops[0] = Builder.CreateBitCast(Ops[0], llvm::VectorType::get(Int64Ty, 2));
+    Ops[1] = Builder.CreateBitCast(Ops[1], llvm::VectorType::get(Int64Ty, 2));
+
+    // Element zero comes from the first input vector and element one comes from
+    // the second. The element indices within each vector are numbered in big
+    // endian order so the shuffle mask must be adjusted for this on little
+    // endian platforms (i.e. index is complemented and source vector reversed).
+    unsigned ElemIdx0;
+    unsigned ElemIdx1;
+    if (getTarget().isLittleEndian()) {
+      ElemIdx0 = (~Index & 1) + 2;
+      ElemIdx1 = (~Index & 2) >> 1;
+    } else { // BigEndian
+      ElemIdx0 = (Index & 2) >> 1;
+      ElemIdx1 = 2 + (Index & 1);
+    }
+
+    Constant *ShuffleElts[2] = {ConstantInt::get(Int32Ty, ElemIdx0),
+                                ConstantInt::get(Int32Ty, ElemIdx1)};
+    Constant *ShuffleMask = llvm::ConstantVector::get(ShuffleElts);
+
+    Value *ShuffleCall =
+        Builder.CreateShuffleVector(Ops[0], Ops[1], ShuffleMask);
+    QualType BIRetType = E->getType();
+    auto RetTy = ConvertType(BIRetType);
+    return Builder.CreateBitCast(ShuffleCall, RetTy);
+  }
   }
 }

 Value *CodeGenFunction::EmitAMDGPUBuiltinExpr(unsigned BuiltinID,
                                               const CallExpr *E) {
   switch (BuiltinID) {
   case AMDGPU::BI__builtin_amdgcn_div_scale:
   case AMDGPU::BI__builtin_amdgcn_div_scalef: {

cfe/trunk/lib/Headers/altivec.h

 static __inline__ void __ATTRS_o_ai vec_vsx_st(vector unsigned char __a,
                                                int __b, unsigned char *__c) {
   __builtin_vsx_stxvw4x((vector int)__a, __b, __c);
 }

 #endif

+#ifdef __VSX__
+#define vec_xxpermdi __builtin_vsx_xxpermdi
+#endif

 /* vec_xor */

 #define __builtin_altivec_vxor vec_xor

 static __inline__ vector signed char __ATTRS_o_ai
 vec_xor(vector signed char __a, vector signed char __b) {
   return __a ^ __b;
 }

cfe/trunk/lib/Sema/SemaChecking.cpp

 bool Sema::CheckPPCBuiltinFunctionCall(unsigned BuiltinID, CallExpr *TheCall) {
 ...
   case PPC::BI__builtin_tend: i = 0; l = 0; u = 1; break;
   case PPC::BI__builtin_tsr: i = 0; l = 0; u = 7; break;
   case PPC::BI__builtin_tabortwc:
   case PPC::BI__builtin_tabortdc: i = 0; l = 0; u = 31; break;
   case PPC::BI__builtin_tabortwci:
   case PPC::BI__builtin_tabortdci:
     return SemaBuiltinConstantArgRange(TheCall, 0, 0, 31) ||
            SemaBuiltinConstantArgRange(TheCall, 2, 0, 31);
+  case PPC::BI__builtin_vsx_xxpermdi:
+    return SemaBuiltinVSX(TheCall);
   }
   return SemaBuiltinConstantArgRange(TheCall, i, l, u);
 }

 bool Sema::CheckSystemZBuiltinFunctionCall(unsigned BuiltinID,
                                            CallExpr *TheCall) {
   if (BuiltinID == SystemZ::BI__builtin_tabort) {
     Expr *Arg = TheCall->getArg(0);
 ...
     if (Cast->getCastKind() == CK_FloatingCast) {
 ...
         TheCall->setArg(NumArgs-1, CastArg);
       }
     }
   }

   return false;
 }

+// Customized Sema Checking for VSX builtins that have the following signature:
+// vector [...] builtinName(vector [...], vector [...], const int);
+// Which takes the same type of vectors (any legal vector type) for the first
+// two arguments and takes compile time constant for the third argument.
+// Example builtins are :
+// vector double vec_xxpermdi(vector double, vector double, int);
+// vector short vec_xxsldwi(vector short, vector short, int);
+bool Sema::SemaBuiltinVSX(CallExpr *TheCall) {
+  unsigned ExpectedNumArgs = 3;
+  if (TheCall->getNumArgs() < ExpectedNumArgs)
+    return Diag(TheCall->getLocEnd(),
+                diag::err_typecheck_call_too_few_args_at_least)
+           << 0 /*function call*/ << ExpectedNumArgs << TheCall->getNumArgs()
+           << TheCall->getSourceRange();
+
+  if (TheCall->getNumArgs() > ExpectedNumArgs)
+    return Diag(TheCall->getLocEnd(),
+                diag::err_typecheck_call_too_many_args_at_most)
+           << 0 /*function call*/ << ExpectedNumArgs << TheCall->getNumArgs()
+           << TheCall->getSourceRange();
+
+  // Check the third argument is a compile time constant
+  llvm::APSInt Value;
+  if(!TheCall->getArg(2)->isIntegerConstantExpr(Value, Context))
+    return Diag(TheCall->getLocStart(),
+                diag::err_vsx_builtin_nonconstant_argument)
+           << 3 /* argument index */ << TheCall->getDirectCallee()
+           << SourceRange(TheCall->getArg(2)->getLocStart(),
+                          TheCall->getArg(2)->getLocEnd());
+
+  QualType Arg1Ty = TheCall->getArg(0)->getType();
+  QualType Arg2Ty = TheCall->getArg(1)->getType();
+
+  // Check the type of argument 1 and argument 2 are vectors.
+  SourceLocation BuiltinLoc = TheCall->getLocStart();
+  if ((!Arg1Ty->isVectorType() && !Arg1Ty->isDependentType()) ||
+      (!Arg2Ty->isVectorType() && !Arg2Ty->isDependentType())) {
+    return Diag(BuiltinLoc, diag::err_vec_builtin_non_vector)
+           << TheCall->getDirectCallee()
+           << SourceRange(TheCall->getArg(0)->getLocStart(),
+                          TheCall->getArg(1)->getLocEnd());
+  }
+
+  // Check the first two arguments are the same type.
+  if (!Context.hasSameUnqualifiedType(Arg1Ty, Arg2Ty)) {
+    return Diag(BuiltinLoc, diag::err_vec_builtin_incompatible_vector)
+           << TheCall->getDirectCallee()
+           << SourceRange(TheCall->getArg(0)->getLocStart(),
+                          TheCall->getArg(1)->getLocEnd());
+  }
+
+  // When default clang type checking is turned off and the customized type
+  // checking is used, the returning type of the function must be explicitly
+  // set. Otherwise it is _Bool by default.
+  TheCall->setType(Arg1Ty);
+
+  return false;
+}
+
 /// SemaBuiltinShuffleVector - Handle __builtin_shufflevector.
 // This is declared to take (...), so we have to check everything.
 ExprResult Sema::SemaBuiltinShuffleVector(CallExpr *TheCall) {
   if (TheCall->getNumArgs() < 2)
     return ExprError(Diag(TheCall->getLocEnd(),
                           diag::err_typecheck_call_too_few_args_at_least)
                      << 0 /*function call*/ << 2 << TheCall->getNumArgs()
                      << TheCall->getSourceRange());

cfe/trunk/test/CodeGen/builtins-ppc-error.c

 // REQUIRES: powerpc-registered-target

 // RUN: %clang_cc1 -target-feature +altivec -target-feature +power9-vector \
 // RUN: -triple powerpc64-unknown-unknown -fsyntax-only \
 // RUN: -Wall -Werror -verify %s

 // RUN: %clang_cc1 -target-feature +altivec -target-feature +power9-vector \
 // RUN: -triple powerpc64le-unknown-unknown -fsyntax-only \
 // RUN: -Wall -Werror -verify %s

 #include <altivec.h>

 extern vector signed int vsi;
 extern vector unsigned char vuc;

-void testInsertWord1(void) {
+void testInsertWord(void) {
   int index = 5;
   vector unsigned char v1 = vec_insert4b(vsi, vuc, index); // expected-error {{argument to '__builtin_vsx_insertword' must be a constant integer}}
   vector unsigned long long v2 = vec_extract4b(vuc, index); // expected-error {{argument to '__builtin_vsx_extractuword' must be a constant integer}}
 }

+void testXXPERMDI(int index) {
+  vec_xxpermdi(vsi); //expected-error {{too few arguments to function call, expected at least 3, have 1}}
+  vec_xxpermdi(vsi, vsi, 2, 4); //expected-error {{too many arguments to function call, expected at most 3, have 4}}
+  vec_xxpermdi(vsi, vsi, index); //expected-error {{argument 3 to '__builtin_vsx_xxpermdi' must be a 2-bit unsigned literal (i.e. 0, 1, 2 or 3)}}
+  vec_xxpermdi(1, 2, 3); //expected-error {{first two arguments to '__builtin_vsx_xxpermdi' must be vectors}}
+  vec_xxpermdi(vsi, vuc, 2); //expected-error {{first two arguments to '__builtin_vsx_xxpermdi' must have the same type}}
+}

cfe/trunk/test/CodeGen/builtins-ppc-vsx.c

 // CHECK-LE: call void @llvm.ppc.vsx.stxvd2x.be(<2 x double> %{{[0-9]+}}, i8* %{{[0-9]+}})

   res_vf = vec_neg(vf);
 // CHECK: fsub <4 x float> <float -0.000000e+00, float -0.000000e+00, float -0.000000e+00, float -0.000000e+00>, {{%[0-9]+}}
 // CHECK-LE: fsub <4 x float> <float -0.000000e+00, float -0.000000e+00, float -0.000000e+00, float -0.000000e+00>, {{%[0-9]+}}

   res_vd = vec_neg(vd);
 // CHECK: fsub <2 x double> <double -0.000000e+00, double -0.000000e+00>, {{%[0-9]+}}
 // CHECK-LE: fsub <2 x double> <double -0.000000e+00, double -0.000000e+00>, {{%[0-9]+}}

+  res_vd = vec_xxpermdi(vd, vd, 0);
+// CHECK: shufflevector <2 x i64> %{{[0-9]+}}, <2 x i64> %{{[0-9]+}}, <2 x i32> <i32 0, i32 2>
+// CHECK-LE: shufflevector <2 x i64> %{{[0-9]+}}, <2 x i64> %{{[0-9]+}}, <2 x i32> <i32 3, i32 1>
+
+  res_vf = vec_xxpermdi(vf, vf, 1);
+// CHECK: shufflevector <2 x i64> %{{[0-9]+}}, <2 x i64> %{{[0-9]+}}, <2 x i32> <i32 0, i32 3>
+// CHECK-LE: shufflevector <2 x i64> %{{[0-9]+}}, <2 x i64> %{{[0-9]+}}, <2 x i32> <i32 2, i32 1>
+
+  res_vsll = vec_xxpermdi(vsll, vsll, 2);
+// CHECK: shufflevector <2 x i64> %{{[0-9]+}}, <2 x i64> %{{[0-9]+}}, <2 x i32> <i32 1, i32 2>
+// CHECK-LE: shufflevector <2 x i64> %{{[0-9]+}}, <2 x i64> %{{[0-9]+}}, <2 x i32> <i32 3, i32 0>
+
+  res_vull = vec_xxpermdi(vull, vull, 3);
+// CHECK: shufflevector <2 x i64> %{{[0-9]+}}, <2 x i64> %{{[0-9]+}}, <2 x i32> <i32 1, i32 3>
+// CHECK-LE: shufflevector <2 x i64> %{{[0-9]+}}, <2 x i64> %{{[0-9]+}}, <2 x i32> <i32 2, i32 0>
+
+  res_vsi = vec_xxpermdi(vsi, vsi, 0);
+// CHECK: shufflevector <2 x i64> %{{[0-9]+}}, <2 x i64> %{{[0-9]+}}, <2 x i32> <i32 0, i32 2>
+// CHECK-LE: shufflevector <2 x i64> %{{[0-9]+}}, <2 x i64> %{{[0-9]+}}, <2 x i32> <i32 3, i32 1>
+
+  res_vui = vec_xxpermdi(vui, vui, 1);
+// CHECK: shufflevector <2 x i64> %{{[0-9]+}}, <2 x i64> %{{[0-9]+}}, <2 x i32> <i32 0, i32 3>
+// CHECK-LE: shufflevector <2 x i64> %{{[0-9]+}}, <2 x i64> %{{[0-9]+}}, <2 x i32> <i32 2, i32 1>
+
+  res_vss = vec_xxpermdi(vss, vss, 2);
+// CHECK: shufflevector <2 x i64> %{{[0-9]+}}, <2 x i64> %{{[0-9]+}}, <2 x i32> <i32 1, i32 2>
+// CHECK-LE: shufflevector <2 x i64> %{{[0-9]+}}, <2 x i64> %{{[0-9]+}}, <2 x i32> <i32 3, i32 0>
+
+  res_vus = vec_xxpermdi(vus, vus, 3);
+// CHECK: shufflevector <2 x i64> %{{[0-9]+}}, <2 x i64> %{{[0-9]+}}, <2 x i32> <i32 1, i32 3>
+// CHECK-LE: shufflevector <2 x i64> %{{[0-9]+}}, <2 x i64> %{{[0-9]+}}, <2 x i32> <i32 2, i32 0>
+
+  res_vsc = vec_xxpermdi(vsc, vsc, 0);
+// CHECK: shufflevector <2 x i64> %{{[0-9]+}}, <2 x i64> %{{[0-9]+}}, <2 x i32> <i32 0, i32 2>
+// CHECK-LE: shufflevector <2 x i64> %{{[0-9]+}}, <2 x i64> %{{[0-9]+}}, <2 x i32> <i32 3, i32 1>
+
+  res_vuc = vec_xxpermdi(vuc, vuc, 1);
+// CHECK: shufflevector <2 x i64> %{{[0-9]+}}, <2 x i64> %{{[0-9]+}}, <2 x i32> <i32 0, i32 3>
+// CHECK-LE: shufflevector <2 x i64> %{{[0-9]+}}, <2 x i64> %{{[0-9]+}}, <2 x i32> <i32 2, i32 1>
+}
+
+// The return type of the call expression may be different from the return type of the shufflevector.
+// Wrong implementation could crash the compiler, add this test case to check that and avoid ICE.
+vector int xxpermdi_should_not_assert(vector int a, vector int b) {
+  return vec_xxpermdi(a, b, 0);
+// CHECK-LABEL: xxpermdi_should_not_assert
+// CHECK: bitcast <4 x i32> %{{[0-9]+}} to <2 x i64>
+// CHECK-NEXT: bitcast <4 x i32> %{{[0-9]+}} to <2 x i64>
+// CHECK-NEXT: shufflevector <2 x i64> %{{[0-9]+}}, <2 x i64> %{{[0-9]+}}, <2 x i32> <i32 0, i32 2>
+// CHECK-NEXT: bitcast <2 x i64> %{{[0-9]+}} to <4 x i32>
+
+// CHECK-LE: bitcast <4 x i32> %{{[0-9]+}} to <2 x i64>
+// CHECK-LE-NEXT: bitcast <4 x i32> %{{[0-9]+}} to <2 x i64>
+// CHECK-LE-NEXT: shufflevector <2 x i64> %{{[0-9]+}}, <2 x i64> %{{[0-9]+}}, <2 x i32> <i32 3, i32 1>
+// CHECK-LE-NEXT: bitcast <2 x i64> %{{[0-9]+}} to <4 x i32>
 }

This is an archive of the discontinued LLVM Phabricator instance.

[PowerPC] Implement vec_xxpermdi builtin.
ClosedPublic
