This is an archive of the discontinued LLVM Phabricator instance.

Target Power9 bit counting and vector comparison instructions through builtins (front end portion)
ClosedPublic

Authored by nemanjai on Sep 9 2016, 7:43 AM.

Download Raw Diff

Details

Reviewers

wschmidt
kbarton
amehsan
seurer
hfinkel

Summary

This patch introduces the following builtins:
unsigned int vec_first_match_index (vector signed char, vector signed char);
unsigned int vec_first_match_index (vector unsigned char, vector unsigned char);
unsigned int vec_first_match_index (vector signed int, vector signed int);
unsigned int vec_first_match_index (vector unsigned int, vector unsigned int);
unsigned int vec_first_match_index (vector signed short, vector signed short);
unsigned int vec_first_match_index (vector unsigned short, vector unsigned short);
unsigned int vec_first_match_or_eos_index (vector signed char, vector signed char);
unsigned int vec_first_match_or_eos_index (vector unsigned char, vector unsigned char);
unsigned int vec_first_match_or_eos_index (vector signed int, vector signed int);
unsigned int vec_first_match_or_eos_index (vector unsigned int, vector unsigned int);
unsigned int vec_first_match_or_eos_index (vector signed short, vector signed short);
unsigned int vec_first_match_or_eos_index (vector unsigned short, vector unsigned short);
unsigned int vec_first_mismatch_index (vector signed char, vector signed char);
unsigned int vec_first_mismatch_index (vector unsigned char, vector unsigned char);
unsigned int vec_first_mismatch_index (vector signed int, vector signed int);
unsigned int vec_first_mismatch_index (vector unsigned int, vector unsigned int);
unsigned int vec_first_mismatch_index (vector signed short, vector signed short);
unsigned int vec_first_mismatch_index (vector unsigned short, vector unsigned short);
unsigned int vec_first_mismatch_or_eos_index (vector signed char, vector signed char);
unsigned int vec_first_mismatch_or_eos_index (vector unsigned char, vector unsigned char);
unsigned int vec_first_mismatch_or_eos_index (vector signed int, vector signed int);
unsigned int vec_first_mismatch_or_eos_index (vector unsigned int, vector unsigned int);
unsigned int vec_first_mismatch_or_eos_index (vector signed short, vector signed short);
unsigned int vec_first_mismatch_or_eos_index (vector unsigned short, vector unsigned short);
vector bool char vec_cmpne (vector bool char, vector bool char);
vector bool char vec_cmpne (vector signed char, vector signed char);
vector bool char vec_cmpne (vector unsigned char, vector unsigned char);
vector bool int vec_cmpne (vector bool int, vector bool int);
vector bool int vec_cmpne (vector signed int, vector signed int);
vector bool int vec_cmpne (vector unsigned int, vector unsigned int);
vector bool long long vec_cmpne (vector bool long long, vector bool long long);
vector bool long long vec_cmpne (vector signed long long, vector signed long long);
vector bool long long vec_cmpne (vector unsigned long long, vector unsigned long long);
vector bool short vec_cmpne (vector bool short, vector bool short);
vector bool short vec_cmpne (vector signed short, vector signed short);
vector bool short vec_cmpne (vector unsigned short, vector unsigned short);
vector bool long long vec_cmpne (vector double, vector double);
vector bool int vec_cmpne (vector float, vector float);
vector signed char vec_cnttz (vector signed char);
vector unsigned char vec_cnttz (vector unsigned char);
vector signed int vec_cnttz (vector signed int);
vector unsigned int vec_cnttz (vector unsigned int);
vector signed long long vec_cnttz (vector signed long long);
vector unsigned long long vec_cnttz (vector unsigned long long);
vector signed short vec_cnttz (vector signed short);
vector unsigned short vec_cnttz (vector unsigned short);
vector unsigned char vec_popcnt (vector signed char);
vector unsigned char vec_popcnt (vector unsigned char);
vector unsigned int vec_popcnt (vector signed int);
vector unsigned int vec_popcnt (vector unsigned int);
vector unsigned long long vec_popcnt (vector signed long long);
vector unsigned long long vec_popcnt (vector unsigned long long);
vector unsigned short vec_popcnt (vector signed short);
vector unsigned short vec_popcnt (vector unsigned short);

Diff Detail

Repository: rL LLVM

Event Timeline

nemanjai updated this revision to Diff 70835.Sep 9 2016, 7:43 AM

nemanjai retitled this revision from to Target Power9 bit counting and vector comparison instructions through builtins (front end portion).

nemanjai updated this object.

nemanjai added reviewers: hfinkel, kbarton, wschmidt, amehsan, seurer.

nemanjai set the repository for this revision to rL LLVM.

nemanjai added subscribers: cfe-commits, echristo.

Herald added a subscriber: nemanjai. · View Herald TranscriptSep 9 2016, 7:43 AM

Looking over the patch, I realized that I forgot to add a test case for the POWER9_VECTOR macro and the builtins that target the record forms of the instructions. I'll add those on the next revision along with addressing any review comments.

nemanjai added a parent revision: D24396: Target Power9 bit counting and vector comparison instructions through builtins (backend portion).Sep 9 2016, 8:03 AM

Aside from one minor comment, this LGTM.

lib/Basic/Targets.cpp
1364	Please update this comment to also include float128 and power9-vector.

kbarton accepted this revision.Sep 22 2016, 12:27 PM

kbarton edited edge metadata.

This revision is now accepted and ready to land.Sep 22 2016, 12:27 PM

Committed revision 282481.

This test/CodeGen/builtins-ppc-p9vector.c test will fail together with this upcoming LLVM patch https://reviews.llvm.org/D24955

Problem is that lots of your

add i64 {{.*}}, 64

checks will fails since the improved analysis will find out that the add has the "nsw" "nuw" properties.

I'm not so familiar with the regexps used by FileCheck, but somehow we need to (also) allow

add nsw nuw i64 {{.*}}, 64

in the checks to make it more future proof.

In D24397#555057, @bjope wrote:
This test/CodeGen/builtins-ppc-p9vector.c test will fail together with this upcoming LLVM patch https://reviews.llvm.org/D24955

Problem is that lots of your
add i64 {{.*}}, 64
checks will fails since the improved analysis will find out that the add has the "nsw" "nuw" properties.

I'm not so familiar with the regexps used by FileCheck, but somehow we need to (also) allow
add nsw nuw i64 {{.*}}, 64
in the checks to make it more future proof.

I can change the patterns that check for the add instructions to the following:
// CHECK: add {{[nsuw ]*}}i64 {{.*}}, 64

That will pass with:
add nsw i64
add nuw i64
add nsw nuw i64
...

Basically if all that is found between the "add" and "i64" is any combination of the letters "nsuw" and space, it will pass. As far as I'm concerned, ensuring that the strings there are well formed is irrelevant - all I'm testing is that an add instruction is emitted that adds the constant 64.

I can make the change and check it in if you're in agreement.

In D24397#555121, @nemanjai wrote:
In D24397#555057, @bjope wrote:
This test/CodeGen/builtins-ppc-p9vector.c test will fail together with this upcoming LLVM patch https://reviews.llvm.org/D24955

Problem is that lots of your
add i64 {{.*}}, 64
checks will fails since the improved analysis will find out that the add has the "nsw" "nuw" properties.

I'm not so familiar with the regexps used by FileCheck, but somehow we need to (also) allow
add nsw nuw i64 {{.*}}, 64
in the checks to make it more future proof.
I can change the patterns that check for the add instructions to the following:
// CHECK: add {{[nsuw ]*}}i64 {{.*}}, 64

That will pass with:
add nsw i64
add nuw i64
add nsw nuw i64
...

Basically if all that is found between the "add" and "i64" is any combination of the letters "nsuw" and space, it will pass. As far as I'm concerned, ensuring that the strings there are well formed is irrelevant - all I'm testing is that an add instruction is emitted that adds the constant 64.

I can make the change and check it in if you're in agreement.

Solution sounds good to me!

And it would be very helpful if you do that. D24955 will be my first patch contributing to llvm :-)

Having a clang regression/unit test that depends on optimizer behavior is generally viewed as wrong. Can the tests be split into front-end (clang) tests and separate tests for the IR optimizer? Both x86 and AArch64 have done something like that in the last few months for testing of builtins/intrinsics.

In D24397#555470, @spatel wrote:

Having a clang regression/unit test that depends on optimizer behavior is generally viewed as wrong. Can the tests be split into front-end (clang) tests and separate tests for the IR optimizer? Both x86 and AArch64 have done something like that in the last few months for testing of builtins/intrinsics.

Yeah, that sounds reasonable. I'll remove the -O2 from the test case and remove the checks for the select instructions. That's really the only major difference. So am I to understand the nsw/nuw flags will not be added without -O2 and the aforementioned changes will suffice?

In D24397#555552, @nemanjai wrote:

In D24397#555470, @spatel wrote:

Having a clang regression/unit test that depends on optimizer behavior is generally viewed as wrong. Can the tests be split into front-end (clang) tests and separate tests for the IR optimizer? Both x86 and AArch64 have done something like that in the last few months for testing of builtins/intrinsics.

Yeah, that sounds reasonable. I'll remove the -O2 from the test case and remove the checks for the select instructions. That's really the only major difference. So am I to understand the nsw/nuw flags will not be added without -O2 and the aforementioned changes will suffice?

Changing to -O0 or using -disable-llvm-optzns should keep the clang tests from breaking due to underlying changes in the IR optimizer. That may lead to a lot of bloat though. In http://lists.llvm.org/pipermail/cfe-commits/Week-of-Mon-20160307/152324.html , it was viewed as ok, if not ideal, to pipe the clang IR output using "opt -S -mem2reg".

Note that clang itself uses APIs like IRBuilder::CreateNUWSub(), so I think it's possible to see no-wrap IR even without the IR optimizer kicking in (but probably isn't a concern in this case?).

Should also mention:
https://reviews.llvm.org/D17999
has scripts attached that could make this kind of test generation a lot easier. :)

echristo added inline comments.Sep 28 2016, 2:23 PM

lib/Basic/Targets.cpp
1393	Can you combine this with the power8 check above?
1500	This is starting to look a bit complicated here as I think you're now enabling power9 vector on power8? :\
test/CodeGen/builtins-ppc-p9vector.c
2	Please no code generation checks in clang. I doubt you really need to even pipe this through opt unless you feel that alloca issues are going to cause problems. I'd prefer just to check unoptimized IR.

What is the progress about getting rid of these code generation checks?

(I'm still hesitating about commiting D24955 in llvm since that would make these clang tests fail...)

In D24397#562469, @bjope wrote:

(I'm still hesitating about commiting D24955 in llvm since that would make these clang tests fail...)

You can't do that. Bots will send you fail mail all day as they choke on the clang tests - speaking from experience. :)
We either need to fix or revert this commit in order to let D24955 proceed.

Revision Contents

Path

Size

include/

clang/

Basic/

BuiltinsPPC.def

23 lines

Driver/

Options.td

4 lines

lib/

Basic/

Targets.cpp

30 lines

CodeGen/

CGBuiltin.cpp

19 lines

Headers/

altivec.h

581 lines

test/

CodeGen/

builtins-ppc-p9vector.c

748 lines

Diff 70835

include/clang/Basic/BuiltinsPPC.def

	Show First 20 Lines • Show All 128 Lines • ▼ Show 20 Lines
	BUILTIN(__builtin_altivec_vcmpgefp, "V4iV4fV4f", "")			BUILTIN(__builtin_altivec_vcmpgefp, "V4iV4fV4f", "")

	BUILTIN(__builtin_altivec_vcmpequb, "V16cV16cV16c", "")			BUILTIN(__builtin_altivec_vcmpequb, "V16cV16cV16c", "")
	BUILTIN(__builtin_altivec_vcmpequh, "V8sV8sV8s", "")			BUILTIN(__builtin_altivec_vcmpequh, "V8sV8sV8s", "")
	BUILTIN(__builtin_altivec_vcmpequw, "V4iV4iV4i", "")			BUILTIN(__builtin_altivec_vcmpequw, "V4iV4iV4i", "")
	BUILTIN(__builtin_altivec_vcmpequd, "V2LLiV2LLiV2LLi", "")			BUILTIN(__builtin_altivec_vcmpequd, "V2LLiV2LLiV2LLi", "")
	BUILTIN(__builtin_altivec_vcmpeqfp, "V4iV4fV4f", "")			BUILTIN(__builtin_altivec_vcmpeqfp, "V4iV4fV4f", "")

				BUILTIN(__builtin_altivec_vcmpneb, "V16cV16cV16c", "")
				BUILTIN(__builtin_altivec_vcmpneh, "V8sV8sV8s", "")
				BUILTIN(__builtin_altivec_vcmpnew, "V4iV4iV4i", "")

				BUILTIN(__builtin_altivec_vcmpnezb, "V16cV16cV16c", "")
				BUILTIN(__builtin_altivec_vcmpnezh, "V8sV8sV8s", "")
				BUILTIN(__builtin_altivec_vcmpnezw, "V4iV4iV4i", "")

	BUILTIN(__builtin_altivec_vcmpgtsb, "V16cV16ScV16Sc", "")			BUILTIN(__builtin_altivec_vcmpgtsb, "V16cV16ScV16Sc", "")
	BUILTIN(__builtin_altivec_vcmpgtub, "V16cV16UcV16Uc", "")			BUILTIN(__builtin_altivec_vcmpgtub, "V16cV16UcV16Uc", "")
	BUILTIN(__builtin_altivec_vcmpgtsh, "V8sV8SsV8Ss", "")			BUILTIN(__builtin_altivec_vcmpgtsh, "V8sV8SsV8Ss", "")
	BUILTIN(__builtin_altivec_vcmpgtuh, "V8sV8UsV8Us", "")			BUILTIN(__builtin_altivec_vcmpgtuh, "V8sV8UsV8Us", "")
	BUILTIN(__builtin_altivec_vcmpgtsw, "V4iV4SiV4Si", "")			BUILTIN(__builtin_altivec_vcmpgtsw, "V4iV4SiV4Si", "")
	BUILTIN(__builtin_altivec_vcmpgtuw, "V4iV4UiV4Ui", "")			BUILTIN(__builtin_altivec_vcmpgtuw, "V4iV4UiV4Ui", "")
	BUILTIN(__builtin_altivec_vcmpgtsd, "V2LLiV2LLiV2LLi", "")			BUILTIN(__builtin_altivec_vcmpgtsd, "V2LLiV2LLiV2LLi", "")
	BUILTIN(__builtin_altivec_vcmpgtud, "V2LLiV2ULLiV2ULLi", "")			BUILTIN(__builtin_altivec_vcmpgtud, "V2LLiV2ULLiV2ULLi", "")
	▲ Show 20 Lines • Show All 73 Lines • ▼ Show 20 Lines
	BUILTIN(__builtin_altivec_vcmpgefp_p, "iiV4fV4f", "")			BUILTIN(__builtin_altivec_vcmpgefp_p, "iiV4fV4f", "")

	BUILTIN(__builtin_altivec_vcmpequb_p, "iiV16cV16c", "")			BUILTIN(__builtin_altivec_vcmpequb_p, "iiV16cV16c", "")
	BUILTIN(__builtin_altivec_vcmpequh_p, "iiV8sV8s", "")			BUILTIN(__builtin_altivec_vcmpequh_p, "iiV8sV8s", "")
	BUILTIN(__builtin_altivec_vcmpequw_p, "iiV4iV4i", "")			BUILTIN(__builtin_altivec_vcmpequw_p, "iiV4iV4i", "")
	BUILTIN(__builtin_altivec_vcmpequd_p, "iiV2LLiV2LLi", "")			BUILTIN(__builtin_altivec_vcmpequd_p, "iiV2LLiV2LLi", "")
	BUILTIN(__builtin_altivec_vcmpeqfp_p, "iiV4fV4f", "")			BUILTIN(__builtin_altivec_vcmpeqfp_p, "iiV4fV4f", "")

				BUILTIN(__builtin_altivec_vcmpneb_p, "iiV16cV16c", "")
				BUILTIN(__builtin_altivec_vcmpneh_p, "iiV8sV8s", "")
				BUILTIN(__builtin_altivec_vcmpnew_p, "iiV4iV4i", "")
				BUILTIN(__builtin_altivec_vcmpned_p, "iiV2LLiV2LLi", "")

	BUILTIN(__builtin_altivec_vcmpgtsb_p, "iiV16ScV16Sc", "")			BUILTIN(__builtin_altivec_vcmpgtsb_p, "iiV16ScV16Sc", "")
	BUILTIN(__builtin_altivec_vcmpgtub_p, "iiV16UcV16Uc", "")			BUILTIN(__builtin_altivec_vcmpgtub_p, "iiV16UcV16Uc", "")
	BUILTIN(__builtin_altivec_vcmpgtsh_p, "iiV8SsV8Ss", "")			BUILTIN(__builtin_altivec_vcmpgtsh_p, "iiV8SsV8Ss", "")
	BUILTIN(__builtin_altivec_vcmpgtuh_p, "iiV8UsV8Us", "")			BUILTIN(__builtin_altivec_vcmpgtuh_p, "iiV8UsV8Us", "")
	BUILTIN(__builtin_altivec_vcmpgtsw_p, "iiV4SiV4Si", "")			BUILTIN(__builtin_altivec_vcmpgtsw_p, "iiV4SiV4Si", "")
	BUILTIN(__builtin_altivec_vcmpgtuw_p, "iiV4UiV4Ui", "")			BUILTIN(__builtin_altivec_vcmpgtuw_p, "iiV4UiV4Ui", "")
	BUILTIN(__builtin_altivec_vcmpgtsd_p, "iiV2LLiV2LLi", "")			BUILTIN(__builtin_altivec_vcmpgtsd_p, "iiV2LLiV2LLi", "")
	BUILTIN(__builtin_altivec_vcmpgtud_p, "iiV2ULLiV2ULLi", "")			BUILTIN(__builtin_altivec_vcmpgtud_p, "iiV2ULLiV2ULLi", "")
	Show All 15 Lines
	BUILTIN(__builtin_altivec_crypto_vpmsumh, "V8UsV8UsV8Us", "")			BUILTIN(__builtin_altivec_crypto_vpmsumh, "V8UsV8UsV8Us", "")
	BUILTIN(__builtin_altivec_crypto_vpmsumw, "V4UiV4UiV4Ui", "")			BUILTIN(__builtin_altivec_crypto_vpmsumw, "V4UiV4UiV4Ui", "")
	BUILTIN(__builtin_altivec_crypto_vpmsumd, "V2ULLiV2ULLiV2ULLi", "")			BUILTIN(__builtin_altivec_crypto_vpmsumd, "V2ULLiV2ULLiV2ULLi", "")

	BUILTIN(__builtin_altivec_vclzb, "V16UcV16Uc", "")			BUILTIN(__builtin_altivec_vclzb, "V16UcV16Uc", "")
	BUILTIN(__builtin_altivec_vclzh, "V8UsV8Us", "")			BUILTIN(__builtin_altivec_vclzh, "V8UsV8Us", "")
	BUILTIN(__builtin_altivec_vclzw, "V4UiV4Ui", "")			BUILTIN(__builtin_altivec_vclzw, "V4UiV4Ui", "")
	BUILTIN(__builtin_altivec_vclzd, "V2ULLiV2ULLi", "")			BUILTIN(__builtin_altivec_vclzd, "V2ULLiV2ULLi", "")
				BUILTIN(__builtin_altivec_vctzb, "V16UcV16Uc", "")
				BUILTIN(__builtin_altivec_vctzh, "V8UsV8Us", "")
				BUILTIN(__builtin_altivec_vctzw, "V4UiV4Ui", "")
				BUILTIN(__builtin_altivec_vctzd, "V2ULLiV2ULLi", "")

				// Vector population count built-ins
				BUILTIN(__builtin_altivec_vpopcntb, "V16UcV16Uc", "")
				BUILTIN(__builtin_altivec_vpopcnth, "V8UsV8Us", "")
				BUILTIN(__builtin_altivec_vpopcntw, "V4UiV4Ui", "")
				BUILTIN(__builtin_altivec_vpopcntd, "V2ULLiV2ULLi", "")

	// VSX built-ins.			// VSX built-ins.

	BUILTIN(__builtin_vsx_lxvd2x, "V2divC*", "")			BUILTIN(__builtin_vsx_lxvd2x, "V2divC*", "")
	BUILTIN(__builtin_vsx_lxvw4x, "V4iivC*", "")			BUILTIN(__builtin_vsx_lxvw4x, "V4iivC*", "")

	BUILTIN(__builtin_vsx_stxvd2x, "vV2div*", "")			BUILTIN(__builtin_vsx_stxvd2x, "vV2div*", "")
	BUILTIN(__builtin_vsx_stxvw4x, "vV4iiv*", "")			BUILTIN(__builtin_vsx_stxvw4x, "vV4iiv*", "")
	▲ Show 20 Lines • Show All 118 Lines • Show Last 20 Lines

include/clang/Driver/Options.td

Show First 20 Lines • Show All 1,558 Lines • ▼ Show 20 Lines	def mamdgpu_debugger_abi : Joined<["-"], "mamdgpu-debugger-abi=">,
MetaVarName<"<version>">;		MetaVarName<"<version>">;

def mvsx : Flag<["-"], "mvsx">, Group<m_ppc_Features_Group>;		def mvsx : Flag<["-"], "mvsx">, Group<m_ppc_Features_Group>;
def mno_vsx : Flag<["-"], "mno-vsx">, Group<m_ppc_Features_Group>;		def mno_vsx : Flag<["-"], "mno-vsx">, Group<m_ppc_Features_Group>;
def mpower8_vector : Flag<["-"], "mpower8-vector">,		def mpower8_vector : Flag<["-"], "mpower8-vector">,
Group<m_ppc_Features_Group>;		Group<m_ppc_Features_Group>;
def mno_power8_vector : Flag<["-"], "mno-power8-vector">,		def mno_power8_vector : Flag<["-"], "mno-power8-vector">,
Group<m_ppc_Features_Group>;		Group<m_ppc_Features_Group>;
		def mpower9_vector : Flag<["-"], "mpower9-vector">,
		Group<m_ppc_Features_Group>;
		def mno_power9_vector : Flag<["-"], "mno-power9-vector">,
		Group<m_ppc_Features_Group>;
def mpower8_crypto : Flag<["-"], "mcrypto">,		def mpower8_crypto : Flag<["-"], "mcrypto">,
Group<m_ppc_Features_Group>;		Group<m_ppc_Features_Group>;
def mnopower8_crypto : Flag<["-"], "mno-crypto">,		def mnopower8_crypto : Flag<["-"], "mno-crypto">,
Group<m_ppc_Features_Group>;		Group<m_ppc_Features_Group>;
def mdirect_move : Flag<["-"], "mdirect-move">,		def mdirect_move : Flag<["-"], "mdirect-move">,
Group<m_ppc_Features_Group>;		Group<m_ppc_Features_Group>;
def mnodirect_move : Flag<["-"], "mno-direct-move">,		def mnodirect_move : Flag<["-"], "mno-direct-move">,
Group<m_ppc_Features_Group>;		Group<m_ppc_Features_Group>;
▲ Show 20 Lines • Show All 724 Lines • Show Last 20 Lines

lib/Basic/Targets.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 864 Lines • ▼ Show 20 Lines	class PPCTargetInfo : public TargetInfo {
bool HasVSX;		bool HasVSX;
bool HasP8Vector;		bool HasP8Vector;
bool HasP8Crypto;		bool HasP8Crypto;
bool HasDirectMove;		bool HasDirectMove;
bool HasQPX;		bool HasQPX;
bool HasHTM;		bool HasHTM;
bool HasBPERMD;		bool HasBPERMD;
bool HasExtDiv;		bool HasExtDiv;
		bool HasP9Vector;

protected:		protected:
std::string ABI;		std::string ABI;

public:		public:
PPCTargetInfo(const llvm::Triple &Triple, const TargetOptions &)		PPCTargetInfo(const llvm::Triple &Triple, const TargetOptions &)
: TargetInfo(Triple), HasVSX(false), HasP8Vector(false),		: TargetInfo(Triple), HasVSX(false), HasP8Vector(false),
HasP8Crypto(false), HasDirectMove(false), HasQPX(false), HasHTM(false),		HasP8Crypto(false), HasDirectMove(false), HasQPX(false), HasHTM(false),
HasBPERMD(false), HasExtDiv(false) {		HasBPERMD(false), HasExtDiv(false), HasP9Vector(false) {
BigEndian = (Triple.getArch() != llvm::Triple::ppc64le);		BigEndian = (Triple.getArch() != llvm::Triple::ppc64le);
SimdDefaultAlign = 128;		SimdDefaultAlign = 128;
LongDoubleWidth = LongDoubleAlign = 128;		LongDoubleWidth = LongDoubleAlign = 128;
LongDoubleFormat = &llvm::APFloat::PPCDoubleDouble;		LongDoubleFormat = &llvm::APFloat::PPCDoubleDouble;
}		}

/// \brief Flags for architecture specific defines.		/// \brief Flags for architecture specific defines.
typedef enum {		typedef enum {
▲ Show 20 Lines • Show All 263 Lines • ▼ Show 20 Lines	for (const auto &Feature : Features) {
} else if (Feature == "+direct-move") {		} else if (Feature == "+direct-move") {
HasDirectMove = true;		HasDirectMove = true;
} else if (Feature == "+qpx") {		} else if (Feature == "+qpx") {
HasQPX = true;		HasQPX = true;
} else if (Feature == "+htm") {		} else if (Feature == "+htm") {
HasHTM = true;		HasHTM = true;
} else if (Feature == "+float128") {		} else if (Feature == "+float128") {
HasFloat128 = true;		HasFloat128 = true;
		} else if (Feature == "+power9-vector") {
		HasP9Vector = true;
}		}
// TODO: Finish this list and add an assert that we've handled them		// TODO: Finish this list and add an assert that we've handled them
// all.		// all.
}		}

return true;		return true;
}		}

▲ Show 20 Lines • Show All 153 Lines • ▼ Show 20 Lines	void PPCTargetInfo::getTargetDefines(const LangOptions &Opts,
if (HasP8Vector)		if (HasP8Vector)
Builder.defineMacro("__POWER8_VECTOR__");		Builder.defineMacro("__POWER8_VECTOR__");
if (HasP8Crypto)		if (HasP8Crypto)
Builder.defineMacro("__CRYPTO__");		Builder.defineMacro("__CRYPTO__");
if (HasHTM)		if (HasHTM)
Builder.defineMacro("__HTM__");		Builder.defineMacro("__HTM__");
if (HasFloat128)		if (HasFloat128)
Builder.defineMacro("__FLOAT128__");		Builder.defineMacro("__FLOAT128__");
		if (HasP9Vector)
		Builder.defineMacro("__POWER9_VECTOR__");

Builder.defineMacro("__GCC_HAVE_SYNC_COMPARE_AND_SWAP_1");		Builder.defineMacro("__GCC_HAVE_SYNC_COMPARE_AND_SWAP_1");
Builder.defineMacro("__GCC_HAVE_SYNC_COMPARE_AND_SWAP_2");		Builder.defineMacro("__GCC_HAVE_SYNC_COMPARE_AND_SWAP_2");
Builder.defineMacro("__GCC_HAVE_SYNC_COMPARE_AND_SWAP_4");		Builder.defineMacro("__GCC_HAVE_SYNC_COMPARE_AND_SWAP_4");
if (PointerWidth == 64)		if (PointerWidth == 64)
Builder.defineMacro("__GCC_HAVE_SYNC_COMPARE_AND_SWAP_8");		Builder.defineMacro("__GCC_HAVE_SYNC_COMPARE_AND_SWAP_8");

// FIXME: The following are not yet generated here by Clang, but are		// FIXME: The following are not yet generated here by Clang, but are
Show All 13 Lines	void PPCTargetInfo::getTargetDefines(const LangOptions &Opts,
// __CMODEL_MEDIUM__		// __CMODEL_MEDIUM__
// __CMODEL_LARGE__		// __CMODEL_LARGE__
// _CALL_SYSV		// _CALL_SYSV
// _CALL_DARWIN		// _CALL_DARWIN
// __NO_FPRS__		// __NO_FPRS__
}		}

// Handle explicit options being passed to the compiler here: if we've		// Handle explicit options being passed to the compiler here: if we've
// explicitly turned off vsx and turned on power8-vector or direct-move then		// explicitly turned off vsx and turned on power8-vector or direct-move then
		kbartonUnsubmitted Not Done Reply Inline Actions Please update this comment to also include float128 and power9-vector. kbarton: Please update this comment to also include float128 and power9-vector.
// go ahead and error since the customer has expressed a somewhat incompatible		// go ahead and error since the customer has expressed a somewhat incompatible
// set of options.		// set of options.
static bool ppcUserFeaturesCheck(DiagnosticsEngine &Diags,		static bool ppcUserFeaturesCheck(DiagnosticsEngine &Diags,
const std::vector<std::string> &FeaturesVec) {		const std::vector<std::string> &FeaturesVec) {

if (std::find(FeaturesVec.begin(), FeaturesVec.end(), "-vsx") !=		if (std::find(FeaturesVec.begin(), FeaturesVec.end(), "-vsx") !=
FeaturesVec.end()) {		FeaturesVec.end()) {
if (std::find(FeaturesVec.begin(), FeaturesVec.end(), "+power8-vector") !=		if (std::find(FeaturesVec.begin(), FeaturesVec.end(), "+power8-vector") !=
Show All 11 Lines	if (std::find(FeaturesVec.begin(), FeaturesVec.end(), "-vsx") !=
}		}

if (std::find(FeaturesVec.begin(), FeaturesVec.end(), "+float128") !=		if (std::find(FeaturesVec.begin(), FeaturesVec.end(), "+float128") !=
FeaturesVec.end()) {		FeaturesVec.end()) {
Diags.Report(diag::err_opt_not_valid_with_opt) << "-mfloat128"		Diags.Report(diag::err_opt_not_valid_with_opt) << "-mfloat128"
<< "-mno-vsx";		<< "-mno-vsx";
return false;		return false;
}		}

		if (std::find(FeaturesVec.begin(), FeaturesVec.end(), "+power9-vector") !=
		echristoUnsubmitted Not Done Reply Inline Actions Can you combine this with the power8 check above? echristo: Can you combine this with the power8 check above?
		FeaturesVec.end()) {
		Diags.Report(diag::err_opt_not_valid_with_opt) << "-mpower9-vector"
		<< "-mno-vsx";
		return false;
		}
}		}

return true;		return true;
}		}

bool PPCTargetInfo::initFeatureMap(		bool PPCTargetInfo::initFeatureMap(
llvm::StringMap<bool> &Features, DiagnosticsEngine &Diags, StringRef CPU,		llvm::StringMap<bool> &Features, DiagnosticsEngine &Diags, StringRef CPU,
const std::vector<std::string> &FeaturesVec) const {		const std::vector<std::string> &FeaturesVec) const {
Features["altivec"] = llvm::StringSwitch<bool>(CPU)		Features["altivec"] = llvm::StringSwitch<bool>(CPU)
.Case("7400", true)		.Case("7400", true)
.Case("g4", true)		.Case("g4", true)
.Case("7450", true)		.Case("7450", true)
.Case("g4+", true)		.Case("g4+", true)
.Case("970", true)		.Case("970", true)
.Case("g5", true)		.Case("g5", true)
.Case("pwr6", true)		.Case("pwr6", true)
.Case("pwr7", true)		.Case("pwr7", true)
.Case("pwr8", true)		.Case("pwr8", true)
.Case("pwr9", true)		.Case("pwr9", true)
.Case("ppc64", true)		.Case("ppc64", true)
.Case("ppc64le", true)		.Case("ppc64le", true)
.Default(false);		.Default(false);

Features["qpx"] = (CPU == "a2q");		Features["qpx"] = (CPU == "a2q");
		Features["power9-vector"] = (CPU == "pwr9");
Features["crypto"] = llvm::StringSwitch<bool>(CPU)		Features["crypto"] = llvm::StringSwitch<bool>(CPU)
.Case("ppc64le", true)		.Case("ppc64le", true)
.Case("pwr9", true)		.Case("pwr9", true)
.Case("pwr8", true)		.Case("pwr8", true)
.Default(false);		.Default(false);
Features["power8-vector"] = llvm::StringSwitch<bool>(CPU)		Features["power8-vector"] = llvm::StringSwitch<bool>(CPU)
.Case("ppc64le", true)		.Case("ppc64le", true)
.Case("pwr9", true)		.Case("pwr9", true)
Show All 36 Lines	return llvm::StringSwitch<bool>(Feature)
.Case("power8-vector", HasP8Vector)		.Case("power8-vector", HasP8Vector)
.Case("crypto", HasP8Crypto)		.Case("crypto", HasP8Crypto)
.Case("direct-move", HasDirectMove)		.Case("direct-move", HasDirectMove)
.Case("qpx", HasQPX)		.Case("qpx", HasQPX)
.Case("htm", HasHTM)		.Case("htm", HasHTM)
.Case("bpermd", HasBPERMD)		.Case("bpermd", HasBPERMD)
.Case("extdiv", HasExtDiv)		.Case("extdiv", HasExtDiv)
.Case("float128", HasFloat128)		.Case("float128", HasFloat128)
		.Case("power9-vector", HasP9Vector)
.Default(false);		.Default(false);
}		}

void PPCTargetInfo::setFeatureEnabled(llvm::StringMap<bool> &Features,		void PPCTargetInfo::setFeatureEnabled(llvm::StringMap<bool> &Features,
StringRef Name, bool Enabled) const {		StringRef Name, bool Enabled) const {
// If we're enabling direct-move or power8-vector go ahead and enable vsx		// If we're enabling direct-move or power8-vector go ahead and enable vsx
// as well. Do the inverse if we're disabling vsx. We'll diagnose any user		// as well. Do the inverse if we're disabling vsx. We'll diagnose any user
// incompatible options.		// incompatible options.
if (Enabled) {		if (Enabled) {
if (Name == "direct-move") {		if (Name == "direct-move" \|\|
Features[Name] = Features["vsx"] = true;		Name == "power8-vector" \|\|
} else if (Name == "power8-vector") {		Name == "float128" \|\|
Features[Name] = Features["vsx"] = true;		Name == "power9-vector") {
} else if (Name == "float128") {		// power9-vector is really a superset of power8-vector so encode that.
Features[Name] = Features["vsx"] = true;		Features[Name] = Features["vsx"] = true;
		if (Name == "power9-vector")
		Features["power8-vector"] = true;
} else {		} else {
Features[Name] = true;		Features[Name] = true;
}		}
} else {		} else {
if (Name == "vsx") {		if (Name == "vsx") {
Features[Name] = Features["direct-move"] = Features["power8-vector"] =		Features[Name] = Features["direct-move"] = Features["power8-vector"] =
Features["float128"] = false;		Features["float128"] = Features["power9-vector"] = false;
		echristoUnsubmitted Not Done Reply Inline Actions This is starting to look a bit complicated here as I think you're now enabling power9 vector on power8? :\ echristo: This is starting to look a bit complicated here as I think you're now enabling power9 vector on…
} else {		} else {
Features[Name] = false;		Features[Name] = false;
}		}
}		}
}		}

const char * const PPCTargetInfo::GCCRegNames[] = {		const char * const PPCTargetInfo::GCCRegNames[] = {
"r0", "r1", "r2", "r3", "r4", "r5", "r6", "r7",		"r0", "r1", "r2", "r3", "r4", "r5", "r6", "r7",
▲ Show 20 Lines • Show All 7,198 Lines • Show Last 20 Lines

lib/CodeGen/CGBuiltin.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 7,515 Lines • ▼ Show 20 Lines	Value *CodeGenFunction::EmitPPCBuiltinExpr(unsigned BuiltinID,
case PPC::BI__builtin_altivec_vclzw:		case PPC::BI__builtin_altivec_vclzw:
case PPC::BI__builtin_altivec_vclzd: {		case PPC::BI__builtin_altivec_vclzd: {
llvm::Type *ResultType = ConvertType(E->getType());		llvm::Type *ResultType = ConvertType(E->getType());
Value *X = EmitScalarExpr(E->getArg(0));		Value *X = EmitScalarExpr(E->getArg(0));
Value *Undef = ConstantInt::get(Builder.getInt1Ty(), false);		Value *Undef = ConstantInt::get(Builder.getInt1Ty(), false);
Function *F = CGM.getIntrinsic(Intrinsic::ctlz, ResultType);		Function *F = CGM.getIntrinsic(Intrinsic::ctlz, ResultType);
return Builder.CreateCall(F, {X, Undef});		return Builder.CreateCall(F, {X, Undef});
}		}
		case PPC::BI__builtin_altivec_vctzb:
		case PPC::BI__builtin_altivec_vctzh:
		case PPC::BI__builtin_altivec_vctzw:
		case PPC::BI__builtin_altivec_vctzd: {
		llvm::Type *ResultType = ConvertType(E->getType());
		Value *X = EmitScalarExpr(E->getArg(0));
		Value *Undef = ConstantInt::get(Builder.getInt1Ty(), false);
		Function *F = CGM.getIntrinsic(Intrinsic::cttz, ResultType);
		return Builder.CreateCall(F, {X, Undef});
		}
		case PPC::BI__builtin_altivec_vpopcntb:
		case PPC::BI__builtin_altivec_vpopcnth:
		case PPC::BI__builtin_altivec_vpopcntw:
		case PPC::BI__builtin_altivec_vpopcntd: {
		llvm::Type *ResultType = ConvertType(E->getType());
		Value *X = EmitScalarExpr(E->getArg(0));
		llvm::Function *F = CGM.getIntrinsic(Intrinsic::ctpop, ResultType);
		return Builder.CreateCall(F, X);
		}
// Copy sign		// Copy sign
case PPC::BI__builtin_vsx_xvcpsgnsp:		case PPC::BI__builtin_vsx_xvcpsgnsp:
case PPC::BI__builtin_vsx_xvcpsgndp: {		case PPC::BI__builtin_vsx_xvcpsgndp: {
llvm::Type *ResultType = ConvertType(E->getType());		llvm::Type *ResultType = ConvertType(E->getType());
Value *X = EmitScalarExpr(E->getArg(0));		Value *X = EmitScalarExpr(E->getArg(0));
Value *Y = EmitScalarExpr(E->getArg(1));		Value *Y = EmitScalarExpr(E->getArg(1));
ID = Intrinsic::copysign;		ID = Intrinsic::copysign;
llvm::Function *F = CGM.getIntrinsic(ID, ResultType);		llvm::Function *F = CGM.getIntrinsic(ID, ResultType);
▲ Show 20 Lines • Show All 602 Lines • Show Last 20 Lines

lib/Headers/altivec.h

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 1,589 Lines • ▼ Show 20 Lines

#ifdef __VSX__		#ifdef __VSX__
static __inline__ vector bool long long __ATTRS_o_ai		static __inline__ vector bool long long __ATTRS_o_ai
vec_cmpeq(vector double __a, vector double __b) {		vec_cmpeq(vector double __a, vector double __b) {
return (vector bool long long)__builtin_vsx_xvcmpeqdp(__a, __b);		return (vector bool long long)__builtin_vsx_xvcmpeqdp(__a, __b);
}		}
#endif		#endif

		#ifdef __POWER9_VECTOR__
		/* vec_cmpne */

		static __inline__ vector bool char __ATTRS_o_ai
		vec_cmpne(vector bool char __a, vector bool char __b) {
		return (vector bool char)__builtin_altivec_vcmpneb((vector char)__a,
		(vector char)__b);
		}

		static __inline__ vector bool char __ATTRS_o_ai
		vec_cmpne(vector signed char __a, vector signed char __b) {
		return (vector bool char)__builtin_altivec_vcmpneb((vector char)__a,
		(vector char)__b);
		}

		static __inline__ vector bool char __ATTRS_o_ai
		vec_cmpne(vector unsigned char __a, vector unsigned char __b) {
		return (vector bool char)__builtin_altivec_vcmpneb((vector char)__a,
		(vector char)__b);
		}

		static __inline__ vector bool short __ATTRS_o_ai
		vec_cmpne(vector bool short __a, vector bool short __b) {
		return (vector bool short)__builtin_altivec_vcmpneh((vector short)__a,
		(vector short)__b);
		}

		static __inline__ vector bool short __ATTRS_o_ai
		vec_cmpne(vector signed short __a, vector signed short __b) {
		return (vector bool short)__builtin_altivec_vcmpneh((vector short)__a,
		(vector short)__b);
		}

		static __inline__ vector bool short __ATTRS_o_ai
		vec_cmpne(vector unsigned short __a, vector unsigned short __b) {
		return (vector bool short)__builtin_altivec_vcmpneh((vector short)__a,
		(vector short)__b);
		}

		static __inline__ vector bool int __ATTRS_o_ai
		vec_cmpne(vector bool int __a, vector bool int __b) {
		return (vector bool int)__builtin_altivec_vcmpnew((vector int)__a,
		(vector int)__b);
		}

		static __inline__ vector bool int __ATTRS_o_ai
		vec_cmpne(vector signed int __a, vector signed int __b) {
		return (vector bool int)__builtin_altivec_vcmpnew((vector int)__a,
		(vector int)__b);
		}

		static __inline__ vector bool int __ATTRS_o_ai
		vec_cmpne(vector unsigned int __a, vector unsigned int __b) {
		return (vector bool int)__builtin_altivec_vcmpnew((vector int)__a,
		(vector int)__b);
		}

		static __inline__ vector bool long long __ATTRS_o_ai
		vec_cmpne(vector bool long long __a, vector bool long long __b) {
		return (vector bool long long)
		~(__builtin_altivec_vcmpequd((vector long long)__a, (vector long long)__b));
		}

		static __inline__ vector bool long long __ATTRS_o_ai
		vec_cmpne(vector signed long long __a, vector signed long long __b) {
		return (vector bool long long)
		~(__builtin_altivec_vcmpequd((vector long long)__a, (vector long long)__b));
		}

		static __inline__ vector bool long long __ATTRS_o_ai
		vec_cmpne(vector unsigned long long __a, vector unsigned long long __b) {
		return (vector bool long long)
		~(__builtin_altivec_vcmpequd((vector long long)__a, (vector long long)__b));
		}

		static __inline__ vector bool int __ATTRS_o_ai
		vec_cmpne(vector float __a, vector float __b) {
		return (vector bool int)__builtin_altivec_vcmpnew((vector int)__a,
		(vector int)__b);
		}

		static __inline__ vector bool long long __ATTRS_o_ai
		vec_cmpne(vector double __a, vector double __b) {
		return (vector bool long long)
		~(__builtin_altivec_vcmpequd((vector long long)__a, (vector long long)__b));
		}

		/* vec_cmpnez */

		static __inline__ vector bool char __ATTRS_o_ai
		vec_cmpnez(vector signed char __a, vector signed char __b) {
		return (vector bool char)__builtin_altivec_vcmpnezb((vector char)__a,
		(vector char)__b);
		}

		static __inline__ vector bool char __ATTRS_o_ai
		vec_cmpnez(vector unsigned char __a, vector unsigned char __b) {
		return (vector bool char)__builtin_altivec_vcmpnezb((vector char)__a,
		(vector char)__b);
		}

		static __inline__ vector bool short __ATTRS_o_ai
		vec_cmpnez(vector signed short __a, vector signed short __b) {
		return (vector bool short)__builtin_altivec_vcmpnezh((vector short)__a,
		(vector short)__b);
		}

		static __inline__ vector bool short __ATTRS_o_ai
		vec_cmpnez(vector unsigned short __a, vector unsigned short __b) {
		return (vector bool short)__builtin_altivec_vcmpnezh((vector short)__a,
		(vector short)__b);
		}

		static __inline__ vector bool int __ATTRS_o_ai
		vec_cmpnez(vector signed int __a, vector signed int __b) {
		return (vector bool int)__builtin_altivec_vcmpnezw((vector int)__a,
		(vector int)__b);
		}

		static __inline__ vector bool int __ATTRS_o_ai
		vec_cmpnez(vector unsigned int __a, vector unsigned int __b) {
		return (vector bool int)__builtin_altivec_vcmpnezw((vector int)__a,
		(vector int)__b);
		}

		#endif

/* vec_cmpgt */		/* vec_cmpgt */

static __inline__ vector bool char __ATTRS_o_ai		static __inline__ vector bool char __ATTRS_o_ai
vec_cmpgt(vector signed char __a, vector signed char __b) {		vec_cmpgt(vector signed char __a, vector signed char __b) {
return (vector bool char)__builtin_altivec_vcmpgtsb(__a, __b);		return (vector bool char)__builtin_altivec_vcmpgtsb(__a, __b);
}		}

static __inline__ vector bool char __ATTRS_o_ai		static __inline__ vector bool char __ATTRS_o_ai
▲ Show 20 Lines • Show All 271 Lines • ▼ Show 20 Lines	vec_cmplt(vector signed long long __a, vector signed long long __b) {
return vec_cmpgt(__b, __a);		return vec_cmpgt(__b, __a);
}		}

static __inline__ vector bool long long __ATTRS_o_ai		static __inline__ vector bool long long __ATTRS_o_ai
vec_cmplt(vector unsigned long long __a, vector unsigned long long __b) {		vec_cmplt(vector unsigned long long __a, vector unsigned long long __b) {
return vec_cmpgt(__b, __a);		return vec_cmpgt(__b, __a);
}		}

		/* vec_popcnt */

		static __inline__ vector signed char __ATTRS_o_ai
		vec_popcnt(vector signed char __a) {
		return __builtin_altivec_vpopcntb(__a);
		}
		static __inline__ vector unsigned char __ATTRS_o_ai
		vec_popcnt(vector unsigned char __a) {
		return __builtin_altivec_vpopcntb(__a);
		}
		static __inline__ vector signed short __ATTRS_o_ai
		vec_popcnt(vector signed short __a) {
		return __builtin_altivec_vpopcnth(__a);
		}
		static __inline__ vector unsigned short __ATTRS_o_ai
		vec_popcnt(vector unsigned short __a) {
		return __builtin_altivec_vpopcnth(__a);
		}
		static __inline__ vector signed int __ATTRS_o_ai
		vec_popcnt(vector signed int __a) {
		return __builtin_altivec_vpopcntw(__a);
		}
		static __inline__ vector unsigned int __ATTRS_o_ai
		vec_popcnt(vector unsigned int __a) {
		return __builtin_altivec_vpopcntw(__a);
		}
		static __inline__ vector signed long long __ATTRS_o_ai
		vec_popcnt(vector signed long long __a) {
		return __builtin_altivec_vpopcntd(__a);
		}
		static __inline__ vector unsigned long long __ATTRS_o_ai
		vec_popcnt(vector unsigned long long __a) {
		return __builtin_altivec_vpopcntd(__a);
		}

/* vec_cntlz */		/* vec_cntlz */

static __inline__ vector signed char __ATTRS_o_ai		static __inline__ vector signed char __ATTRS_o_ai
vec_cntlz(vector signed char __a) {		vec_cntlz(vector signed char __a) {
return __builtin_altivec_vclzb(__a);		return __builtin_altivec_vclzb(__a);
}		}
static __inline__ vector unsigned char __ATTRS_o_ai		static __inline__ vector unsigned char __ATTRS_o_ai
vec_cntlz(vector unsigned char __a) {		vec_cntlz(vector unsigned char __a) {
Show All 20 Lines	vec_cntlz(vector signed long long __a) {
return __builtin_altivec_vclzd(__a);		return __builtin_altivec_vclzd(__a);
}		}
static __inline__ vector unsigned long long __ATTRS_o_ai		static __inline__ vector unsigned long long __ATTRS_o_ai
vec_cntlz(vector unsigned long long __a) {		vec_cntlz(vector unsigned long long __a) {
return __builtin_altivec_vclzd(__a);		return __builtin_altivec_vclzd(__a);
}		}
#endif		#endif

		#ifdef __POWER9_VECTOR__

		/* vec_cnttz */

		static __inline__ vector signed char __ATTRS_o_ai
		vec_cnttz(vector signed char __a) {
		return __builtin_altivec_vctzb(__a);
		}
		static __inline__ vector unsigned char __ATTRS_o_ai
		vec_cnttz(vector unsigned char __a) {
		return __builtin_altivec_vctzb(__a);
		}
		static __inline__ vector signed short __ATTRS_o_ai
		vec_cnttz(vector signed short __a) {
		return __builtin_altivec_vctzh(__a);
		}
		static __inline__ vector unsigned short __ATTRS_o_ai
		vec_cnttz(vector unsigned short __a) {
		return __builtin_altivec_vctzh(__a);
		}
		static __inline__ vector signed int __ATTRS_o_ai
		vec_cnttz(vector signed int __a) {
		return __builtin_altivec_vctzw(__a);
		}
		static __inline__ vector unsigned int __ATTRS_o_ai
		vec_cnttz(vector unsigned int __a) {
		return __builtin_altivec_vctzw(__a);
		}
		static __inline__ vector signed long long __ATTRS_o_ai
		vec_cnttz(vector signed long long __a) {
		return __builtin_altivec_vctzd(__a);
		}
		static __inline__ vector unsigned long long __ATTRS_o_ai
		vec_cnttz(vector unsigned long long __a) {
		return __builtin_altivec_vctzd(__a);
		}

		/* vec_first_match_index */

		static __inline__ unsigned __ATTRS_o_ai
		vec_first_match_index(vector signed char __a, vector signed char __b) {
		vector unsigned long long __res =
		#ifdef __LITTLE_ENDIAN__
		vec_cnttz((vector unsigned long long)vec_cmpeq(__a, __b));
		#else
		vec_cntlz((vector unsigned long long)vec_cmpeq(__a, __b));
		#endif
		if (__res[0] == 64) {
		return (__res[1] + 64) >> 3;
		}
		return __res[0] >> 3;
		}

		static __inline__ unsigned __ATTRS_o_ai
		vec_first_match_index(vector unsigned char __a, vector unsigned char __b) {
		vector unsigned long long __res =
		#ifdef __LITTLE_ENDIAN__
		vec_cnttz((vector unsigned long long)vec_cmpeq(__a, __b));
		#else
		vec_cntlz((vector unsigned long long)vec_cmpeq(__a, __b));
		#endif
		if (__res[0] == 64) {
		return (__res[1] + 64) >> 3;
		}
		return __res[0] >> 3;
		}

		static __inline__ unsigned __ATTRS_o_ai
		vec_first_match_index(vector signed short __a, vector signed short __b) {
		vector unsigned long long __res =
		#ifdef __LITTLE_ENDIAN__
		vec_cnttz((vector unsigned long long)vec_cmpeq(__a, __b));
		#else
		vec_cntlz((vector unsigned long long)vec_cmpeq(__a, __b));
		#endif
		if (__res[0] == 64) {
		return (__res[1] + 64) >> 4;
		}
		return __res[0] >> 4;
		}

		static __inline__ unsigned __ATTRS_o_ai
		vec_first_match_index(vector unsigned short __a, vector unsigned short __b) {
		vector unsigned long long __res =
		#ifdef __LITTLE_ENDIAN__
		vec_cnttz((vector unsigned long long)vec_cmpeq(__a, __b));
		#else
		vec_cntlz((vector unsigned long long)vec_cmpeq(__a, __b));
		#endif
		if (__res[0] == 64) {
		return (__res[1] + 64) >> 4;
		}
		return __res[0] >> 4;
		}

		static __inline__ unsigned __ATTRS_o_ai
		vec_first_match_index(vector signed int __a, vector signed int __b) {
		vector unsigned long long __res =
		#ifdef __LITTLE_ENDIAN__
		vec_cnttz((vector unsigned long long)vec_cmpeq(__a, __b));
		#else
		vec_cntlz((vector unsigned long long)vec_cmpeq(__a, __b));
		#endif
		if (__res[0] == 64) {
		return (__res[1] + 64) >> 5;
		}
		return __res[0] >> 5;
		}

		static __inline__ unsigned __ATTRS_o_ai
		vec_first_match_index(vector unsigned int __a, vector unsigned int __b) {
		vector unsigned long long __res =
		#ifdef __LITTLE_ENDIAN__
		vec_cnttz((vector unsigned long long)vec_cmpeq(__a, __b));
		#else
		vec_cntlz((vector unsigned long long)vec_cmpeq(__a, __b));
		#endif
		if (__res[0] == 64) {
		return (__res[1] + 64) >> 5;
		}
		return __res[0] >> 5;
		}

		/* vec_first_match_or_eos_index */

		static __inline__ unsigned __ATTRS_o_ai
		vec_first_match_or_eos_index(vector signed char __a, vector signed char __b) {
		/* Compare the result of the comparison of two vectors with either and OR the
		result. Either the elements are equal or one will equal the comparison
		result if either is zero.
		*/
		vector bool char __tmp1 = vec_cmpeq(__a, __b);
		vector bool char __tmp2 = __tmp1 \| vec_cmpeq(__tmp1, __a) \|
		vec_cmpeq(__tmp1, __b);

		vector unsigned long long __res =
		#ifdef __LITTLE_ENDIAN__
		vec_cnttz((vector unsigned long long)__tmp2);
		#else
		vec_cntlz((vector unsigned long long)__tmp2);
		#endif
		if (__res[0] == 64) {
		return (__res[1] + 64) >> 3;
		}
		return __res[0] >> 3;
		}

		static __inline__ unsigned __ATTRS_o_ai
		vec_first_match_or_eos_index(vector unsigned char __a,
		vector unsigned char __b) {
		vector bool char __tmp1 = vec_cmpeq(__a, __b);
		vector bool char __tmp2 = __tmp1 \| vec_cmpeq(__tmp1, __a) \|
		vec_cmpeq(__tmp1, __b);

		vector unsigned long long __res =
		#ifdef __LITTLE_ENDIAN__
		vec_cnttz((vector unsigned long long)__tmp2);
		#else
		vec_cntlz((vector unsigned long long)__tmp2);
		#endif
		if (__res[0] == 64) {
		return (__res[1] + 64) >> 3;
		}
		return __res[0] >> 3;
		}

		static __inline__ unsigned __ATTRS_o_ai
		vec_first_match_or_eos_index(vector signed short __a, vector signed short __b) {
		vector bool short __tmp1 = vec_cmpeq(__a, __b);
		vector bool short __tmp2 = __tmp1 \| vec_cmpeq(__tmp1, __a) \|
		vec_cmpeq(__tmp1, __b);

		vector unsigned long long __res =
		#ifdef __LITTLE_ENDIAN__
		vec_cnttz((vector unsigned long long)__tmp2);
		#else
		vec_cntlz((vector unsigned long long)__tmp2);
		#endif
		if (__res[0] == 64) {
		return (__res[1] + 64) >> 4;
		}
		return __res[0] >> 4;
		}

		static __inline__ unsigned __ATTRS_o_ai
		vec_first_match_or_eos_index(vector unsigned short __a,
		vector unsigned short __b) {
		vector bool short __tmp1 = vec_cmpeq(__a, __b);
		vector bool short __tmp2 = __tmp1 \| vec_cmpeq(__tmp1, __a) \|
		vec_cmpeq(__tmp1, __b);

		vector unsigned long long __res =
		#ifdef __LITTLE_ENDIAN__
		vec_cnttz((vector unsigned long long)__tmp2);
		#else
		vec_cntlz((vector unsigned long long)__tmp2);
		#endif
		if (__res[0] == 64) {
		return (__res[1] + 64) >> 4;
		}
		return __res[0] >> 4;
		}

		static __inline__ unsigned __ATTRS_o_ai
		vec_first_match_or_eos_index(vector signed int __a, vector signed int __b) {
		vector bool int __tmp1 = vec_cmpeq(__a, __b);
		vector bool int __tmp2 = __tmp1 \| vec_cmpeq(__tmp1, __a) \|
		vec_cmpeq(__tmp1, __b);

		vector unsigned long long __res =
		#ifdef __LITTLE_ENDIAN__
		vec_cnttz((vector unsigned long long)__tmp2);
		#else
		vec_cntlz((vector unsigned long long)__tmp2);
		#endif
		if (__res[0] == 64) {
		return (__res[1] + 64) >> 5;
		}
		return __res[0] >> 5;
		}

		static __inline__ unsigned __ATTRS_o_ai
		vec_first_match_or_eos_index(vector unsigned int __a,
		vector unsigned int __b) {
		vector bool int __tmp1 = vec_cmpeq(__a, __b);
		vector bool int __tmp2 = __tmp1 \| vec_cmpeq(__tmp1, __a) \|
		vec_cmpeq(__tmp1, __b);

		vector unsigned long long __res =
		#ifdef __LITTLE_ENDIAN__
		vec_cnttz((vector unsigned long long)__tmp2);
		#else
		vec_cntlz((vector unsigned long long)__tmp2);
		#endif
		if (__res[0] == 64) {
		return (__res[1] + 64) >> 5;
		}
		return __res[0] >> 5;
		}

		/* vec_first_mismatch_index */

		static __inline__ unsigned __ATTRS_o_ai
		vec_first_mismatch_index(vector signed char __a, vector signed char __b) {
		vector unsigned long long __res =
		#ifdef __LITTLE_ENDIAN__
		vec_cnttz((vector unsigned long long)vec_cmpne(__a, __b));
		#else
		vec_cntlz((vector unsigned long long)vec_cmpne(__a, __b));
		#endif
		if (__res[0] == 64) {
		return (__res[1] + 64) >> 3;
		}
		return __res[0] >> 3;
		}

		static __inline__ unsigned __ATTRS_o_ai
		vec_first_mismatch_index(vector unsigned char __a, vector unsigned char __b) {
		vector unsigned long long __res =
		#ifdef __LITTLE_ENDIAN__
		vec_cnttz((vector unsigned long long)vec_cmpne(__a, __b));
		#else
		vec_cntlz((vector unsigned long long)vec_cmpne(__a, __b));
		#endif
		if (__res[0] == 64) {
		return (__res[1] + 64) >> 3;
		}
		return __res[0] >> 3;
		}

		static __inline__ unsigned __ATTRS_o_ai
		vec_first_mismatch_index(vector signed short __a, vector signed short __b) {
		vector unsigned long long __res =
		#ifdef __LITTLE_ENDIAN__
		vec_cnttz((vector unsigned long long)vec_cmpne(__a, __b));
		#else
		vec_cntlz((vector unsigned long long)vec_cmpne(__a, __b));
		#endif
		if (__res[0] == 64) {
		return (__res[1] + 64) >> 4;
		}
		return __res[0] >> 4;
		}

		static __inline__ unsigned __ATTRS_o_ai
		vec_first_mismatch_index(vector unsigned short __a, vector unsigned short __b) {
		vector unsigned long long __res =
		#ifdef __LITTLE_ENDIAN__
		vec_cnttz((vector unsigned long long)vec_cmpne(__a, __b));
		#else
		vec_cntlz((vector unsigned long long)vec_cmpne(__a, __b));
		#endif
		if (__res[0] == 64) {
		return (__res[1] + 64) >> 4;
		}
		return __res[0] >> 4;
		}

		static __inline__ unsigned __ATTRS_o_ai
		vec_first_mismatch_index(vector signed int __a, vector signed int __b) {
		vector unsigned long long __res =
		#ifdef __LITTLE_ENDIAN__
		vec_cnttz((vector unsigned long long)vec_cmpne(__a, __b));
		#else
		vec_cntlz((vector unsigned long long)vec_cmpne(__a, __b));
		#endif
		if (__res[0] == 64) {
		return (__res[1] + 64) >> 5;
		}
		return __res[0] >> 5;
		}

		static __inline__ unsigned __ATTRS_o_ai
		vec_first_mismatch_index(vector unsigned int __a, vector unsigned int __b) {
		vector unsigned long long __res =
		#ifdef __LITTLE_ENDIAN__
		vec_cnttz((vector unsigned long long)vec_cmpne(__a, __b));
		#else
		vec_cntlz((vector unsigned long long)vec_cmpne(__a, __b));
		#endif
		if (__res[0] == 64) {
		return (__res[1] + 64) >> 5;
		}
		return __res[0] >> 5;
		}

		/* vec_first_mismatch_or_eos_index */

		static __inline__ unsigned __ATTRS_o_ai
		vec_first_mismatch_or_eos_index(vector signed char __a,
		vector signed char __b) {
		vector unsigned long long __res =
		#ifdef __LITTLE_ENDIAN__
		vec_cnttz((vector unsigned long long)vec_cmpnez(__a, __b));
		#else
		vec_cntlz((vector unsigned long long)vec_cmpnez(__a, __b));
		#endif
		if (__res[0] == 64) {
		return (__res[1] + 64) >> 3;
		}
		return __res[0] >> 3;
		}

		static __inline__ unsigned __ATTRS_o_ai
		vec_first_mismatch_or_eos_index(vector unsigned char __a,
		vector unsigned char __b) {
		vector unsigned long long __res =
		#ifdef __LITTLE_ENDIAN__
		vec_cnttz((vector unsigned long long)vec_cmpnez(__a, __b));
		#else
		vec_cntlz((vector unsigned long long)vec_cmpnez(__a, __b));
		#endif
		if (__res[0] == 64) {
		return (__res[1] + 64) >> 3;
		}
		return __res[0] >> 3;
		}

		static __inline__ unsigned __ATTRS_o_ai
		vec_first_mismatch_or_eos_index(vector signed short __a,
		vector signed short __b) {
		vector unsigned long long __res =
		#ifdef __LITTLE_ENDIAN__
		vec_cnttz((vector unsigned long long)vec_cmpnez(__a, __b));
		#else
		vec_cntlz((vector unsigned long long)vec_cmpnez(__a, __b));
		#endif
		if (__res[0] == 64) {
		return (__res[1] + 64) >> 4;
		}
		return __res[0] >> 4;
		}

		static __inline__ unsigned __ATTRS_o_ai
		vec_first_mismatch_or_eos_index(vector unsigned short __a,
		vector unsigned short __b) {
		vector unsigned long long __res =
		#ifdef __LITTLE_ENDIAN__
		vec_cnttz((vector unsigned long long)vec_cmpnez(__a, __b));
		#else
		vec_cntlz((vector unsigned long long)vec_cmpnez(__a, __b));
		#endif
		if (__res[0] == 64) {
		return (__res[1] + 64) >> 4;
		}
		return __res[0] >> 4;
		}

		static __inline__ unsigned __ATTRS_o_ai
		vec_first_mismatch_or_eos_index(vector signed int __a, vector signed int __b) {
		vector unsigned long long __res =
		#ifdef __LITTLE_ENDIAN__
		vec_cnttz((vector unsigned long long)vec_cmpnez(__a, __b));
		#else
		vec_cntlz((vector unsigned long long)vec_cmpnez(__a, __b));
		#endif
		if (__res[0] == 64) {
		return (__res[1] + 64) >> 5;
		}
		return __res[0] >> 5;
		}

		static __inline__ unsigned __ATTRS_o_ai
		vec_first_mismatch_or_eos_index(vector unsigned int __a,
		vector unsigned int __b) {
		vector unsigned long long __res =
		#ifdef __LITTLE_ENDIAN__
		vec_cnttz((vector unsigned long long)vec_cmpnez(__a, __b));
		#else
		vec_cntlz((vector unsigned long long)vec_cmpnez(__a, __b));
		#endif
		if (__res[0] == 64) {
		return (__res[1] + 64) >> 5;
		}
		return __res[0] >> 5;
		}

		#endif

/* vec_cpsgn */		/* vec_cpsgn */

#ifdef __VSX__		#ifdef __VSX__
static __inline__ vector float __ATTRS_o_ai vec_cpsgn(vector float __a,		static __inline__ vector float __ATTRS_o_ai vec_cpsgn(vector float __a,
vector float __b) {		vector float __b) {
return __builtin_vsx_xvcpsgnsp(__a, __b);		return __builtin_vsx_xvcpsgnsp(__a, __b);
}		}

▲ Show 20 Lines • Show All 12,530 Lines • Show Last 20 Lines

test/CodeGen/builtins-ppc-p9vector.c

				// REQUIRES: powerpc-registered-target
				// RUN: %clang_cc1 -faltivec -target-feature +power9-vector \
				echristoUnsubmitted Not Done Reply Inline Actions Please no code generation checks in clang. I doubt you really need to even pipe this through opt unless you feel that alloca issues are going to cause problems. I'd prefer just to check unoptimized IR. echristo: Please no code generation checks in clang. I doubt you really need to even pipe this through…
				// RUN: -triple powerpc64-unknown-unknown -emit-llvm %s \
				// RUN: -O2 -o - \| FileCheck %s -check-prefix=CHECK-BE

				// RUN: %clang_cc1 -faltivec -target-feature +power9-vector \
				// RUN: -triple powerpc64le-unknown-unknown -emit-llvm %s \
				// RUN: -O2 -o - \| FileCheck %s

				#include <altivec.h>

				vector signed char vsca, vscb;
				vector unsigned char vuca, vucb;
				vector bool char vbca, vbcb;
				vector signed short vssa, vssb;
				vector unsigned short vusa, vusb;
				vector bool short vbsa, vbsb;
				vector signed int vsia, vsib;
				vector unsigned int vuia, vuib;
				vector bool int vbia, vbib;
				vector signed long long vsla, vslb;
				vector unsigned long long vula, vulb;
				vector bool long long vbla, vblb;
				vector float vfa, vfb;
				vector double vda, vdb;

				unsigned test1(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpequb(<16 x i8>
				// CHECK-BE: @llvm.ctlz.v2i64(<2 x i64>
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: icmp eq i64 {{.*}}, 64
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: add i64 {{.*}}, 64
				// CHECK-BE: select i1
				// CHECK-BE: lshr i64 {{.*}}, 3
				// CHECK: @llvm.ppc.altivec.vcmpequb(<16 x i8>
				// CHECK: @llvm.cttz.v2i64(<2 x i64>
				// CHECK: extractelement <2 x i64>
				// CHECK: icmp eq i64 {{.*}}, 64
				// CHECK: extractelement <2 x i64>
				// CHECK: add i64 {{.*}}, 64
				// CHECK: select i1
				// CHECK: lshr i64 {{.*}}, 3
				return vec_first_match_index (vsca, vscb);
				}
				unsigned test2(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpequb(<16 x i8>
				// CHECK-BE: @llvm.ctlz.v2i64(<2 x i64>
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: icmp eq i64 {{.*}}, 64
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: add i64 {{.*}}, 64
				// CHECK-BE: select i1
				// CHECK-BE: lshr i64 {{.*}}, 3
				// CHECK: @llvm.ppc.altivec.vcmpequb(<16 x i8>
				// CHECK: @llvm.cttz.v2i64(<2 x i64>
				// CHECK: extractelement <2 x i64>
				// CHECK: icmp eq i64 {{.*}}, 64
				// CHECK: extractelement <2 x i64>
				// CHECK: add i64 {{.*}}, 64
				// CHECK: select i1
				// CHECK: lshr i64 {{.*}}, 3
				return vec_first_match_index (vuca, vucb);
				}
				unsigned test3(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpequw(<4 x i32>
				// CHECK-BE: @llvm.ctlz.v2i64(<2 x i64>
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: icmp eq i64 {{.*}}, 64
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: add i64 {{.*}}, 64
				// CHECK-BE: select i1
				// CHECK-BE: lshr i64 {{.*}}, 5
				// CHECK: @llvm.ppc.altivec.vcmpequw(<4 x i32>
				// CHECK: @llvm.cttz.v2i64(<2 x i64>
				// CHECK: extractelement <2 x i64>
				// CHECK: icmp eq i64 {{.*}}, 64
				// CHECK: extractelement <2 x i64>
				// CHECK: add i64 {{.*}}, 64
				// CHECK: select i1
				// CHECK: lshr i64 {{.*}}, 5
				return vec_first_match_index (vsia, vsib);
				}
				unsigned test4(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpequw(<4 x i32>
				// CHECK-BE: @llvm.ctlz.v2i64(<2 x i64>
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: icmp eq i64 {{.*}}, 64
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: add i64 {{.*}}, 64
				// CHECK-BE: select i1
				// CHECK-BE: lshr i64 {{.*}}, 5
				// CHECK: @llvm.ppc.altivec.vcmpequw(<4 x i32>
				// CHECK: @llvm.cttz.v2i64(<2 x i64>
				// CHECK: extractelement <2 x i64>
				// CHECK: icmp eq i64 {{.*}}, 64
				// CHECK: extractelement <2 x i64>
				// CHECK: add i64 {{.*}}, 64
				// CHECK: select i1
				// CHECK: lshr i64 {{.*}}, 5
				return vec_first_match_index (vuia, vuib);
				}
				unsigned test5(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpequh(<8 x i16>
				// CHECK-BE: @llvm.ctlz.v2i64(<2 x i64>
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: icmp eq i64 {{.*}}, 64
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: add i64 {{.*}}, 64
				// CHECK-BE: select i1
				// CHECK-BE: lshr i64 {{.*}}, 4
				// CHECK: @llvm.ppc.altivec.vcmpequh(<8 x i16>
				// CHECK: @llvm.cttz.v2i64(<2 x i64>
				// CHECK: extractelement <2 x i64>
				// CHECK: icmp eq i64 {{.*}}, 64
				// CHECK: extractelement <2 x i64>
				// CHECK: add i64 {{.*}}, 64
				// CHECK: select i1
				// CHECK: lshr i64 {{.*}}, 4
				return vec_first_match_index (vssa, vssb);
				}
				unsigned test6(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpequh(<8 x i16>
				// CHECK-BE: @llvm.ctlz.v2i64(<2 x i64>
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: icmp eq i64 {{.*}}, 64
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: add i64 {{.*}}, 64
				// CHECK-BE: select i1
				// CHECK-BE: lshr i64 {{.*}}, 4
				// CHECK: @llvm.ppc.altivec.vcmpequh(<8 x i16>
				// CHECK: @llvm.cttz.v2i64(<2 x i64>
				// CHECK: extractelement <2 x i64>
				// CHECK: icmp eq i64 {{.*}}, 64
				// CHECK: extractelement <2 x i64>
				// CHECK: add i64 {{.*}}, 64
				// CHECK: select i1
				// CHECK: lshr i64 {{.*}}, 4
				return vec_first_match_index (vusa, vusb);
				}
				unsigned test7(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpequb(<16 x i8>
				// CHECK-BE: @llvm.ppc.altivec.vcmpequb(<16 x i8>
				// CHECK-BE: or <16 x i8>
				// CHECK-BE: @llvm.ppc.altivec.vcmpequb(<16 x i8>
				// CHECK-BE: or <16 x i8>
				// CHECK-BE: @llvm.ctlz.v2i64(<2 x i64>
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: icmp eq i64 {{.*}}, 64
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: add i64 {{.*}}, 64
				// CHECK-BE: select i1
				// CHECK-BE: lshr i64 {{.*}}, 3
				// CHECK: @llvm.ppc.altivec.vcmpequb(<16 x i8>
				// CHECK: @llvm.ppc.altivec.vcmpequb(<16 x i8>
				// CHECK: or <16 x i8>
				// CHECK: @llvm.ppc.altivec.vcmpequb(<16 x i8>
				// CHECK: or <16 x i8>
				// CHECK: @llvm.cttz.v2i64(<2 x i64>
				// CHECK: extractelement <2 x i64>
				// CHECK: icmp eq i64 {{.*}}, 64
				// CHECK: extractelement <2 x i64>
				// CHECK: add i64 {{.*}}, 64
				// CHECK: select i1
				// CHECK: lshr i64 {{.*}}, 3
				return vec_first_match_or_eos_index (vsca, vscb);
				}
				unsigned test8(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpequb(<16 x i8>
				// CHECK-BE: @llvm.ppc.altivec.vcmpequb(<16 x i8>
				// CHECK-BE: or <16 x i8>
				// CHECK-BE: @llvm.ppc.altivec.vcmpequb(<16 x i8>
				// CHECK-BE: or <16 x i8>
				// CHECK-BE: @llvm.ctlz.v2i64(<2 x i64>
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: icmp eq i64 {{.*}}, 64
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: add i64 {{.*}}, 64
				// CHECK-BE: select i1
				// CHECK-BE: lshr i64 {{.*}}, 3
				// CHECK: @llvm.ppc.altivec.vcmpequb(<16 x i8>
				// CHECK: @llvm.ppc.altivec.vcmpequb(<16 x i8>
				// CHECK: or <16 x i8>
				// CHECK: @llvm.ppc.altivec.vcmpequb(<16 x i8>
				// CHECK: or <16 x i8>
				// CHECK: @llvm.cttz.v2i64(<2 x i64>
				// CHECK: extractelement <2 x i64>
				// CHECK: icmp eq i64 {{.*}}, 64
				// CHECK: extractelement <2 x i64>
				// CHECK: add i64 {{.*}}, 64
				// CHECK: select i1
				// CHECK: lshr i64 {{.*}}, 3
				return vec_first_match_or_eos_index (vuca, vucb);
				}
				unsigned test9(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpequw(<4 x i32>
				// CHECK-BE: @llvm.ppc.altivec.vcmpequw(<4 x i32>
				// CHECK-BE: or <4 x i32>
				// CHECK-BE: @llvm.ppc.altivec.vcmpequw(<4 x i32>
				// CHECK-BE: or <4 x i32>
				// CHECK-BE: @llvm.ctlz.v2i64(<2 x i64>
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: icmp eq i64 {{.*}}, 64
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: add i64 {{.*}}, 64
				// CHECK-BE: select i1
				// CHECK-BE: lshr i64 {{.*}}, 5
				// CHECK: @llvm.ppc.altivec.vcmpequw(<4 x i32>
				// CHECK: @llvm.ppc.altivec.vcmpequw(<4 x i32>
				// CHECK: or <4 x i32>
				// CHECK: @llvm.ppc.altivec.vcmpequw(<4 x i32>
				// CHECK: or <4 x i32>
				// CHECK: @llvm.cttz.v2i64(<2 x i64>
				// CHECK: extractelement <2 x i64>
				// CHECK: icmp eq i64 {{.*}}, 64
				// CHECK: extractelement <2 x i64>
				// CHECK: add i64 {{.*}}, 64
				// CHECK: select i1
				// CHECK: lshr i64 {{.*}}, 5
				return vec_first_match_or_eos_index (vsia, vsib);
				}
				unsigned test10(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpequw(<4 x i32>
				// CHECK-BE: @llvm.ppc.altivec.vcmpequw(<4 x i32>
				// CHECK-BE: or <4 x i32>
				// CHECK-BE: @llvm.ppc.altivec.vcmpequw(<4 x i32>
				// CHECK-BE: or <4 x i32>
				// CHECK-BE: @llvm.ctlz.v2i64(<2 x i64>
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: icmp eq i64 {{.*}}, 64
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: add i64 {{.*}}, 64
				// CHECK-BE: select i1
				// CHECK-BE: lshr i64 {{.*}}, 5
				// CHECK: @llvm.ppc.altivec.vcmpequw(<4 x i32>
				// CHECK: @llvm.ppc.altivec.vcmpequw(<4 x i32>
				// CHECK: or <4 x i32>
				// CHECK: @llvm.ppc.altivec.vcmpequw(<4 x i32>
				// CHECK: or <4 x i32>
				// CHECK: @llvm.cttz.v2i64(<2 x i64>
				// CHECK: extractelement <2 x i64>
				// CHECK: icmp eq i64 {{.*}}, 64
				// CHECK: extractelement <2 x i64>
				// CHECK: add i64 {{.*}}, 64
				// CHECK: select i1
				// CHECK: lshr i64 {{.*}}, 5
				return vec_first_match_or_eos_index (vuia, vuib);
				}
				unsigned test11(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpequh(<8 x i16>
				// CHECK-BE: @llvm.ppc.altivec.vcmpequh(<8 x i16>
				// CHECK-BE: or <8 x i16>
				// CHECK-BE: @llvm.ppc.altivec.vcmpequh(<8 x i16>
				// CHECK-BE: or <8 x i16>
				// CHECK-BE: @llvm.ctlz.v2i64(<2 x i64>
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: icmp eq i64 {{.*}}, 64
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: add i64 {{.*}}, 64
				// CHECK-BE: select i1
				// CHECK-BE: lshr i64 {{.*}}, 4
				// CHECK: @llvm.ppc.altivec.vcmpequh(<8 x i16>
				// CHECK: @llvm.ppc.altivec.vcmpequh(<8 x i16>
				// CHECK: or <8 x i16>
				// CHECK: @llvm.ppc.altivec.vcmpequh(<8 x i16>
				// CHECK: or <8 x i16>
				// CHECK: @llvm.cttz.v2i64(<2 x i64>
				// CHECK: extractelement <2 x i64>
				// CHECK: icmp eq i64 {{.*}}, 64
				// CHECK: extractelement <2 x i64>
				// CHECK: add i64 {{.*}}, 64
				// CHECK: select i1
				// CHECK: lshr i64 {{.*}}, 4
				return vec_first_match_or_eos_index (vssa, vssb);
				}
				unsigned test12(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpequh(<8 x i16>
				// CHECK-BE: @llvm.ppc.altivec.vcmpequh(<8 x i16>
				// CHECK-BE: or <8 x i16>
				// CHECK-BE: @llvm.ppc.altivec.vcmpequh(<8 x i16>
				// CHECK-BE: or <8 x i16>
				// CHECK-BE: @llvm.ctlz.v2i64(<2 x i64>
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: icmp eq i64 {{.*}}, 64
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: add i64 {{.*}}, 64
				// CHECK-BE: select i1
				// CHECK-BE: lshr i64 {{.*}}, 4
				// CHECK: @llvm.ppc.altivec.vcmpequh(<8 x i16>
				// CHECK: @llvm.ppc.altivec.vcmpequh(<8 x i16>
				// CHECK: or <8 x i16>
				// CHECK: @llvm.ppc.altivec.vcmpequh(<8 x i16>
				// CHECK: or <8 x i16>
				// CHECK: @llvm.cttz.v2i64(<2 x i64>
				// CHECK: extractelement <2 x i64>
				// CHECK: icmp eq i64 {{.*}}, 64
				// CHECK: extractelement <2 x i64>
				// CHECK: add i64 {{.*}}, 64
				// CHECK: select i1
				// CHECK: lshr i64 {{.*}}, 4
				return vec_first_match_or_eos_index (vusa, vusb);
				}
				unsigned test13(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpneb(<16 x i8>
				// CHECK-BE: @llvm.ctlz.v2i64(<2 x i64>
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: icmp eq i64 {{.*}}, 64
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: add i64 {{.*}}, 64
				// CHECK-BE: select i1
				// CHECK-BE: lshr i64 {{.*}}, 3
				// CHECK: @llvm.ppc.altivec.vcmpneb(<16 x i8>
				// CHECK: @llvm.cttz.v2i64(<2 x i64>
				// CHECK: extractelement <2 x i64>
				// CHECK: icmp eq i64 {{.*}}, 64
				// CHECK: extractelement <2 x i64>
				// CHECK: add i64 {{.*}}, 64
				// CHECK: select i1
				// CHECK: lshr i64 {{.*}}, 3
				return vec_first_mismatch_index (vsca, vscb);
				}
				unsigned test14(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpneb(<16 x i8>
				// CHECK-BE: @llvm.ctlz.v2i64(<2 x i64>
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: icmp eq i64 {{.*}}, 64
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: add i64 {{.*}}, 64
				// CHECK-BE: select i1
				// CHECK-BE: lshr i64 {{.*}}, 3
				// CHECK: @llvm.ppc.altivec.vcmpneb(<16 x i8>
				// CHECK: @llvm.cttz.v2i64(<2 x i64>
				// CHECK: extractelement <2 x i64>
				// CHECK: icmp eq i64 {{.*}}, 64
				// CHECK: extractelement <2 x i64>
				// CHECK: add i64 {{.*}}, 64
				// CHECK: select i1
				// CHECK: lshr i64 {{.*}}, 3
				return vec_first_mismatch_index (vuca, vucb);
				}
				unsigned test15(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpnew(<4 x i32>
				// CHECK-BE: @llvm.ctlz.v2i64(<2 x i64>
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: icmp eq i64 {{.*}}, 64
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: add i64 {{.*}}, 64
				// CHECK-BE: select i1
				// CHECK-BE: lshr i64 {{.*}}, 5
				// CHECK: @llvm.ppc.altivec.vcmpnew(<4 x i32>
				// CHECK: @llvm.cttz.v2i64(<2 x i64>
				// CHECK: extractelement <2 x i64>
				// CHECK: icmp eq i64 {{.*}}, 64
				// CHECK: extractelement <2 x i64>
				// CHECK: add i64 {{.*}}, 64
				// CHECK: select i1
				// CHECK: lshr i64 {{.*}}, 5
				return vec_first_mismatch_index (vsia, vsib);
				}
				unsigned test16(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpnew(<4 x i32>
				// CHECK-BE: @llvm.ctlz.v2i64(<2 x i64>
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: icmp eq i64 {{.*}}, 64
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: add i64 {{.*}}, 64
				// CHECK-BE: select i1
				// CHECK-BE: lshr i64 {{.*}}, 5
				// CHECK: @llvm.ppc.altivec.vcmpnew(<4 x i32>
				// CHECK: @llvm.cttz.v2i64(<2 x i64>
				// CHECK: extractelement <2 x i64>
				// CHECK: icmp eq i64 {{.*}}, 64
				// CHECK: extractelement <2 x i64>
				// CHECK: add i64 {{.*}}, 64
				// CHECK: select i1
				// CHECK: lshr i64 {{.*}}, 5
				return vec_first_mismatch_index (vuia, vuib);
				}
				unsigned test17(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpneh(<8 x i16>
				// CHECK-BE: @llvm.ctlz.v2i64(<2 x i64>
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: icmp eq i64 {{.*}}, 64
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: add i64 {{.*}}, 64
				// CHECK-BE: select i1
				// CHECK-BE: lshr i64 {{.*}}, 4
				// CHECK: @llvm.ppc.altivec.vcmpneh(<8 x i16>
				// CHECK: @llvm.cttz.v2i64(<2 x i64>
				// CHECK: extractelement <2 x i64>
				// CHECK: icmp eq i64 {{.*}}, 64
				// CHECK: extractelement <2 x i64>
				// CHECK: add i64 {{.*}}, 64
				// CHECK: select i1
				// CHECK: lshr i64 {{.*}}, 4
				return vec_first_mismatch_index (vssa, vssb);
				}
				unsigned test18(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpneh(<8 x i16>
				// CHECK-BE: @llvm.ctlz.v2i64(<2 x i64>
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: icmp eq i64 {{.*}}, 64
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: add i64 {{.*}}, 64
				// CHECK-BE: select i1
				// CHECK-BE: lshr i64 {{.*}}, 4
				// CHECK: @llvm.ppc.altivec.vcmpneh(<8 x i16>
				// CHECK: @llvm.cttz.v2i64(<2 x i64>
				// CHECK: extractelement <2 x i64>
				// CHECK: icmp eq i64 {{.*}}, 64
				// CHECK: extractelement <2 x i64>
				// CHECK: add i64 {{.*}}, 64
				// CHECK: select i1
				// CHECK: lshr i64 {{.*}}, 4
				return vec_first_mismatch_index (vusa, vusb);
				}
				unsigned test19(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpnezb(<16 x i8>
				// CHECK-BE: @llvm.ctlz.v2i64(<2 x i64>
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: icmp eq i64 {{.*}}, 64
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: add i64 {{.*}}, 64
				// CHECK-BE: select i1
				// CHECK-BE: lshr i64 {{.*}}, 3
				// CHECK: @llvm.ppc.altivec.vcmpnezb(<16 x i8>
				// CHECK: @llvm.cttz.v2i64(<2 x i64>
				// CHECK: extractelement <2 x i64>
				// CHECK: icmp eq i64 {{.*}}, 64
				// CHECK: extractelement <2 x i64>
				// CHECK: add i64 {{.*}}, 64
				// CHECK: select i1
				// CHECK: lshr i64 {{.*}}, 3
				return vec_first_mismatch_or_eos_index (vsca, vscb);
				}
				unsigned test20(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpnezb(<16 x i8>
				// CHECK-BE: @llvm.ctlz.v2i64(<2 x i64>
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: icmp eq i64 {{.*}}, 64
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: add i64 {{.*}}, 64
				// CHECK-BE: select i1
				// CHECK-BE: lshr i64 {{.*}}, 3
				// CHECK: @llvm.ppc.altivec.vcmpnezb(<16 x i8>
				// CHECK: @llvm.cttz.v2i64(<2 x i64>
				// CHECK: extractelement <2 x i64>
				// CHECK: icmp eq i64 {{.*}}, 64
				// CHECK: extractelement <2 x i64>
				// CHECK: add i64 {{.*}}, 64
				// CHECK: select i1
				// CHECK: lshr i64 {{.*}}, 3
				return vec_first_mismatch_or_eos_index (vuca, vucb);
				}
				unsigned test21(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpnezw(<4 x i32>
				// CHECK-BE: @llvm.ctlz.v2i64(<2 x i64>
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: icmp eq i64 {{.*}}, 64
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: add i64 {{.*}}, 64
				// CHECK-BE: select i1
				// CHECK-BE: lshr i64 {{.*}}, 5
				// CHECK: @llvm.ppc.altivec.vcmpnezw(<4 x i32>
				// CHECK: @llvm.cttz.v2i64(<2 x i64>
				// CHECK: extractelement <2 x i64>
				// CHECK: icmp eq i64 {{.*}}, 64
				// CHECK: extractelement <2 x i64>
				// CHECK: add i64 {{.*}}, 64
				// CHECK: select i1
				// CHECK: lshr i64 {{.*}}, 5
				return vec_first_mismatch_or_eos_index (vsia, vsib);
				}
				unsigned test22(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpnezw(<4 x i32>
				// CHECK-BE: @llvm.ctlz.v2i64(<2 x i64>
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: icmp eq i64 {{.*}}, 64
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: add i64 {{.*}}, 64
				// CHECK-BE: select i1
				// CHECK-BE: lshr i64 {{.*}}, 5
				// CHECK: @llvm.ppc.altivec.vcmpnezw(<4 x i32>
				// CHECK: @llvm.cttz.v2i64(<2 x i64>
				// CHECK: extractelement <2 x i64>
				// CHECK: icmp eq i64 {{.*}}, 64
				// CHECK: extractelement <2 x i64>
				// CHECK: add i64 {{.*}}, 64
				// CHECK: select i1
				// CHECK: lshr i64 {{.*}}, 5
				return vec_first_mismatch_or_eos_index (vuia, vuib);
				}
				unsigned test23(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpnezh(<8 x i16>
				// CHECK-BE: @llvm.ctlz.v2i64(<2 x i64>
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: icmp eq i64 {{.*}}, 64
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: add i64 {{.*}}, 64
				// CHECK-BE: select i1
				// CHECK-BE: lshr i64 {{.*}}, 4
				// CHECK: @llvm.ppc.altivec.vcmpnezh(<8 x i16>
				// CHECK: @llvm.cttz.v2i64(<2 x i64>
				// CHECK: extractelement <2 x i64>
				// CHECK: icmp eq i64 {{.*}}, 64
				// CHECK: extractelement <2 x i64>
				// CHECK: add i64 {{.*}}, 64
				// CHECK: select i1
				// CHECK: lshr i64 {{.*}}, 4
				return vec_first_mismatch_or_eos_index (vssa, vssb);
				}
				unsigned test24(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpnezh(<8 x i16>
				// CHECK-BE: @llvm.ctlz.v2i64(<2 x i64>
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: icmp eq i64 {{.*}}, 64
				// CHECK-BE: extractelement <2 x i64>
				// CHECK-BE: add i64 {{.*}}, 64
				// CHECK-BE: select i1
				// CHECK-BE: lshr i64 {{.*}}, 4
				// CHECK: @llvm.ppc.altivec.vcmpnezh(<8 x i16>
				// CHECK: @llvm.cttz.v2i64(<2 x i64>
				// CHECK: extractelement <2 x i64>
				// CHECK: icmp eq i64 {{.*}}, 64
				// CHECK: extractelement <2 x i64>
				// CHECK: add i64 {{.*}}, 64
				// CHECK: select i1
				// CHECK: lshr i64 {{.*}}, 4
				return vec_first_mismatch_or_eos_index (vusa, vusb);
				}
				vector bool char test25(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpneb(<16 x i8>
				// CHECK-BE-NEXT: ret <16 x i8>
				// CHECK: @llvm.ppc.altivec.vcmpneb(<16 x i8>
				// CHECK-NEXT: ret <16 x i8>
				return vec_cmpne (vbca, vbcb);
				}
				vector bool char test26(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpneb(<16 x i8>
				// CHECK-BE-NEXT: ret <16 x i8>
				// CHECK: @llvm.ppc.altivec.vcmpneb(<16 x i8>
				// CHECK-NEXT: ret <16 x i8>
				return vec_cmpne (vsca, vscb);
				}
				vector bool char test27(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpneb(<16 x i8>
				// CHECK-BE-NEXT: ret <16 x i8>
				// CHECK: @llvm.ppc.altivec.vcmpneb(<16 x i8>
				// CHECK-NEXT: ret <16 x i8>
				return vec_cmpne (vuca, vucb);
				}
				vector bool int test28(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpnew(<4 x i32>
				// CHECK-BE-NEXT: ret <4 x i32>
				// CHECK: @llvm.ppc.altivec.vcmpnew(<4 x i32>
				// CHECK-NEXT: ret <4 x i32>
				return vec_cmpne (vbia, vbib);
				}
				vector bool int test29(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpnew(<4 x i32>
				// CHECK-BE-NEXT: ret <4 x i32>
				// CHECK: @llvm.ppc.altivec.vcmpnew(<4 x i32>
				// CHECK-NEXT: ret <4 x i32>
				return vec_cmpne (vsia, vsib);
				}
				vector bool int test30(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpnew(<4 x i32>
				// CHECK-BE-NEXT: ret <4 x i32>
				// CHECK: @llvm.ppc.altivec.vcmpnew(<4 x i32>
				// CHECK-NEXT: ret <4 x i32>
				return vec_cmpne (vuia, vuib);
				}
				vector bool long long test31(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpequd(<2 x i64>
				// CHECK-BE: xor <2 x i64>
				// CHECK-BE-NEXT: ret <2 x i64>
				// CHECK: @llvm.ppc.altivec.vcmpequd(<2 x i64>
				// CHECK: xor <2 x i64>
				// CHECK-NEXT: ret <2 x i64>
				return vec_cmpne (vbla, vblb);
				}
				vector bool long long test32(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpequd(<2 x i64>
				// CHECK-BE: xor <2 x i64>
				// CHECK-BE-NEXT: ret <2 x i64>
				// CHECK: @llvm.ppc.altivec.vcmpequd(<2 x i64>
				// CHECK: xor <2 x i64>
				// CHECK-NEXT: ret <2 x i64>
				return vec_cmpne (vsla, vslb);
				}
				vector bool long long test33(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpequd(<2 x i64>
				// CHECK-BE: xor <2 x i64>
				// CHECK-BE-NEXT: ret <2 x i64>
				// CHECK: @llvm.ppc.altivec.vcmpequd(<2 x i64>
				// CHECK: xor <2 x i64>
				// CHECK-NEXT: ret <2 x i64>
				return vec_cmpne (vula, vulb);
				}
				vector bool short test34(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpneh(<8 x i16>
				// CHECK-BE-NEXT: ret <8 x i16>
				// CHECK: @llvm.ppc.altivec.vcmpneh(<8 x i16>
				// CHECK-NEXT: ret <8 x i16>
				return vec_cmpne (vbsa, vbsb);
				}
				vector bool short test35(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpneh(<8 x i16>
				// CHECK-BE-NEXT: ret <8 x i16>
				// CHECK: @llvm.ppc.altivec.vcmpneh(<8 x i16>
				// CHECK-NEXT: ret <8 x i16>
				return vec_cmpne (vssa, vssb);
				}
				vector bool short test36(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpneh(<8 x i16>
				// CHECK-BE-NEXT: ret <8 x i16>
				// CHECK: @llvm.ppc.altivec.vcmpneh(<8 x i16>
				// CHECK-NEXT: ret <8 x i16>
				return vec_cmpne (vusa, vusb);
				}
				vector bool long long test37(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpequd(<2 x i64>
				// CHECK-BE: xor <2 x i64>
				// CHECK-BE-NEXT: ret <2 x i64>
				// CHECK: @llvm.ppc.altivec.vcmpequd(<2 x i64>
				// CHECK: xor <2 x i64>
				// CHECK-NEXT: ret <2 x i64>
				return vec_cmpne (vda, vdb);
				}
				vector bool int test38(void) {
				// CHECK-BE: @llvm.ppc.altivec.vcmpnew(<4 x i32>
				// CHECK-BE-NEXT: ret <4 x i32>
				// CHECK: @llvm.ppc.altivec.vcmpnew(<4 x i32>
				// CHECK-NEXT: ret <4 x i32>
				return vec_cmpne (vfa, vfb);
				}
				vector signed char test39(void) {
				// CHECK-BE: @llvm.cttz.v16i8(<16 x i8>
				// CHECK-BE-NEXT: ret <16 x i8>
				// CHECK: @llvm.cttz.v16i8(<16 x i8>
				// CHECK-NEXT: ret <16 x i8>
				return vec_cnttz (vsca);
				}
				vector unsigned char test40(void) {
				// CHECK-BE: @llvm.cttz.v16i8(<16 x i8>
				// CHECK-BE-NEXT: ret <16 x i8>
				// CHECK: @llvm.cttz.v16i8(<16 x i8>
				// CHECK-NEXT: ret <16 x i8>
				return vec_cnttz (vuca);
				}
				vector signed int test41(void) {
				// CHECK-BE: @llvm.cttz.v4i32(<4 x i32>
				// CHECK-BE-NEXT: ret <4 x i32>
				// CHECK: @llvm.cttz.v4i32(<4 x i32>
				// CHECK-NEXT: ret <4 x i32>
				return vec_cnttz (vsia);
				}
				vector unsigned int test42(void) {
				// CHECK-BE: @llvm.cttz.v4i32(<4 x i32>
				// CHECK-BE-NEXT: ret <4 x i32>
				// CHECK: @llvm.cttz.v4i32(<4 x i32>
				// CHECK-NEXT: ret <4 x i32>
				return vec_cnttz (vuia);
				}
				vector signed long long test43(void) {
				// CHECK-BE: @llvm.cttz.v2i64(<2 x i64>
				// CHECK-BE-NEXT: ret <2 x i64>
				// CHECK: @llvm.cttz.v2i64(<2 x i64>
				// CHECK-NEXT: ret <2 x i64>
				return vec_cnttz (vsla);
				}
				vector unsigned long long test44(void) {
				// CHECK-BE: @llvm.cttz.v2i64(<2 x i64>
				// CHECK-BE-NEXT: ret <2 x i64>
				// CHECK: @llvm.cttz.v2i64(<2 x i64>
				// CHECK-NEXT: ret <2 x i64>
				return vec_cnttz (vula);
				}
				vector signed short test45(void) {
				// CHECK-BE: @llvm.cttz.v8i16(<8 x i16>
				// CHECK-BE-NEXT: ret <8 x i16>
				// CHECK: @llvm.cttz.v8i16(<8 x i16>
				// CHECK-NEXT: ret <8 x i16>
				return vec_cnttz (vssa);
				}
				vector unsigned short test46(void) {
				// CHECK-BE: @llvm.cttz.v8i16(<8 x i16>
				// CHECK-BE-NEXT: ret <8 x i16>
				// CHECK: @llvm.cttz.v8i16(<8 x i16>
				// CHECK-NEXT: ret <8 x i16>
				return vec_cnttz (vusa);
				}
				vector unsigned char test47(void) {
				// CHECK-BE: @llvm.ctpop.v16i8(<16 x i8>
				// CHECK-BE-NEXT: ret <16 x i8>
				// CHECK: @llvm.ctpop.v16i8(<16 x i8>
				// CHECK-NEXT: ret <16 x i8>
				return vec_popcnt (vsca);
				}
				vector unsigned char test48(void) {
				// CHECK-BE: @llvm.ctpop.v16i8(<16 x i8>
				// CHECK-BE-NEXT: ret <16 x i8>
				// CHECK: @llvm.ctpop.v16i8(<16 x i8>
				// CHECK-NEXT: ret <16 x i8>
				return vec_popcnt (vuca);
				}
				vector unsigned int test49(void) {
				// CHECK-BE: @llvm.ctpop.v4i32(<4 x i32>
				// CHECK-BE-NEXT: ret <4 x i32>
				// CHECK: @llvm.ctpop.v4i32(<4 x i32>
				// CHECK-NEXT: ret <4 x i32>
				return vec_popcnt (vsia);
				}
				vector unsigned int test50(void) {
				// CHECK-BE: @llvm.ctpop.v4i32(<4 x i32>
				// CHECK-BE-NEXT: ret <4 x i32>
				// CHECK: @llvm.ctpop.v4i32(<4 x i32>
				// CHECK-NEXT: ret <4 x i32>
				return vec_popcnt (vuia);
				}
				vector unsigned long long test51(void) {
				// CHECK-BE: @llvm.ctpop.v2i64(<2 x i64>
				// CHECK-BE-NEXT: ret <2 x i64>
				// CHECK: @llvm.ctpop.v2i64(<2 x i64>
				// CHECK-NEXT: ret <2 x i64>
				return vec_popcnt (vsla);
				}
				vector unsigned long long test52(void) {
				// CHECK-BE: @llvm.ctpop.v2i64(<2 x i64>
				// CHECK-BE-NEXT: ret <2 x i64>
				// CHECK: @llvm.ctpop.v2i64(<2 x i64>
				// CHECK-NEXT: ret <2 x i64>
				return vec_popcnt (vula);
				}
				vector unsigned short test53(void) {
				// CHECK-BE: @llvm.ctpop.v8i16(<8 x i16>
				// CHECK-BE-NEXT: ret <8 x i16>
				// CHECK: @llvm.ctpop.v8i16(<8 x i16>
				// CHECK-NEXT: ret <8 x i16>
				return vec_popcnt (vssa);
				}
				vector unsigned short test54(void) {
				// CHECK-BE: @llvm.ctpop.v8i16(<8 x i16>
				// CHECK-BE-NEXT: ret <8 x i16>
				// CHECK: @llvm.ctpop.v8i16(<8 x i16>
				// CHECK-NEXT: ret <8 x i16>
				return vec_popcnt (vusa);
				}