This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
lib/Headers/
-
Headers/
11
altivec.h
-
test/CodeGen/
-
CodeGen/
-
builtins-ppc-altivec.c
-
builtins-ppc-p8vector.c
-
builtins-ppc-quadword.c
-
builtins-ppc-vsx.c

Differential D26544

[PPC] support for arithmetic builtins in the FE
ClosedPublic

Authored by amehsan on Nov 11 2016, 5:48 AM.

Download Raw Diff

Details

Reviewers

lei
syzaara
kbarton
sfertile
jtony
hfinkel
nemanjai

Summary

This includes various overloads of the following builtins:

vec_neg
vec_nabs
vec_adde
vec_addec
vec_sube
vec_subec
vec_subc

Note that for vec_sub builtins on 32 bit integers, the semantics is similar to what ISA describes for instructions working on quadwords: the first operand is added to the one's complement of the second operand. (As opposed to two's complement which I expected).

There is a small backend patch as well that I will post separately, but this is independent.

Diff Detail

Event Timeline

amehsan updated this revision to Diff 77608.Nov 11 2016, 5:48 AM

amehsan retitled this revision from to [PPC] support for arithmetic builtins in the FE.

amehsan updated this object.

amehsan added reviewers: kbarton, nemanjai, sfertile, jtony, hfinkel, syzaara, lei.

amehsan added subscribers: cfe-commits, echristo.

nemanjai added inline comments.Nov 13 2016, 6:49 PM

lib/Headers/altivec.h
313	I don't understand why we're adding `__mask` to the sum of `__a` and `__b`. Shouldn't that be `__carry`?
321	Same comment as above.
348	Is it not a little cleaner and more readable to just mask out the `__c` parameter before the loop (similarly to the masking you've done with `__mask` above)?
349	I think it's a little more clear and obvious what is happening if you actually have just a single cast and mask - i.e. unsigned long long __longa = ((unsigned long long) __a[i]) & 0xFFFFFFFF;

There are several inline comments that need to be addressed.
I also think it's worthwhile putting a comment at the top of the review indicating the (assumed) semantics for the vec_sube and vec_subec instructions that are being implemented (i.e., the behaviour mimics the vec_subeuqm instructions and thus uses one's compliment plus the carry)

lib/Headers/altivec.h
306	please remove blank line
349	Is a mask actually needed? This seems to be what is done in the vec_addec function below, without the cast. I agree that is cleaner. The other minor nit is to pick a single value for the mask (1, 0x01, 0x00000001) and use it consistently.
10515	Why do we mask the carries for sign/unsigned ints, but not __128 ints?
10523	Please reorder these to put the __128 below signed and unsigned.
10542	Is it possible to use vec_adde(a, ~b, __carry)?

kbarton requested changes to this revision.Nov 15 2016, 6:03 AM

kbarton edited edge metadata.

This revision now requires changes to proceed.Nov 15 2016, 6:03 AM

amehsan added inline comments.Nov 16 2016, 6:59 AM

lib/Headers/altivec.h

349

To come up with this code pattern I looked at the following pieces of codes:

unsigned long long f (int t) {
  return (unsigned long long)  t;
}

When compiled with optimization produces

define i64 @f(i32 signext %t) local_unnamed_addr #0 {
entry:
  %conv = sext i32 %t to i64
  ret i64 %conv
}

Which is incorrect. Also

unsigned long long f (int t) {
  return (unsigned long long)(unsigned)  t;
}
~

results in

define i64 @f(i32 signext %t) local_unnamed_addr #0 {
entry:
  %conv = zext i32 %t to i64
  ret i64 %conv
}

and

.Lfunc_begin0:
# BB#0:                                 # %entry
        clrldi   3, 3, 32
        blr

So I think the code here is optimal and correct and there is no need to change it.

10515

for quadword, hardware does the masking (implicitly, by only looking at the rightmost bit)

amehsan updated this revision to Diff 78376.Nov 17 2016, 9:53 AM

amehsan updated this object.

amehsan edited edge metadata.

LGTM

This revision is now accepted and ready to land.Nov 18 2016, 10:25 AM

https://reviews.llvm.org/rL287872

Revision Contents

Path

Size

lib/

Headers/

altivec.h

176 lines

test/

CodeGen/

builtins-ppc-altivec.c

80 lines

builtins-ppc-p8vector.c

86 lines

builtins-ppc-quadword.c

35 lines

builtins-ppc-vsx.c

20 lines

Diff 78376

lib/Headers/altivec.h

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 128 Lines • ▼ Show 20 Lines	#ifdef __VSX__
return __builtin_vsx_xvabssp(__a);		return __builtin_vsx_xvabssp(__a);
#else		#else
vector unsigned int __res =		vector unsigned int __res =
(vector unsigned int)__a & (vector unsigned int)(0x7FFFFFFF);		(vector unsigned int)__a & (vector unsigned int)(0x7FFFFFFF);
return (vector float)__res;		return (vector float)__res;
#endif		#endif
}		}

#if defined(__POWER8_VECTOR__) && defined(__powerpc64__)		#ifdef __VSX__
static __inline__ vector double __ATTRS_o_ai vec_abs(vector double __a) {		static __inline__ vector double __ATTRS_o_ai vec_abs(vector double __a) {
return __builtin_vsx_xvabsdp(__a);		return __builtin_vsx_xvabsdp(__a);
}		}
#endif		#endif

/* vec_abss */		/* vec_abss */
#define __builtin_altivec_abss_v16qi vec_abss		#define __builtin_altivec_abss_v16qi vec_abss
#define __builtin_altivec_abss_v8hi vec_abss		#define __builtin_altivec_abss_v8hi vec_abss
▲ Show 20 Lines • Show All 152 Lines • ▼ Show 20 Lines	vec_adde(vector signed __int128 __a, vector signed __int128 __b,
return __builtin_altivec_vaddeuqm(__a, __b, __c);		return __builtin_altivec_vaddeuqm(__a, __b, __c);
}		}

static __inline__ vector unsigned __int128 __ATTRS_o_ai		static __inline__ vector unsigned __int128 __ATTRS_o_ai
vec_adde(vector unsigned __int128 __a, vector unsigned __int128 __b,		vec_adde(vector unsigned __int128 __a, vector unsigned __int128 __b,
vector unsigned __int128 __c) {		vector unsigned __int128 __c) {
return __builtin_altivec_vaddeuqm(__a, __b, __c);		return __builtin_altivec_vaddeuqm(__a, __b, __c);
}		}
#endif		#endif
		kbartonUnsubmitted Not Done Reply Inline Actions please remove blank line kbarton: please remove blank line

		static __inline__ vector signed int __ATTRS_o_ai
		vec_adde(vector signed int __a, vector signed int __b,
		vector signed int __c) {
		vector signed int __mask = {1, 1, 1, 1};
		vector signed int __carry = __c & __mask;
		return vec_add(vec_add(__a, __b), __carry);
		nemanjaiUnsubmitted Not Done Reply Inline Actions I don't understand why we're adding `__mask` to the sum of `__a` and `__b`. Shouldn't that be `__carry`? nemanjai: I don't understand why we're adding `__mask` to the sum of `__a` and `__b`. Shouldn't that be…
		}

		static __inline__ vector unsigned int __ATTRS_o_ai
		vec_adde(vector unsigned int __a, vector unsigned int __b,
		vector unsigned int __c) {
		vector unsigned int __mask = {1, 1, 1, 1};
		vector unsigned int __carry = __c & __mask;
		return vec_add(vec_add(__a, __b), __carry);
		nemanjaiUnsubmitted Not Done Reply Inline Actions Same comment as above. nemanjai: Same comment as above.
		}

/* vec_addec */		/* vec_addec */

#if defined(__POWER8_VECTOR__) && defined(__powerpc64__)		#if defined(__POWER8_VECTOR__) && defined(__powerpc64__)
static __inline__ vector signed __int128 __ATTRS_o_ai		static __inline__ vector signed __int128 __ATTRS_o_ai
vec_addec(vector signed __int128 __a, vector signed __int128 __b,		vec_addec(vector signed __int128 __a, vector signed __int128 __b,
vector signed __int128 __c) {		vector signed __int128 __c) {
return __builtin_altivec_vaddecuq(__a, __b, __c);		return __builtin_altivec_vaddecuq(__a, __b, __c);
}		}

static __inline__ vector unsigned __int128 __ATTRS_o_ai		static __inline__ vector unsigned __int128 __ATTRS_o_ai
vec_addec(vector unsigned __int128 __a, vector unsigned __int128 __b,		vec_addec(vector unsigned __int128 __a, vector unsigned __int128 __b,
vector unsigned __int128 __c) {		vector unsigned __int128 __c) {
return __builtin_altivec_vaddecuq(__a, __b, __c);		return __builtin_altivec_vaddecuq(__a, __b, __c);
}		}

		static __inline__ vector signed int __ATTRS_o_ai
		vec_addec(vector signed int __a, vector signed int __b,
		vector signed int __c) {

		signed int __result[4];
		for (int i = 0; i < 4; i++) {
		unsigned int __tempa = (unsigned int) __a[i];
		unsigned int __tempb = (unsigned int) __b[i];
		unsigned int __tempc = (unsigned int) __c[i];
		__tempc = __tempc & 0x00000001;
		nemanjaiUnsubmitted Not Done Reply Inline Actions Is it not a little cleaner and more readable to just mask out the `__c` parameter before the loop (similarly to the masking you've done with `__mask` above)? nemanjai: Is it not a little cleaner and more readable to just mask out the `__c` parameter before the…
		unsigned long long __longa = (unsigned long long) __tempa;
		nemanjaiUnsubmitted Not Done Reply Inline Actions I think it's a little more clear and obvious what is happening if you actually have just a single cast and mask - i.e. unsigned long long __longa = ((unsigned long long) __a[i]) & 0xFFFFFFFF; nemanjai: I think it's a little more clear and obvious what is happening if you actually have just a…
		kbartonUnsubmitted Not Done Reply Inline Actions Is a mask actually needed? This seems to be what is done in the vec_addec function below, without the cast. I agree that is cleaner. The other minor nit is to pick a single value for the mask (1, 0x01, 0x00000001) and use it consistently. kbarton: Is a mask actually needed? This seems to be what is done in the vec_addec function below…
		amehsanAuthorUnsubmitted Not Done Reply Inline Actions To come up with this code pattern I looked at the following pieces of codes: unsigned long long f (int t) { return (unsigned long long) t; } When compiled with optimization produces define i64 @f(i32 signext %t) local_unnamed_addr #0 { entry: %conv = sext i32 %t to i64 ret i64 %conv } Which is incorrect. Also unsigned long long f (int t) { return (unsigned long long)(unsigned) t; } ~ results in define i64 @f(i32 signext %t) local_unnamed_addr #0 { entry: %conv = zext i32 %t to i64 ret i64 %conv } and .Lfunc_begin0: # BB#0: # %entry clrldi 3, 3, 32 blr So I think the code here is optimal and correct and there is no need to change it. amehsan: To come up with this code pattern I looked at the following pieces of codes: ``` unsigned…
		unsigned long long __longb = (unsigned long long) __tempb;
		unsigned long long __longc = (unsigned long long) __tempc;
		unsigned long long __sum = __longa + __longb + __longc;
		unsigned long long __res = (__sum >> 32) & 0x01;
		unsigned long long __tempres = (unsigned int) __res;
		__result[i] = (signed int) __tempres;
		}

		vector signed int ret = { __result[0], __result[1], __result[2], __result[3] };
		return ret;
		}

		static __inline__ vector unsigned int __ATTRS_o_ai
		vec_addec(vector unsigned int __a, vector unsigned int __b,
		vector unsigned int __c) {

		unsigned int __result[4];
		for (int i = 0; i < 4; i++) {
		unsigned int __tempc = __c[i] & 1;
		unsigned long long __longa = (unsigned long long) __a[i];
		unsigned long long __longb = (unsigned long long) __b[i];
		unsigned long long __longc = (unsigned long long) __tempc;
		unsigned long long __sum = __longa + __longb + __longc;
		unsigned long long __res = (__sum >> 32) & 0x01;
		unsigned long long __tempres = (unsigned int) __res;
		__result[i] = (signed int) __tempres;
		}

		vector unsigned int ret = { __result[0], __result[1], __result[2], __result[3] };
		return ret;
		}

#endif		#endif

/* vec_vaddubm */		/* vec_vaddubm */

#define __builtin_altivec_vaddubm vec_vaddubm		#define __builtin_altivec_vaddubm vec_vaddubm

static __inline__ vector signed char __ATTRS_o_ai		static __inline__ vector signed char __ATTRS_o_ai
vec_vaddubm(vector signed char __a, vector signed char __b) {		vec_vaddubm(vector signed char __a, vector signed char __b) {
▲ Show 20 Lines • Show All 9,826 Lines • ▼ Show 20 Lines

static __inline__ vector float __attribute__((__always_inline__))		static __inline__ vector float __attribute__((__always_inline__))
vec_vsubfp(vector float __a, vector float __b) {		vec_vsubfp(vector float __a, vector float __b) {
return __a - __b;		return __a - __b;
}		}

/* vec_subc */		/* vec_subc */

		static __inline__ vector signed int __ATTRS_o_ai
		vec_subc(vector signed int __a, vector signed int __b) {
		return __builtin_altivec_vsubcuw(__a, __b);
		}

static __inline__ vector unsigned int __ATTRS_o_ai		static __inline__ vector unsigned int __ATTRS_o_ai
vec_subc(vector unsigned int __a, vector unsigned int __b) {		vec_subc(vector unsigned int __a, vector unsigned int __b) {
return __builtin_altivec_vsubcuw(__a, __b);		return __builtin_altivec_vsubcuw(__a, __b);
}		}

#if defined(__POWER8_VECTOR__) && defined(__powerpc64__)		#if defined(__POWER8_VECTOR__) && defined(__powerpc64__)
static __inline__ vector unsigned __int128 __ATTRS_o_ai		static __inline__ vector unsigned __int128 __ATTRS_o_ai
vec_subc(vector unsigned __int128 __a, vector unsigned __int128 __b) {		vec_subc(vector unsigned __int128 __a, vector unsigned __int128 __b) {
▲ Show 20 Lines • Show All 229 Lines • ▼ Show 20 Lines
}		}

static __inline__ vector unsigned __int128 __ATTRS_o_ai		static __inline__ vector unsigned __int128 __ATTRS_o_ai
vec_vsubeuqm(vector unsigned __int128 __a, vector unsigned __int128 __b,		vec_vsubeuqm(vector unsigned __int128 __a, vector unsigned __int128 __b,
vector unsigned __int128 __c) {		vector unsigned __int128 __c) {
return __builtin_altivec_vsubeuqm(__a, __b, __c);		return __builtin_altivec_vsubeuqm(__a, __b, __c);
}		}

		static __inline__ vector signed __int128 __ATTRS_o_ai
		vec_sube(vector signed __int128 __a, vector signed __int128 __b,
		vector signed __int128 __c) {
		return __builtin_altivec_vsubeuqm(__a, __b, __c);
		}

		static __inline__ vector unsigned __int128 __ATTRS_o_ai
		vec_sube(vector unsigned __int128 __a, vector unsigned __int128 __b,
		vector unsigned __int128 __c) {
		return __builtin_altivec_vsubeuqm(__a, __b, __c);
		}

/* vec_vsubcuq */		/* vec_vsubcuq */

static __inline__ vector signed __int128 __ATTRS_o_ai		static __inline__ vector signed __int128 __ATTRS_o_ai
vec_vsubcuq(vector signed __int128 __a, vector signed __int128 __b) {		vec_vsubcuq(vector signed __int128 __a, vector signed __int128 __b) {
return __builtin_altivec_vsubcuq(__a, __b);		return __builtin_altivec_vsubcuq(__a, __b);
}		}

static __inline__ vector unsigned __int128 __ATTRS_o_ai		static __inline__ vector unsigned __int128 __ATTRS_o_ai
Show All 9 Lines	vec_vsubecuq(vector signed __int128 __a, vector signed __int128 __b,
return __builtin_altivec_vsubecuq(__a, __b, __c);		return __builtin_altivec_vsubecuq(__a, __b, __c);
}		}

static __inline__ vector unsigned __int128 __ATTRS_o_ai		static __inline__ vector unsigned __int128 __ATTRS_o_ai
vec_vsubecuq(vector unsigned __int128 __a, vector unsigned __int128 __b,		vec_vsubecuq(vector unsigned __int128 __a, vector unsigned __int128 __b,
vector unsigned __int128 __c) {		vector unsigned __int128 __c) {
return __builtin_altivec_vsubecuq(__a, __b, __c);		return __builtin_altivec_vsubecuq(__a, __b, __c);
}		}

		static __inline__ vector signed int __ATTRS_o_ai
		vec_subec(vector signed int __a, vector signed int __b,
		vector signed int __c) {
		return vec_addec(__a, ~__b, __c);
		kbartonUnsubmitted Not Done Reply Inline Actions Why do we mask the carries for sign/unsigned ints, but not __128 ints? kbarton: Why do we mask the carries for sign/unsigned ints, but not __128 ints?
		amehsanAuthorUnsubmitted Not Done Reply Inline Actions for quadword, hardware does the masking (implicitly, by only looking at the rightmost bit) amehsan: for quadword, hardware does the masking (implicitly, by only looking at the rightmost bit)
		}

		static __inline__ vector unsigned int __ATTRS_o_ai
		vec_subec(vector unsigned int __a, vector unsigned int __b,
		vector unsigned int __c) {
		return vec_addec(__a, ~__b, __c);
		}

		kbartonUnsubmitted Not Done Reply Inline Actions Please reorder these to put the __128 below signed and unsigned. kbarton: Please reorder these to put the __128 below signed and unsigned.
		static __inline__ vector signed __int128 __ATTRS_o_ai
		vec_subec(vector signed __int128 __a, vector signed __int128 __b,
		vector signed __int128 __c) {
		return __builtin_altivec_vsubecuq(__a, __b, __c);
		}

		static __inline__ vector unsigned __int128 __ATTRS_o_ai
		vec_subec(vector unsigned __int128 __a, vector unsigned __int128 __b,
		vector unsigned __int128 __c) {
		return __builtin_altivec_vsubecuq(__a, __b, __c);
		}
#endif // defined(__POWER8_VECTOR__) && defined(__powerpc64__)		#endif // defined(__POWER8_VECTOR__) && defined(__powerpc64__)

		static __inline__ vector signed int __ATTRS_o_ai
		vec_sube(vector signed int __a, vector signed int __b,
		vector signed int __c) {
		vector signed int __mask = {1, 1, 1, 1};
		vector signed int __carry = __c & __mask;
		return vec_adde(__a, ~__b, __carry);
		kbartonUnsubmitted Not Done Reply Inline Actions Is it possible to use vec_adde(a, ~b, __carry)? kbarton: Is it possible to use vec_adde(__a, ~__b, __carry)?
		}

		static __inline__ vector unsigned int __ATTRS_o_ai
		vec_sube(vector unsigned int __a, vector unsigned int __b,
		vector unsigned int __c) {
		vector unsigned int __mask = {1, 1, 1, 1};
		vector unsigned int __carry = __c & __mask;
		return vec_adde(__a, ~__b, __carry);
		}
/* vec_sum4s */		/* vec_sum4s */

static __inline__ vector int __ATTRS_o_ai vec_sum4s(vector signed char __a,		static __inline__ vector int __ATTRS_o_ai vec_sum4s(vector signed char __a,
vector int __b) {		vector int __b) {
return __builtin_altivec_vsum4sbs(__a, __b);		return __builtin_altivec_vsum4sbs(__a, __b);
}		}

static __inline__ vector unsigned int __ATTRS_o_ai		static __inline__ vector unsigned int __ATTRS_o_ai
▲ Show 20 Lines • Show All 4,585 Lines • ▼ Show 20 Lines
static __inline__ vector unsigned long long __attribute__((__always_inline__))		static __inline__ vector unsigned long long __attribute__((__always_inline__))
vec_bperm(vector unsigned __int128 __a, vector unsigned char __b) {		vec_bperm(vector unsigned __int128 __a, vector unsigned char __b) {
return __builtin_altivec_vbpermq((vector unsigned char)__a,		return __builtin_altivec_vbpermq((vector unsigned char)__a,
(vector unsigned char)__b);		(vector unsigned char)__b);
}		}
#endif		#endif
#endif		#endif

		static vector float __ATTRS_o_ai vec_neg(vector float __a) {
		return -__a;
		}

		#ifdef __VSX__
		static vector double __ATTRS_o_ai vec_neg(vector double __a) {
		return -__a;
		}

		#endif

		#if defined(__POWER8_VECTOR__) && defined(__powerpc64__)
		static vector long long __ATTRS_o_ai vec_neg(vector long long __a) {
		return -__a;
		}
		#endif

		static vector signed int __ATTRS_o_ai vec_neg(vector signed int __a) {
		return -__a;
		}

		static vector signed short __ATTRS_o_ai vec_neg(vector signed short __a) {
		return -__a;
		}

		static vector signed char __ATTRS_o_ai vec_neg(vector signed char __a) {
		return -__a;
		}

		static vector float __ATTRS_o_ai vec_nabs(vector float __a) {
		return - vec_abs(__a);
		}

		#ifdef __VSX__
		static vector double __ATTRS_o_ai vec_nabs(vector double __a) {
		return - vec_abs(__a);
		}

		#endif

		#if defined(__POWER8_VECTOR__) && defined(__powerpc64__)
		static vector long long __ATTRS_o_ai vec_nabs(vector long long __a) {
		return __builtin_altivec_vminsd(__a, -__a);
		}
		#endif

		static vector signed int __ATTRS_o_ai vec_nabs(vector signed int __a) {
		return __builtin_altivec_vminsw(__a, -__a);
		}

		static vector signed short __ATTRS_o_ai vec_nabs(vector signed short __a) {
		return __builtin_altivec_vminsh(__a, -__a);
		}

		static vector signed char __ATTRS_o_ai vec_nabs(vector signed char __a) {
		return __builtin_altivec_vminsb(__a, -__a);
		}

#undef __ATTRS_o_ai		#undef __ATTRS_o_ai

#endif /* __ALTIVEC_H */		#endif /* __ALTIVEC_H */

test/CodeGen/builtins-ppc-altivec.c

Show First 20 Lines • Show All 80 Lines • ▼ Show 20 Lines
// CHECK: and <4 x i32> {{.*}}, <i32 2147483647, i32 2147483647, i32 2147483647, i32 2147483647>		// CHECK: and <4 x i32> {{.*}}, <i32 2147483647, i32 2147483647, i32 2147483647, i32 2147483647>
// CHECK: bitcast <4 x i32> %{{.*}} to <4 x float>		// CHECK: bitcast <4 x i32> %{{.*}} to <4 x float>
// CHECK: store <4 x float> %{{.}}, <4 x float> @vf		// CHECK: store <4 x float> %{{.}}, <4 x float> @vf
// CHECK-LE: bitcast <4 x float> %{{.*}} to <4 x i32>		// CHECK-LE: bitcast <4 x float> %{{.*}} to <4 x i32>
// CHECK-LE: and <4 x i32> {{.*}}, <i32 2147483647, i32 2147483647, i32 2147483647, i32 2147483647>		// CHECK-LE: and <4 x i32> {{.*}}, <i32 2147483647, i32 2147483647, i32 2147483647, i32 2147483647>
// CHECK-LE: bitcast <4 x i32> %{{.*}} to <4 x float>		// CHECK-LE: bitcast <4 x i32> %{{.*}} to <4 x float>
// CHECK-LE: store <4 x float> %{{.}}, <4 x float> @vf		// CHECK-LE: store <4 x float> %{{.}}, <4 x float> @vf
// CHECK-NOALTIVEC: error: use of undeclared identifier 'vf'		// CHECK-NOALTIVEC: error: use of undeclared identifier 'vf'
// CHECK-NOALTIVEC: vf = vec_abs(vf)		// CHECK-NOALTIVEC: vf = vec_abs(vf)

		vsc = vec_nabs(vsc);
		// CHECK: sub <16 x i8> zeroinitializer
		// CHECK: @llvm.ppc.altivec.vminsb
		// CHECK-LE: sub <16 x i8> zeroinitializer
		// CHECK-LE: @llvm.ppc.altivec.vminsb

		vs = vec_nabs(vs);
		// CHECK: sub <8 x i16> zeroinitializer
		// CHECK: @llvm.ppc.altivec.vminsh
		// CHECK-LE: sub <8 x i16> zeroinitializer
		// CHECK-LE: @llvm.ppc.altivec.vminsh

		vi = vec_nabs(vi);
		// CHECK: sub <4 x i32> zeroinitializer
		// CHECK: @llvm.ppc.altivec.vminsw
		// CHECK-LE: sub <4 x i32> zeroinitializer
		// CHECK-LE: @llvm.ppc.altivec.vminsw

		res_vi = vec_neg(vi);
		// CHECK: sub <4 x i32> zeroinitializer, {{%[0-9]+}}
		// CHECK-LE: sub <4 x i32> zeroinitializer, {{%[0-9]+}}
		// CHECK-NOALTIVEC: error: use of undeclared identifier 'vi'
		// CHECK-NOALTIVEC: vi = vec_neg(vi);

		res_vs = vec_neg(vs);
		// CHECK: sub <8 x i16> zeroinitializer, {{%[0-9]+}}
		// CHECK-LE: sub <8 x i16> zeroinitializer, {{%[0-9]+}}
		// CHECK-NOALTIVEC: error: use of undeclared identifier 'vs'
		// CHECK-NOALTIVEC: res_vs = vec_neg(vs);

		res_vsc = vec_neg(vsc);
		// CHECK: sub <16 x i8> zeroinitializer, {{%[0-9]+}}
		// CHECK-LE: sub <16 x i8> zeroinitializer, {{%[0-9]+}}
		// CHECK-NOALTIVEC: error: use of undeclared identifier 'vsc'
		// CHECK-NOALTIVEC: res_vsc = vec_neg(vsc);

/* vec_abs */		/* vec_abs */
vsc = vec_abss(vsc);		vsc = vec_abss(vsc);
// CHECK: @llvm.ppc.altivec.vsubsbs		// CHECK: @llvm.ppc.altivec.vsubsbs
// CHECK: @llvm.ppc.altivec.vmaxsb		// CHECK: @llvm.ppc.altivec.vmaxsb
// CHECK-LE: @llvm.ppc.altivec.vsubsbs		// CHECK-LE: @llvm.ppc.altivec.vsubsbs
// CHECK-LE: @llvm.ppc.altivec.vmaxsb		// CHECK-LE: @llvm.ppc.altivec.vmaxsb

vs = vec_abss(vs);		vs = vec_abss(vs);
▲ Show 20 Lines • Show All 80 Lines • ▼ Show 20 Lines	// CHECK-LE: add <4 x i32>
res_vui = vec_add(vui, vbi);		res_vui = vec_add(vui, vbi);
// CHECK: add <4 x i32>		// CHECK: add <4 x i32>
// CHECK-LE: add <4 x i32>		// CHECK-LE: add <4 x i32>

res_vf = vec_add(vf, vf);		res_vf = vec_add(vf, vf);
// CHECK: fadd <4 x float>		// CHECK: fadd <4 x float>
// CHECK-LE: fadd <4 x float>		// CHECK-LE: fadd <4 x float>

		res_vi = vec_adde(vi, vi, vi);
		// CHECK: and <4 x i32>
		// CHECK: add <4 x i32>
		// CHECK: add <4 x i32>
		// CHECK-LE: and <4 x i32>
		// CHECK-LE: add <4 x i32>
		// CHECK-LE: add <4 x i32>

		res_vui = vec_adde(vui, vui, vui);
		// CHECK: and <4 x i32>
		// CHECK: add <4 x i32>
		// CHECK: add <4 x i32>
		// CHECK-LE: and <4 x i32>
		// CHECK-LE: add <4 x i32>
		// CHECK-LE: add <4 x i32>

res_vsc = vec_vaddubm(vsc, vsc);		res_vsc = vec_vaddubm(vsc, vsc);
// CHECK: add <16 x i8>		// CHECK: add <16 x i8>
// CHECK-LE: add <16 x i8>		// CHECK-LE: add <16 x i8>

res_vsc = vec_vaddubm(vbc, vsc);		res_vsc = vec_vaddubm(vbc, vsc);
// CHECK: add <16 x i8>		// CHECK: add <16 x i8>
// CHECK-LE: add <16 x i8>		// CHECK-LE: add <16 x i8>

▲ Show 20 Lines • Show All 4,959 Lines • ▼ Show 20 Lines	// CHECK-LE: sub <4 x i32>
res_vui = vec_sub(vui, vbi);		res_vui = vec_sub(vui, vbi);
// CHECK: sub <4 x i32>		// CHECK: sub <4 x i32>
// CHECK-LE: sub <4 x i32>		// CHECK-LE: sub <4 x i32>

res_vf = vec_sub(vf, vf);		res_vf = vec_sub(vf, vf);
// CHECK: fsub <4 x float>		// CHECK: fsub <4 x float>
// CHECK-LE: fsub <4 x float>		// CHECK-LE: fsub <4 x float>



res_vsc = vec_vsububm(vsc, vsc);		res_vsc = vec_vsububm(vsc, vsc);
// CHECK: sub <16 x i8>		// CHECK: sub <16 x i8>
// CHECK-LE: sub <16 x i8>		// CHECK-LE: sub <16 x i8>

res_vsc = vec_vsububm(vbc, vsc);		res_vsc = vec_vsububm(vbc, vsc);
// CHECK: sub <16 x i8>		// CHECK: sub <16 x i8>
// CHECK-LE: sub <16 x i8>		// CHECK-LE: sub <16 x i8>

▲ Show 20 Lines • Show All 65 Lines • ▼ Show 20 Lines
// CHECK: fsub <4 x float>		// CHECK: fsub <4 x float>
// CHECK-LE: fsub <4 x float>		// CHECK-LE: fsub <4 x float>

/* vec_subc */		/* vec_subc */
res_vui = vec_subc(vui, vui);		res_vui = vec_subc(vui, vui);
// CHECK: @llvm.ppc.altivec.vsubcuw		// CHECK: @llvm.ppc.altivec.vsubcuw
// CHECK-LE: @llvm.ppc.altivec.vsubcuw		// CHECK-LE: @llvm.ppc.altivec.vsubcuw

		res_vi = vec_subc(vi, vi);
		// CHECK: @llvm.ppc.altivec.vsubcuw
		// CHECK-LE: @llvm.ppc.altivec.vsubcuw

res_vui = vec_vsubcuw(vui, vui);		res_vui = vec_vsubcuw(vui, vui);
// CHECK: @llvm.ppc.altivec.vsubcuw		// CHECK: @llvm.ppc.altivec.vsubcuw
// CHECK-LE: @llvm.ppc.altivec.vsubcuw		// CHECK-LE: @llvm.ppc.altivec.vsubcuw

/* vec_subs */		/* vec_subs */
res_vsc = vec_subs(vsc, vsc);		res_vsc = vec_subs(vsc, vsc);
// CHECK: @llvm.ppc.altivec.vsubsbs		// CHECK: @llvm.ppc.altivec.vsubsbs
// CHECK-LE: @llvm.ppc.altivec.vsubsbs		// CHECK-LE: @llvm.ppc.altivec.vsubsbs
▲ Show 20 Lines • Show All 61 Lines • ▼ Show 20 Lines	// CHECK-LE: @llvm.ppc.altivec.vsubuws
res_vui = vec_subs(vbi, vui);		res_vui = vec_subs(vbi, vui);
// CHECK: @llvm.ppc.altivec.vsubuws		// CHECK: @llvm.ppc.altivec.vsubuws
// CHECK-LE: @llvm.ppc.altivec.vsubuws		// CHECK-LE: @llvm.ppc.altivec.vsubuws

res_vui = vec_subs(vui, vbi);		res_vui = vec_subs(vui, vbi);
// CHECK: @llvm.ppc.altivec.vsubuws		// CHECK: @llvm.ppc.altivec.vsubuws
// CHECK-LE: @llvm.ppc.altivec.vsubuws		// CHECK-LE: @llvm.ppc.altivec.vsubuws

		res_vi = vec_sube(vi, vi, vi);
		// CHECK: and <4 x i32>
		// CHECK: xor <4 x i32> {{%[0-9]+}}, <i32 -1, i32 -1, i32 -1, i32 -1>
		// CHECK: add <4 x i32>
		// CHECK: add <4 x i32>
		// CHECK-LE: and <4 x i32>
		// CHECK-LE: xor <4 x i32> {{%[0-9]+}}, <i32 -1, i32 -1, i32 -1, i32 -1>
		// CHECK-LE: add <4 x i32>
		// CHECK-LE: add <4 x i32>

		res_vui = vec_sube(vui, vui, vui);
		// CHECK: and <4 x i32>
		// CHECK: xor <4 x i32> {{%[0-9]+}}, <i32 -1, i32 -1, i32 -1, i32 -1>
		// CHECK: add <4 x i32>
		// CHECK: add <4 x i32>
		// CHECK-LE: and <4 x i32>
		// CHECK-LE: xor <4 x i32> {{%[0-9]+}}, <i32 -1, i32 -1, i32 -1, i32 -1>
		// CHECK-LE: add <4 x i32>
		// CHECK-LE: add <4 x i32>

res_vsc = vec_vsubsbs(vsc, vsc);		res_vsc = vec_vsubsbs(vsc, vsc);
// CHECK: @llvm.ppc.altivec.vsubsbs		// CHECK: @llvm.ppc.altivec.vsubsbs
// CHECK-LE: @llvm.ppc.altivec.vsubsbs		// CHECK-LE: @llvm.ppc.altivec.vsubsbs

res_vsc = vec_vsubsbs(vbc, vsc);		res_vsc = vec_vsubsbs(vbc, vsc);
// CHECK: @llvm.ppc.altivec.vsubsbs		// CHECK: @llvm.ppc.altivec.vsubsbs
// CHECK-LE: @llvm.ppc.altivec.vsubsbs		// CHECK-LE: @llvm.ppc.altivec.vsubsbs

▲ Show 20 Lines • Show All 3,671 Lines • Show Last 20 Lines

test/CodeGen/builtins-ppc-p8vector.c

	Show First 20 Lines • Show All 67 Lines • ▼ Show 20 Lines
	void test1() {			void test1() {

	/* vec_abs */			/* vec_abs */
	res_vsll = vec_abs(vsll);			res_vsll = vec_abs(vsll);
	// CHECK: call <2 x i64> @llvm.ppc.altivec.vmaxsd(<2 x i64> %{{[0-9]*}}, <2 x i64>			// CHECK: call <2 x i64> @llvm.ppc.altivec.vmaxsd(<2 x i64> %{{[0-9]*}}, <2 x i64>
	// CHECK-LE: call <2 x i64> @llvm.ppc.altivec.vmaxsd(<2 x i64> %{{[0-9]*}}, <2 x i64>			// CHECK-LE: call <2 x i64> @llvm.ppc.altivec.vmaxsd(<2 x i64> %{{[0-9]*}}, <2 x i64>
	// CHECK-PPC: error: call to 'vec_abs' is ambiguous			// CHECK-PPC: error: call to 'vec_abs' is ambiguous

	res_vd = vec_abs(vda);
	// CHECK: call <2 x double> @llvm.fabs.v2f64(<2 x double> %{{.*}})
	// CHECK: store <2 x double> %{{.}}, <2 x double> @res_vd
	// CHECK-LE: call <2 x double> @llvm.fabs.v2f64(<2 x double> %{{.*}})
	// CHECK-LE: store <2 x double> %{{.}}, <2 x double> @res_vd
	// CHECK-PPC: error: call to 'vec_abs' is ambiguous

	/* vec_add */			/* vec_add */
	res_vsll = vec_add(vsll, vsll);			res_vsll = vec_add(vsll, vsll);
	// CHECK: add <2 x i64>			// CHECK: add <2 x i64>
	// CHECK-LE: add <2 x i64>			// CHECK-LE: add <2 x i64>
	// CHECK-PPC: error: call to 'vec_add' is ambiguous			// CHECK-PPC: error: call to 'vec_add' is ambiguous

	res_vull = vec_add(vull, vull);			res_vull = vec_add(vull, vull);
	// CHECK: add <2 x i64>			// CHECK: add <2 x i64>
	▲ Show 20 Lines • Show All 1,408 Lines • ▼ Show 20 Lines
	// CHECK: llvm.ppc.altivec.vgbbd			// CHECK: llvm.ppc.altivec.vgbbd
	// CHECK-LE: llvm.ppc.altivec.vgbbd			// CHECK-LE: llvm.ppc.altivec.vgbbd
	// CHECK-PPC: warning: implicit declaration of function 'vec_gb'			// CHECK-PPC: warning: implicit declaration of function 'vec_gb'

	res_vull = vec_bperm(vux, vux);			res_vull = vec_bperm(vux, vux);
	// CHECK: llvm.ppc.altivec.vbpermq			// CHECK: llvm.ppc.altivec.vbpermq
	// CHECK-LE: llvm.ppc.altivec.vbpermq			// CHECK-LE: llvm.ppc.altivec.vbpermq
	// CHECK-PPC: warning: implicit declaration of function 'vec_bperm'			// CHECK-PPC: warning: implicit declaration of function 'vec_bperm'

				res_vsll = vec_neg(vsll);
				// CHECK: sub <2 x i64> zeroinitializer, {{%[0-9]+}}
				// CHECK-LE: sub <2 x i64> zeroinitializer, {{%[0-9]+}}
				// CHECK_PPC: call to 'vec_neg' is ambiguous


				}


				vector signed int test_vec_addec_signed (vector signed int a, vector signed int b, vector signed int c) {
				return vec_addec(a, b, c);
				// CHECK-LABEL: @test_vec_addec_signed
				// CHECK-LABEL: for.cond.i:
				// CHECK: icmp slt i32 {{%[0-9]+}}, 4
				// CHECK-LABEL: for.body.i:
				// CHECK: extractelement
				// CHECK: extractelement
				// CHECK: extractelement
				// CHECK: and i32 {{%[0-9]+}}, 1
				// CHECK: zext
				// CHECK: zext
				// CHECK: zext
				// CHECK: add i64
				// CHECK: add i64
				// CHECK: lshr i64
				// CHECK: and i64
				// CHECK: trunc i64 {{%[0-9]+}} to i32
				// CHECK: zext i32
				// CHECK: trunc i64 {{%[0-9]+}} to i32
				// CHECK: sext i32
				// CHECK: add nsw i32
				// CHECK: br label
				// CHECK: ret <4 x i32>

				}


				vector unsigned int test_vec_addec_unsigned (vector unsigned int a, vector unsigned int b, vector unsigned int c) {
				return vec_addec(a, b, c);

				// CHECK-LABEL: @test_vec_addec_unsigned
				// CHECK-LABEL: for.cond.i:
				// CHECK: icmp slt i32 {{%[0-9]+}}, 4
				// CHECK-LABEL: for.body.i:
				// CHECK: extractelement
				// CHECK: and i32
				// CHECK: extractelement
				// CHECK: zext i32
				// CHECK: extractelement
				// CHECK: zext i32
				// CHECK: zext i32
				// CHECK: add i64
				// CHECK: lshr i64
				// CHECK: and i64
				// CHECK: trunc i64 {{%[0-9]+}} to i32
				// CHECK: zext i32
				// CHECK: trunc i64 {{%[0-9]+}} to i32
				// CHECK: sext i32
				// CHECK: add nsw i32
				// CHECK: br label
				// CHECK: ret <4 x i32>
				}

				vector signed int test_vec_subec_signed (vector signed int a, vector signed int b, vector signed int c) {
				return vec_subec(a, b, c);
				// CHECK-LABEL: @test_vec_subec_signed
				// CHECK: xor <4 x i32> {{%[0-9]+}}, <i32 -1, i32 -1, i32 -1, i32 -1>
				// CHECK-LABEL: for.cond.i.i:
				// CHECK: ret <4 x i32>
				}

				vector unsigned int test_vec_subec_unsigned (vector unsigned int a, vector unsigned int b, vector unsigned int c) {
				return vec_subec(a, b, c);

				// CHECK-LABEL: @test_vec_subec_unsigned
				// CHECK: xor <4 x i32> {{%[0-9]+}}, <i32 -1, i32 -1, i32 -1, i32 -1>
				// CHECK-LABEL: for.cond.i.i:
				// CHECK: ret <4 x i32>
	}			}

test/CodeGen/builtins-ppc-quadword.c

	Show First 20 Lines • Show All 113 Lines • ▼ Show 20 Lines
	// CHECK-PPC: error: assigning to '__vector unsigned __int128' (vector of 1 'unsigned __int128' value) from incompatible type 'int'			// CHECK-PPC: error: assigning to '__vector unsigned __int128' (vector of 1 'unsigned __int128' value) from incompatible type 'int'

	/* vec_vsubeuqm */			/* vec_vsubeuqm */
	res_vlll = vec_vsubeuqm(vlll, vlll, vlll);			res_vlll = vec_vsubeuqm(vlll, vlll, vlll);
	// CHECK: @llvm.ppc.altivec.vsubeuqm			// CHECK: @llvm.ppc.altivec.vsubeuqm
	// CHECK-LE: @llvm.ppc.altivec.vsubeuqm			// CHECK-LE: @llvm.ppc.altivec.vsubeuqm
	// CHECK-PPC: error: assigning to '__vector __int128' (vector of 1 '__int128' value) from incompatible type 'int'			// CHECK-PPC: error: assigning to '__vector __int128' (vector of 1 '__int128' value) from incompatible type 'int'

				/* vec_sube */
				res_vlll = vec_sube(vlll, vlll, vlll);
				// CHECK: @llvm.ppc.altivec.vsubeuqm
				// CHECK-LE: @llvm.ppc.altivec.vsubeuqm
				// CHECK-PPC: error: call to 'vec_sube' is ambiguous

				res_vulll = vec_sube(vulll, vulll, vulll);
				// CHECK: @llvm.ppc.altivec.vsubeuqm
				// CHECK-LE: @llvm.ppc.altivec.vsubeuqm
				// CHECK-PPC: error: call to 'vec_sube' is ambiguous

				res_vlll = vec_sube(vlll, vlll, vlll);
				// CHECK: @llvm.ppc.altivec.vsubeuqm
				// CHECK-LE: @llvm.ppc.altivec.vsubeuqm
				// CHECK-PPC: error: call to 'vec_sube' is ambiguous

	res_vulll = vec_vsubeuqm(vulll, vulll, vulll);			res_vulll = vec_vsubeuqm(vulll, vulll, vulll);
	// CHECK: @llvm.ppc.altivec.vsubeuqm			// CHECK: @llvm.ppc.altivec.vsubeuqm
	// CHECK-LE: @llvm.ppc.altivec.vsubeuqm			// CHECK-LE: @llvm.ppc.altivec.vsubeuqm
	// CHECK-PPC: error: assigning to '__vector unsigned __int128' (vector of 1 'unsigned __int128' value) from incompatible type 'int'			// CHECK-PPC: error: assigning to '__vector unsigned __int128' (vector of 1 'unsigned __int128' value) from incompatible type 'int'

				res_vulll = vec_sube(vulll, vulll, vulll);
				// CHECK: @llvm.ppc.altivec.vsubeuqm
				// CHECK-LE: @llvm.ppc.altivec.vsubeuqm
				// CHECK-PPC: error: call to 'vec_sube' is ambiguous

	/* vec_subc */			/* vec_subc */
	res_vlll = vec_subc(vlll, vlll);			res_vlll = vec_subc(vlll, vlll);
	// CHECK: @llvm.ppc.altivec.vsubcuq			// CHECK: @llvm.ppc.altivec.vsubcuq
	// CHECK-LE: @llvm.ppc.altivec.vsubcuq			// CHECK-LE: @llvm.ppc.altivec.vsubcuq
	// KCHECK-PPC: error: call to 'vec_subc' is ambiguous			// KCHECK-PPC: error: call to 'vec_subc' is ambiguous

	res_vulll = vec_subc(vulll, vulll);			res_vulll = vec_subc(vulll, vulll);
	// CHECK: @llvm.ppc.altivec.vsubcuq			// CHECK: @llvm.ppc.altivec.vsubcuq
	Show All 10 Lines
	// CHECK: @llvm.ppc.altivec.vsubcuq			// CHECK: @llvm.ppc.altivec.vsubcuq
	// CHECK-LE: @llvm.ppc.altivec.vsubcuq			// CHECK-LE: @llvm.ppc.altivec.vsubcuq
	// CHECK-PPC: error: assigning to '__vector unsigned __int128' (vector of 1 'unsigned __int128' value) from incompatible type 'int'			// CHECK-PPC: error: assigning to '__vector unsigned __int128' (vector of 1 'unsigned __int128' value) from incompatible type 'int'

	/* vec_vsubecuq */			/* vec_vsubecuq */
	res_vlll = vec_vsubecuq(vlll, vlll, vlll);			res_vlll = vec_vsubecuq(vlll, vlll, vlll);
	// CHECK: @llvm.ppc.altivec.vsubecuq			// CHECK: @llvm.ppc.altivec.vsubecuq
	// CHECK-LE: @llvm.ppc.altivec.vsubecuq			// CHECK-LE: @llvm.ppc.altivec.vsubecuq
	// CHECK-PPC: error: assigning to '__vector __int128' (vector of 1 '__int128' value) from incompatible type 'int'			// CHECK-PPC: error: assigning to '__vector __int128' (vector of 1 '__int128' value) from incompatible type 'int'

	res_vulll = vec_vsubecuq(vulll, vulll, vulll);			res_vulll = vec_vsubecuq(vulll, vulll, vulll);
	// CHECK: @llvm.ppc.altivec.vsubecuq			// CHECK: @llvm.ppc.altivec.vsubecuq
	// CHECK-LE: @llvm.ppc.altivec.vsubecuq			// CHECK-LE: @llvm.ppc.altivec.vsubecuq
	// CHECK-PPC: error: assigning to '__vector unsigned __int128' (vector of 1 'unsigned __int128' value) from incompatible type 'int'			// CHECK-PPC: error: assigning to '__vector unsigned __int128' (vector of 1 'unsigned __int128' value) from incompatible type 'int'

				res_vlll = vec_subec(vlll, vlll, vlll);
				// CHECK: @llvm.ppc.altivec.vsubecuq
				// CHECK-LE: @llvm.ppc.altivec.vsubecuq
				// CHECK-PPC: error: assigning to '__vector __int128' (vector of 1 '__int128' value) from incompatible type 'int'

				res_vulll = vec_subec(vulll, vulll, vulll);
				// CHECK: @llvm.ppc.altivec.vsubecuq
				// CHECK-LE: @llvm.ppc.altivec.vsubecuq
				// CHECK-PPC: error: assigning to '__vector unsigned __int128' (vector of 1 'unsigned __int128' value) from incompatible type 'int'

	}			}

test/CodeGen/builtins-ppc-vsx.c

	Show First 20 Lines • Show All 63 Lines • ▼ Show 20 Lines
	void test1() {			void test1() {
	// CHECK-LABEL: define void @test1			// CHECK-LABEL: define void @test1
	// CHECK-LE-LABEL: define void @test1			// CHECK-LE-LABEL: define void @test1

	res_vf = vec_abs(vf);			res_vf = vec_abs(vf);
	// CHECK: call <4 x float> @llvm.fabs.v4f32(<4 x float> %{{[0-9]*}})			// CHECK: call <4 x float> @llvm.fabs.v4f32(<4 x float> %{{[0-9]*}})
	// CHECK-LE: call <4 x float> @llvm.fabs.v4f32(<4 x float> %{{[0-9]*}})			// CHECK-LE: call <4 x float> @llvm.fabs.v4f32(<4 x float> %{{[0-9]*}})

				res_vd = vec_abs(vd);
				// CHECK: call <2 x double> @llvm.fabs.v2f64(<2 x double> %{{[0-9]*}})
				// CHECK-LE: call <2 x double> @llvm.fabs.v2f64(<2 x double> %{{[0-9]*}})

				res_vf = vec_nabs(vf);
				// CHECK: [[VEC:%[0-9]+]] = call <4 x float> @llvm.fabs.v4f32(<4 x float> %{{[0-9]*}})
				// CHECK-NEXT: fsub <4 x float> <float -0.000000e+00, float -0.000000e+00, float -0.000000e+00, float -0.000000e+00>, [[VEC]]

				res_vd = vec_nabs(vd);
				// CHECK: [[VECD:%[0-9]+]] = call <2 x double> @llvm.fabs.v2f64(<2 x double> %{{[0-9]*}})
				// CHECK: fsub <2 x double> <double -0.000000e+00, double -0.000000e+00>, [[VECD]]

	dummy();			dummy();
	// CHECK: call void @dummy()			// CHECK: call void @dummy()
	// CHECK-LE: call void @dummy()			// CHECK-LE: call void @dummy()

	res_vd = vec_add(vd, vd);			res_vd = vec_add(vd, vd);
	// CHECK: fadd <2 x double>			// CHECK: fadd <2 x double>
	// CHECK-LE: fadd <2 x double>			// CHECK-LE: fadd <2 x double>

	▲ Show 20 Lines • Show All 995 Lines • ▼ Show 20 Lines
	// CHECK-LE: uitofp <2 x i64> %{{.*}} to <2 x double>			// CHECK-LE: uitofp <2 x i64> %{{.*}} to <2 x double>
	// CHECK-LE: fmul <2 x double>			// CHECK-LE: fmul <2 x double>

	res_vd = vec_ctf(vull, 31);			res_vd = vec_ctf(vull, 31);
	// CHECK: uitofp <2 x i64> %{{.*}} to <2 x double>			// CHECK: uitofp <2 x i64> %{{.*}} to <2 x double>
	// CHECK: fmul <2 x double>			// CHECK: fmul <2 x double>
	// CHECK-LE: uitofp <2 x i64> %{{.*}} to <2 x double>			// CHECK-LE: uitofp <2 x i64> %{{.*}} to <2 x double>
	// CHECK-LE: fmul <2 x double>			// CHECK-LE: fmul <2 x double>

				res_vf = vec_neg(vf);
				// CHECK: fsub <4 x float> <float -0.000000e+00, float -0.000000e+00, float -0.000000e+00, float -0.000000e+00>, {{%[0-9]+}}
				// CHECK-LE: fsub <4 x float> <float -0.000000e+00, float -0.000000e+00, float -0.000000e+00, float -0.000000e+00>, {{%[0-9]+}}

				res_vd = vec_neg(vd);
				// CHECK: fsub <2 x double> <double -0.000000e+00, double -0.000000e+00>, {{%[0-9]+}}
				// CHECK-LE: fsub <2 x double> <double -0.000000e+00, double -0.000000e+00>, {{%[0-9]+}}
	}			}

This is an archive of the discontinued LLVM Phabricator instance.

[PPC] support for arithmetic builtins in the FEClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 78376

lib/Headers/altivec.h

test/CodeGen/builtins-ppc-altivec.c

test/CodeGen/builtins-ppc-p8vector.c

test/CodeGen/builtins-ppc-quadword.c

test/CodeGen/builtins-ppc-vsx.c

[PPC] support for arithmetic builtins in the FE
ClosedPublic