This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/
-
lib/Basic/Targets/
-
Basic/
-
Targets/
1/3
PPC.h
-
test/CodeGen/
-
CodeGen/
-
ppc64-inline-asm.c
-
llvm/
-
lib/Target/PowerPC/
-
Target/
-
PowerPC/
2/4
PPCISelLowering.cpp
-
test/CodeGen/PowerPC/
-
CodeGen/
-
PowerPC/
1/2
inlineasm-vsx-reg.ll
-
vec-asm-disabled.ll

Differential D64119

[PowerPC] Support constraint code "ww"
ClosedPublic

Authored by MaskRay on Jul 2 2019, 10:28 PM.

Download Raw Diff

Details

Reviewers

awilfox
echristo
hfinkel
jsji
kbarton
nemanjai

Commits

rG1f333562de96: [PowerPC] Support constraint code "ww"
rL365106: [PowerPC] Support constraint code "ww"
rC365106: [PowerPC] Support constraint code "ww"

Summary

"ww" and "ws" are both constraint codes for VSX vector registers that
holds scalar double data. "ww" is preferred for float while "ws" is
preferred for double.

Diff Detail

Repository

rG LLVM Github Monorepo

Build Status

Buildable 34245
Build 34244: arc lint + arc unit

Event Timeline

MaskRay created this revision.Jul 2 2019, 10:28 PM

Herald added projects: Restricted Project, Restricted Project. · View Herald TranscriptJul 2 2019, 10:28 PM

Herald added subscribers: llvm-commits, cfe-commits, hiraditya. · View Herald Transcript

Harbormaster completed remote builds in B34245: Diff 207707.Jul 2 2019, 10:28 PM

It is great to add ww for compatibility.
However if we are going to add ww, looks like we should update ws as well?

clang/lib/Basic/Targets/PPC.h
211	Add some more comments for `w` to distinguish it from `s`? Do we want to keep compatibility with GCC? According to https://gcc.gnu.org/onlinedocs/gcc-9.1.0/gcc/Machine-Constraints.html#Machine-Constraints, `ww` is `FP or VSX register to perform float operations under -mvsx or NO_REGS.`, while `ws` is `VSX vector register to hold scalar double values` . So `ww` can use `FP` while `ws` can NOT ?
llvm/lib/Target/PowerPC/PPCISelLowering.cpp
14080	Should we exclude `FP` for `ws` and return `VFRCRegClass` instead of `VSFRCRegClass` ?
llvm/test/CodeGen/PowerPC/inlineasm-vsx-reg.ll
42	Maybe we should add another test for ws as well? The above test is actually for 'x' modifier?

MaskRay marked an inline comment as done.Jul 3 2019, 6:49 PM

MaskRay added inline comments.

llvm/lib/Target/PowerPC/PPCISelLowering.cpp
14080	Can you elaborate on what I should do? I'm not familiar with the register info stuff...

MaskRay marked an inline comment as done.Jul 3 2019, 8:02 PM

MaskRay added inline comments.

llvm/test/CodeGen/PowerPC/inlineasm-vsx-reg.ll
42	I think it is incorrect if the 'x' modifier is not used, so we probably don't have to check the no-modifier case.

MaskRay marked an inline comment as done.Jul 3 2019, 8:12 PM

MaskRay added inline comments.

clang/lib/Basic/Targets/PPC.h
211	I played with "ws" and "ww" but can't find any behavior difference from assembly produced by powerpc64le-linux-gnu-gcc. I'll keep the current form (which is known to make musl fmax/fmaxf build) unless the gcc semantics are clearer.

LGTM. Thanks for investigating GCC behavior.

clang/lib/Basic/Targets/PPC.h
211	OK. Thanks. So maybe it is just the misleading doc problem of GCC.
llvm/lib/Target/PowerPC/PPCISelLowering.cpp
14080	`VSFRC` contains both `F8RC` and `VFRC`. `F8RC` is FP. So if `ws` can NOT use FP, then we should not use `VSFRC`. However, if `ws` can use `FP` as well as you found in later GCC experiments, then we don't need to do this.

This revision is now accepted and ready to land.Jul 3 2019, 8:26 PM

float ws_float(float x, float y) {
  __asm__ ("xsadddp %x0, %x1, %x2" : "=ws"(x) : "ws"(x), "ws"(y));
  return x;
}
float ww_float(float x, float y) {
  __asm__ ("xsadddp %x0, %x1, %x2" : "=ww"(x) : "ww"(x), "ww"(y));
  return x;
}

double ws_double(double x, double y) {
  __asm__ ("xsadddp %x0, %x1, %x2" : "=ws"(x) : "ws"(x), "ws"(y));
  return x;
}
double ww_double(double x, double y) {
  __asm__ ("xsadddp %x0, %x1, %x2" : "=ww"(x) : "ww"(x), "ww"(y));
  return x;
}

% powerpc64le-linux-gnu-gcc -O2 a.c -S -o - | grep xsadd
        xsadddp 1, 1, 2
        xsadddp 1, 1, 2
        xsadddp 1, 1, 2
        xsadddp 1, 1, 2
% clang -target ppc64le -O2 a.c -S -o - | grep xsadd
# same output

float scalar_ww_float(float x, float y) {
  __asm__ ("fadds %0, %1, %2" : "=ww"(x) : "ww"(x), "ww"(y));
  return x;
}
float scalar_ws_float(float x, float y) {
  __asm__ ("fadds %0, %1, %2" : "=ws"(x) : "ws"(x), "ws"(y));
  return x;
}
double scalar_ww_double(double x, double y) {
  __asm__ ("fadds %0, %1, %2" : "=ww"(x) : "ww"(x), "ww"(y));
  return x;
}
double scalar_ws_double(double x, double y) {
  __asm__ ("fadds %0, %1, %2" : "=ws"(x) : "ws"(x), "ws"(y));
  return x;
}

% powerpc64le-linux-gnu-gcc -O2 a.c -S -o - | grep fadds
        fadds 1, 1, 2
        fadds 1, 1, 2
        fadds 1, 1, 2
        fadds 1, 1, 2
% clang -target ppc64le -O2 a.c -S -o - | grep fadds
# same output

llvm/lib/Target/PowerPC/PPCISelLowering.cpp

14080

float ws_float(float x, float y) {
  __asm__ ("xsadddp %0, %1, %2" : "=ws"(x) : "ws"(x), "ws"(y));
  return x;
}
float ww_float(float x, float y) {
  __asm__ ("xsadddp %0, %1, %2" : "=ww"(x) : "ww"(x), "ww"(y));
  return x;
}

double ws_double(double x, double y) {
  __asm__ ("xsadddp %0, %1, %2" : "=ws"(x) : "ws"(x), "ws"(y));
  return x;
}
double ww_double(double x, double y) {
  __asm__ ("xsadddp %0, %1, %2" : "=ww"(x) : "ww"(x), "ww"(y));
  return x;
}

% powerpc64le-linux-gnu-gcc -O2 a.c -S -o - | grep xsadd
        xsadddp 1, 1, 2
        xsadddp 1, 1, 2
        xsadddp 1, 1, 2
        xsadddp 1, 1, 2

Closed by commit rL365106: [PowerPC] Support constraint code "ww" (authored by MaskRay). · Explain WhyJul 3 2019, 9:46 PM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

clang/

lib/

Basic/

Targets/

PPC.h

3 lines

test/

CodeGen/

ppc64-inline-asm.c

13 lines

llvm/

lib/

Target/

PowerPC/

PPCISelLowering.cpp

10 lines

test/

CodeGen/

PowerPC/

inlineasm-vsx-reg.ll

9 lines

vec-asm-disabled.ll

12 lines

Diff 207707

clang/lib/Basic/Targets/PPC.h

Show First 20 Lines • Show All 201 Lines • ▼ Show 20 Lines	case 'v': // Altivec vector register
if (FloatABI == SoftFloat)		if (FloatABI == SoftFloat)
return false;		return false;
Info.setAllowsRegister();		Info.setAllowsRegister();
break;		break;
case 'w':		case 'w':
switch (Name[1]) {		switch (Name[1]) {
case 'd': // VSX vector register to hold vector double data		case 'd': // VSX vector register to hold vector double data
case 'f': // VSX vector register to hold vector float data		case 'f': // VSX vector register to hold vector float data
case 's': // VSX vector register to hold scalar float data		case 's': // VSX vector register to hold scalar double data
		case 'w': // VSX vector register to hold scalar double data
		jsjiUnsubmitted Not Done Reply Inline Actions Add some more comments for `w` to distinguish it from `s`? Do we want to keep compatibility with GCC? According to https://gcc.gnu.org/onlinedocs/gcc-9.1.0/gcc/Machine-Constraints.html#Machine-Constraints, `ww` is `FP or VSX register to perform float operations under -mvsx or NO_REGS.`, while `ws` is `VSX vector register to hold scalar double values` . So `ww` can use `FP` while `ws` can NOT ? jsji: Add some more comments for `w` to distinguish it from `s`? Do we want to keep compatibility…
		MaskRayAuthorUnsubmitted Done Reply Inline Actions I played with "ws" and "ww" but can't find any behavior difference from assembly produced by powerpc64le-linux-gnu-gcc. I'll keep the current form (which is known to make musl fmax/fmaxf build) unless the gcc semantics are clearer. MaskRay: I played with "ws" and "ww" but can't find any behavior difference from assembly produced by…
		jsjiUnsubmitted Not Done Reply Inline Actions OK. Thanks. So maybe it is just the misleading doc problem of GCC. jsji: OK. Thanks. So maybe it is just the misleading doc problem of GCC.
case 'a': // Any VSX register		case 'a': // Any VSX register
case 'c': // An individual CR bit		case 'c': // An individual CR bit
case 'i': // FP or VSX register to hold 64-bit integers data		case 'i': // FP or VSX register to hold 64-bit integers data
break;		break;
default:		default:
return false;		return false;
}		}
Info.setAllowsRegister();		Info.setAllowsRegister();
▲ Show 20 Lines • Show All 256 Lines • Show Last 20 Lines

clang/test/CodeGen/ppc64-inline-asm.c

	Show All 18 Lines
	unsigned char test_wc_i8(unsigned char b1, unsigned char b2) {			unsigned char test_wc_i8(unsigned char b1, unsigned char b2) {
	unsigned char o;			unsigned char o;
	asm("crand %0, %1, %2" : "=wc"(o) : "wc"(b1), "wc"(b2) : );			asm("crand %0, %1, %2" : "=wc"(o) : "wc"(b1), "wc"(b2) : );
	return o;			return o;
	// CHECK-LABEL: zeroext i8 @test_wc_i8(i8 zeroext %b1, i8 zeroext %b2)			// CHECK-LABEL: zeroext i8 @test_wc_i8(i8 zeroext %b1, i8 zeroext %b2)
	// CHECK: call i8 asm "crand $0, $1, $2", "=^wc,^wc,^wc"(i8 %b1, i8 %b2)			// CHECK: call i8 asm "crand $0, $1, $2", "=^wc,^wc,^wc"(i8 %b1, i8 %b2)
	}			}

				float test_fmaxf(float x, float y) {
				asm("xsmaxdp %x0, %x1, %x2" : "=ww"(x) : "ww"(x), "ww"(y));
				return x;
				// CHECK-LABEL: float @test_fmaxf(float %x, float %y)
				// CHECK: call float asm "xsmaxdp ${0:x}, ${1:x}, ${2:x}", "=^ww,^ww,^ww"(float %x, float %y)
				}

				double test_fmax(double x, double y) {
				asm("xsmaxdp %x0, %x1, %x2" : "=ws"(x) : "ws"(x), "ws"(y));
				return x;
				// CHECK-LABEL: double @test_fmax(double %x, double %y)
				// CHECK: call double asm "xsmaxdp ${0:x}, ${1:x}, ${2:x}", "=^ws,^ws,^ws"(double %x, double %y)
				}

llvm/lib/Target/PowerPC/PPCISelLowering.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 13,952 Lines • ▼ Show 20 Lines	case 'Z':
// and forming the complete address in the second register. This is		// and forming the complete address in the second register. This is
// suboptimal.		// suboptimal.
return C_Memory;		return C_Memory;
}		}
} else if (Constraint == "wc") { // individual CR bits.		} else if (Constraint == "wc") { // individual CR bits.
return C_RegisterClass;		return C_RegisterClass;
} else if (Constraint == "wa" \|\| Constraint == "wd" \|\|		} else if (Constraint == "wa" \|\| Constraint == "wd" \|\|
Constraint == "wf" \|\| Constraint == "ws" \|\|		Constraint == "wf" \|\| Constraint == "ws" \|\|
Constraint == "wi") {		Constraint == "wi" \|\| Constraint == "ww") {
return C_RegisterClass; // VSX registers.		return C_RegisterClass; // VSX registers.
}		}
return TargetLowering::getConstraintType(Constraint);		return TargetLowering::getConstraintType(Constraint);
}		}

/// Examine constraint type and operand type and determine a weight value.		/// Examine constraint type and operand type and determine a weight value.
/// This object must already have been set up with the operand type		/// This object must already have been set up with the operand type
/// and the current alternative constraint selected.		/// and the current alternative constraint selected.
Show All 11 Lines	PPCTargetLowering::getSingleConstraintMatchWeight(
// Look at the constraint type.		// Look at the constraint type.
if (StringRef(constraint) == "wc" && type->isIntegerTy(1))		if (StringRef(constraint) == "wc" && type->isIntegerTy(1))
return CW_Register; // an individual CR bit.		return CW_Register; // an individual CR bit.
else if ((StringRef(constraint) == "wa" \|\|		else if ((StringRef(constraint) == "wa" \|\|
StringRef(constraint) == "wd" \|\|		StringRef(constraint) == "wd" \|\|
StringRef(constraint) == "wf") &&		StringRef(constraint) == "wf") &&
type->isVectorTy())		type->isVectorTy())
return CW_Register;		return CW_Register;
else if (StringRef(constraint) == "ws" && type->isDoubleTy())
return CW_Register;
else if (StringRef(constraint) == "wi" && type->isIntegerTy(64))		else if (StringRef(constraint) == "wi" && type->isIntegerTy(64))
return CW_Register; // just hold 64-bit integers data.		return CW_Register; // just hold 64-bit integers data.
		else if (StringRef(constraint) == "ws" && type->isDoubleTy())
		return CW_Register;
		else if (StringRef(constraint) == "ww" && type->isFloatTy())
		return CW_Register;

switch (*constraint) {		switch (*constraint) {
default:		default:
weight = TargetLowering::getSingleConstraintMatchWeight(info, constraint);		weight = TargetLowering::getSingleConstraintMatchWeight(info, constraint);
break;		break;
case 'b':		case 'b':
if (type->isIntegerTy())		if (type->isIntegerTy())
weight = CW_Register;		weight = CW_Register;
▲ Show 20 Lines • Show All 69 Lines • ▼ Show 20 Lines	if (Constraint.size() == 1) {
}		}
} else if (Constraint == "wc" && Subtarget.useCRBits()) {		} else if (Constraint == "wc" && Subtarget.useCRBits()) {
// An individual CR bit.		// An individual CR bit.
return std::make_pair(0U, &PPC::CRBITRCRegClass);		return std::make_pair(0U, &PPC::CRBITRCRegClass);
} else if ((Constraint == "wa" \|\| Constraint == "wd" \|\|		} else if ((Constraint == "wa" \|\| Constraint == "wd" \|\|
Constraint == "wf" \|\| Constraint == "wi") &&		Constraint == "wf" \|\| Constraint == "wi") &&
Subtarget.hasVSX()) {		Subtarget.hasVSX()) {
return std::make_pair(0U, &PPC::VSRCRegClass);		return std::make_pair(0U, &PPC::VSRCRegClass);
} else if (Constraint == "ws" && Subtarget.hasVSX()) {		} else if ((Constraint == "ws" \|\| Constraint == "ww") && Subtarget.hasVSX()) {
		jsjiUnsubmitted Not Done Reply Inline Actions Should we exclude `FP` for `ws` and return `VFRCRegClass` instead of `VSFRCRegClass` ? jsji: Should we exclude `FP` for `ws` and return `VFRCRegClass` instead of `VSFRCRegClass` ?
		MaskRayAuthorUnsubmitted Done Reply Inline Actions Can you elaborate on what I should do? I'm not familiar with the register info stuff... MaskRay: Can you elaborate on what I should do? I'm not familiar with the register info stuff...
		jsjiUnsubmitted Not Done Reply Inline Actions `VSFRC` contains both `F8RC` and `VFRC`. `F8RC` is FP. So if `ws` can NOT use FP, then we should not use `VSFRC`. However, if `ws` can use `FP` as well as you found in later GCC experiments, then we don't need to do this. jsji: `VSFRC` contains both `F8RC` and `VFRC`. `F8RC` is FP. So if `ws` can NOT use FP, then we…
		MaskRayAuthorUnsubmitted Done Reply Inline Actions float ws_float(float x, float y) { __asm__ ("xsadddp %0, %1, %2" : "=ws"(x) : "ws"(x), "ws"(y)); return x; } float ww_float(float x, float y) { __asm__ ("xsadddp %0, %1, %2" : "=ww"(x) : "ww"(x), "ww"(y)); return x; } double ws_double(double x, double y) { __asm__ ("xsadddp %0, %1, %2" : "=ws"(x) : "ws"(x), "ws"(y)); return x; } double ww_double(double x, double y) { __asm__ ("xsadddp %0, %1, %2" : "=ww"(x) : "ww"(x), "ww"(y)); return x; } % powerpc64le-linux-gnu-gcc -O2 a.c -S -o - \| grep xsadd xsadddp 1, 1, 2 xsadddp 1, 1, 2 xsadddp 1, 1, 2 xsadddp 1, 1, 2 MaskRay: ``` float ws_float(float x, float y) { __asm__ ("xsadddp %0, %1, %2" : "=ws"(x) : "ws"(x)…
if (VT == MVT::f32 && Subtarget.hasP8Vector())		if (VT == MVT::f32 && Subtarget.hasP8Vector())
return std::make_pair(0U, &PPC::VSSRCRegClass);		return std::make_pair(0U, &PPC::VSSRCRegClass);
else		else
return std::make_pair(0U, &PPC::VSFRCRegClass);		return std::make_pair(0U, &PPC::VSFRCRegClass);
}		}

std::pair<unsigned, const TargetRegisterClass *> R =		std::pair<unsigned, const TargetRegisterClass *> R =
TargetLowering::getRegForInlineAsmConstraint(TRI, Constraint, VT);		TargetLowering::getRegForInlineAsmConstraint(TRI, Constraint, VT);
▲ Show 20 Lines • Show All 1,206 Lines • Show Last 20 Lines

llvm/test/CodeGen/PowerPC/inlineasm-vsx-reg.ll

Show All 32 Lines	define double @test() {
entry:		entry:
%0 = tail call double asm "mtvsrd ${0:x}, 1", "=^ws,~{f0},~{f1},~{f2},~{f3},~{f4},~{f5},~{f6},~{f7},~{f8},~{f9},~{f10},~{f11},~{f12},~{f13},~{f14}"()		%0 = tail call double asm "mtvsrd ${0:x}, 1", "=^ws,~{f0},~{f1},~{f2},~{f3},~{f4},~{f5},~{f6},~{f7},~{f8},~{f9},~{f10},~{f11},~{f12},~{f13},~{f14}"()
ret double %0		ret double %0
; CHECK-LABEL: test:		; CHECK-LABEL: test:
; CHECK: #APP		; CHECK: #APP
; CHECK: mtvsrd v2, r1		; CHECK: mtvsrd v2, r1
; CHECK: #NO_APP		; CHECK: #NO_APP
}		}

		define float @test_ww(float %x, float %y) {
		jsjiUnsubmitted Not Done Reply Inline Actions Maybe we should add another test for ws as well? The above test is actually for 'x' modifier? jsji: Maybe we should add another test for ws as well? The above test is actually for 'x' modifier?
		MaskRayAuthorUnsubmitted Done Reply Inline Actions I think it is incorrect if the 'x' modifier is not used, so we probably don't have to check the no-modifier case. MaskRay: I think it is incorrect if the 'x' modifier is not used, so we probably don't have to check the…
		%1 = tail call float asm "xsmaxdp ${0:x}, ${1:x}, ${2:x}", "=^ww,^ww,^ww"(float %x, float %y)
		ret float %1
		; CHECK-LABEL: test_ww:
		; CHECK: #APP
		; CHECK: xsmaxdp f1, f1, f2
		; CHECK: #NO_APP
		}

llvm/test/CodeGen/PowerPC/vec-asm-disabled.ll

	Show All 13 Lines
	entry:			entry:
	%0 = tail call { i32, <4 x float> } asm "xxsldwi ${1:x},${2:x},${2:x},3", "=^wi,=&^wi,^wi"(<4 x float> %__A) #0			%0 = tail call { i32, <4 x float> } asm "xxsldwi ${1:x},${2:x},${2:x},3", "=^wi,=&^wi,^wi"(<4 x float> %__A) #0
	%asmresult = extractvalue { i32, <4 x float> } %0, 0			%asmresult = extractvalue { i32, <4 x float> } %0, 0
	ret i32 %asmresult			ret i32 %asmresult

	; CHECK: error: couldn't allocate output register for constraint 'wi'			; CHECK: error: couldn't allocate output register for constraint 'wi'
	}			}

				define float @test_ww(float %x, float %y) #0 {
				%1 = tail call float asm "xsmaxdp ${0:x},${1:x},${2:x}", "=^ww,^ww,^ww"(float %x, float %y) #0
				ret float %1
				; CHECK: error: couldn't allocate output register for constraint 'ww'
				}

				define double @test_ws(double %x, double %y) #0 {
				%1 = tail call double asm "xsmaxdp ${0:x},${1:x},${2:x}", "=^ws,^ws,^ws"(double %x, double %y) #0
				ret double %1
				; CHECK: error: couldn't allocate output register for constraint 'ws'
				}

	attributes #0 = { nounwind "target-features"="-vsx" }			attributes #0 = { nounwind "target-features"="-vsx" }

This is an archive of the discontinued LLVM Phabricator instance.

[PowerPC] Support constraint code "ww"ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 207707

clang/lib/Basic/Targets/PPC.h

clang/test/CodeGen/ppc64-inline-asm.c

llvm/lib/Target/PowerPC/PPCISelLowering.cpp

llvm/test/CodeGen/PowerPC/inlineasm-vsx-reg.ll

llvm/test/CodeGen/PowerPC/vec-asm-disabled.ll

[PowerPC] Support constraint code "ww"
ClosedPublic