This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/
-
include/clang/Basic/
-
clang/
-
Basic/
-
TargetInfo.h
-
lib/
-
Basic/
-
TargetInfo.cpp
-
CodeGen/
1
CGStmt.cpp
-
test/
-
CodeGen/
-
SystemZ/
-
systemz-inline-asm-02.c
-
systemz-inline-asm.c
-
aarch64-inline-asm.c
-
asm-goto.c
-
ms-intrinsics.c
-
x86-asm-register-constraint-mix.c
-
z-hard-register-inline-asm.c
-
Sema/
-
z-hard-register-inline-asm.c

Differential D105142

RFC: Implementing new mechanism for hard register operands to inline asm as a constraint.
Needs ReviewPublic

Authored by anirudhp on Jun 29 2021, 12:18 PM.

Download Raw Diff

Details

Reviewers

uweigand
Kai
abhina.sreeskantharajan
yusra.syeda
programmerjake
jyknight
rengolin
theraven
madhur13490
MaskRay
rnk
jonpa
nemanjai
stefanp
hubert.reinterpretcast
kbarton

Summary

Relevant RFCs posted here

https://lists.llvm.org/pipermail/llvm-dev/2021-June/151370.html
https://gcc.gnu.org/pipermail/gcc/2021-June/236269.html

This is put up as an RFC patch to get feedback about the introduction of a new inline asm constraint which supports hard register operands
This is mostly a clang change, since the LLVM IR for the inline assembly already supports the {...} syntax which the backend recognizes. This change merely completes the loop in terms of introducing a user facing constraint which maps to it.

The following design decisions were taken for this patch:

For the Sema side:

We validate the "{" constraint using a two-phase validation approach. Firstly, we check if there is a target dependent implementation to handle the {....} constraint. If it fails, then we move on to a target agnostic check where we parse and validate the {....} constraint.
Why do we do this? Well, there are some targets which already process the {...} as a user facing constraint. For example the AMDGPU target. It supports syntax of the form {register-name} as well as {register-name[...]}. Moving this implementation to the target agnostic side seems to set a precedent, that we can keep extending the target agnostic implementation based on new cases for certain targets, which would be better served of moving it to the respective target.
In terms of the target agnostic validation, we simply check for the following syntax {.*}. The parsed out content within the two curly braces is checked to see whether its a "valid GCC register".

For the Clang CodeGen side:

Most of the work is done in the AddVariableConstraints function in CGStmt.cpp, which is responsible for emitting the LLVM IR corresponding to the usage of an actual register that the backend can use. Coincidentally, the LLVM IR is also {...}. As mentioned above, this is essentially mapping the LLVM Inline Assembly IR back to a user facing inline asm constraint.
Within this function we add in logic to check if the constraint is of the form [&]{...} in addition to the "register asm" construct.
A scenario where it was applicable to apply both the "register asm" construct and the "hard register inline asm constraint" was diagnosed as an unsupported error because there's no way the compiler will know which register the user meant. The safest option here is to error out explicitly, and put onus back on the user.
To achieve this, and refactor it in a nice way, most of the logic pertaining to the "register asm" construct has been moved into a separate function called ShouldApplyRegisterVariableConstraint which deduces whether to apply the "register asm" construct or not.
Furthermore, the GCCReg field is set with the "Register" only if the register is validated to be a "valid GCC Register" type. To me it doesn't make a lot of sense to set GCC Reg to something that might not necessarily be a "valid GCC register" as defined by respective targets. Would we have a case where we could have a {.*} constraint where the contents inside the curly brace is not a valid register? Yes. For example, The x86 target supports the "@cca" constraint, which is validated on the Sema side as a valid constraint, and before processing is converted to "{@cca}" (via the convertAsmConstraint function. "@cca" is not a valid "GCC register name". So in these case, we'll simply emit the constraint without setting GCCReg (with the assumption that the respective backends deal with it appropriately)

Tests:

Various tests were updated to account for the new behaviour.
I added a few SystemZ tests because we work on the Z backend, but I have no issues adding testing for multiple targets.
There were a few mentions of "waiting" for the GCC implementation of the same to land before the Clang side lands. As mentioned above, the intent to implement it on the GCC side has already been put forward via the RFC. I have no issues "parking" this implementation until its ready to be merged in. However, it might be good to hash out any open questions/concerns in the interim.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

anirudhp created this revision.Jun 29 2021, 12:18 PM

Herald added subscribers: pengfei, jfb, tpr. · View Herald TranscriptJun 29 2021, 12:18 PM

anirudhp requested review of this revision.Jun 29 2021, 12:18 PM

Herald added a project: Restricted Project. · View Herald TranscriptJun 29 2021, 12:18 PM

Herald added a subscriber: cfe-commits. · View Herald Transcript

anirudhp updated this revision to Diff 355332.Jun 29 2021, 12:22 PM

anirudhp added reviewers: uweigand, Kai, abhina.sreeskantharajan, yusra.syeda, programmerjake, jyknight, rengolin, theraven, madhur13490.Jun 29 2021, 12:43 PM

Harbormaster completed remote builds in B111597: Diff 355332.Jun 29 2021, 1:30 PM

The code looks fine but it would be good to see some docs along with it. We're currently missing docs on inline assembly entirely and the GCC ones are somewhat... opaque when it comes to describing how constraints work.

In D105142#2849835, @theraven wrote:

The code looks fine but it would be good to see some docs along with it. We're currently missing docs on inline assembly entirely and the GCC ones are somewhat... opaque when it comes to describing how constraints work.

Thank you for your feedback! By docs do you mind updating/adding some information to the existing LLVM docs(like the langref https://llvm.org/docs/LangRef.html for example), or more comments to the code?

anirudhp added reviewers: MaskRay, rnk.Jun 30 2021, 8:38 AM

anirudhp added a reviewer: jonpa.Jun 30 2021, 8:43 AM

anirudhp added reviewers: nemanjai, stefanp, hubert.reinterpretcast.Jun 30 2021, 9:20 AM

anirudhp added a reviewer: kbarton.Jul 5 2021, 7:24 AM

This is great.

unsigned long foo(unsigned long addr, unsigned long a0,
                  unsigned long a1, unsigned long a2,
                  unsigned long a3, unsigned long a4,
                  unsigned long a5) {
  unsigned long result asm("rax");
  unsigned long b2 asm("rdx") = a2;
  unsigned long b3 asm("rcx") = a3;
  unsigned long b4 asm("r8") = a4;
  unsigned long b5 asm("r9") = a5;
  asm("call *%1" : "=r" (result) : "{rax}"(addr), "{rdi}"(a0), "{rsi}"(a1), "r"(b2), "r"(b3), "r"(b4), "r"(b5));
  return result;
}

this compiles to`%0 = tail call i64 asm "call *$1", "=r,{r{ax}x},{r{dx}i},{rsi},r,r,r,r,~{dirflag},~{fpsr},~{flags}"(i64 %addr, i64 %a0, i64 %a1, i64 %a2, i64 %a3, i64 %a4, i64 %a5) #1, !srcloc !3`
(note {r{ax}x},{r{dx}i}) which will cause a backend failure error: couldn't allocate input reg for constraint '{r{dx}'.
Can you investigate it?

For example the AMDGPU target. It supports syntax of the form {register-name} as well as {register-name[...]}.

CC @arsenm @rampitec for AMDGPU thoughts.

In D105142#2860885, @MaskRay wrote:
This is great.
unsigned long foo(unsigned long addr, unsigned long a0,
                  unsigned long a1, unsigned long a2,
                  unsigned long a3, unsigned long a4,
                  unsigned long a5) {
  unsigned long result asm("rax");
  unsigned long b2 asm("rdx") = a2;
  unsigned long b3 asm("rcx") = a3;
  unsigned long b4 asm("r8") = a4;
  unsigned long b5 asm("r9") = a5;
  asm("call *%1" : "=r" (result) : "{rax}"(addr), "{rdi}"(a0), "{rsi}"(a1), "r"(b2), "r"(b3), "r"(b4), "r"(b5));
  return result;
}
this compiles to`%0 = tail call i64 asm "call *$1", "=r,{r{ax}x},{r{dx}i},{rsi},r,r,r,r,~{dirflag},~{fpsr},~{flags}"(i64 %addr, i64 %a0, i64 %a1, i64 %a2, i64 %a3, i64 %a4, i64 %a5) #1, !srcloc !3`
(note {r{ax}x},{r{dx}i}) which will cause a backend failure error: couldn't allocate input reg for constraint '{r{dx}'.
Can you investigate it?

Definitely!

I'm assuming this is the X86 target from the usage of the registers in the example. So the issue here seems to be that before a constraint is emitted into the IR, the SimplifyConstraint (in CGStmt.cpp) /TargetInfo::ConvertConstraint (Target overridden) function(s) is/are called to "convert" the constraint into a simpler form if it exists. So, in this particular case, {rax}, gets "simplified" to {r{ax}} because:

{ -> no simplification rule exists -> simply emit it
r -> no simplification rule exists -> simply emit it
a -> simplified to {ax}            -> {ax} is emitted
x -> no simplification rule exists -> simply emit it
} -> no simplification rule exists -> simply emit it

Thanks for bringing up this point because I missed it.

Looking into it in more detail, I think we can just forego the "simplification"/"conversion" of the constraint while emitting, if we have the [&]{.*} form, since this already maps to the lower level LLVM inline assembly IR which is responsible for telling the backend to allocate a specific register. All validation is already performed in the Sema stage, and/or in the AddVariableConstraints function. So what can be done imo, is to simply early return in the SimplifyConstraint function in CGStmt.cpp, when you have the constraint of the form [&]{.*}. What do you think?

Disable constraint simplification when you already have a constraint of the form {...}. Constraint simplification is usually done character by character, with different targets having different implementations.
Furthermore, a constraint of the form {...} already maps to the LLVM inline assembly IR that tells the backend to allocate a suitable physical register.

Something went wrong with the previous time updating the diff. I'm not too sure, but I'm just doing it again, and this time the it looks a lot better.

@MaskRay Could you please look at the latest changeset, I have added your example as a separate test case for the x86 target.

Harbormaster completed remote builds in B112792: Diff 356968.Jul 7 2021, 9:00 AM

This code doesn't handle multiple alternatives in a constraint.

E.g. "={eax}{ebx}" or "={eax}{ebx},m".

See the GCC docs for the C-level syntax
https://gcc.gnu.org/onlinedocs/gcc/Multi-Alternative.html#Multi-Alternative
and LLVM IR docs for the IR syntax:
https://llvm.org/docs/LangRef.html#constraint-codes

LLVM doesn't handle alternatives very well in the backend, but Clang at least should parse and properly generate LLVM asm strings for these cases I think.

In D105142#2850247, @anirudhp wrote:

In D105142#2849835, @theraven wrote:

The code looks fine but it would be good to see some docs along with it. We're currently missing docs on inline assembly entirely and the GCC ones are somewhat... opaque when it comes to describing how constraints work.

Thank you for your feedback! By docs do you mind updating/adding some information to the existing LLVM docs(like the langref https://llvm.org/docs/LangRef.html for example), or more comments to the code?

I meant user-facing clang docs. This is not an IR change, so it does not belong in LangRef, but the only reference to inline assembly in clang's documentation is a reference to the GCC docs (which are almost incomprehensible in general because they were very x86-specific and were then tweaked a bit to be portable, and specifically don't mention this feature). If we are adding a new user-facing feature, we need to provide user-facing documentation for it. Ideally this would provide complete documentation of inline assembly supported by clang, but at least we should document this feature as an extension.

In D105142#2862592, @jyknight wrote:

This code doesn't handle multiple alternatives in a constraint.

E.g. "={eax}{ebx}" or "={eax}{ebx},m".

See the GCC docs for the C-level syntax
https://gcc.gnu.org/onlinedocs/gcc/Multi-Alternative.html#Multi-Alternative
and LLVM IR docs for the IR syntax:
https://llvm.org/docs/LangRef.html#constraint-codes

LLVM doesn't handle alternatives very well in the backend, but Clang at least should parse and properly generate LLVM asm strings for these cases I think.

You're right, when it came to the {...} constraint, initially I purposely errored out when there were multiple constraints after {.*}. To me, it didn't make sense to issue another constraint when you're already being specific enough by asking the backend to use a specific register. For example, the AMDGPU target, currently errors out when you have something like "{reg-name}a". Ideally if its an option, I'd like to make it even more strict to ensure that there is no other constraint applied with {.*}. Ie. if a user specifies a hard register constraint, only emit that, otherwise diagnose as an error, because a hard register constraint is quite specific to begin with. Maybe this could be documented as a special feature (a standalone constraint, can't be used as a multi-alternative constraint) since this is something new? (From the backend's perspective, is it a valid scenario to try to allocate two registers for an operand? will the last one occurring be the one allocated? what happens if the user wanted the "first" register?)

If not, I have no issue with changing it to emit multiple constraints in the IR, but I'd like to get your thoughts on the above first.

What's the status of this? Did the GCC proposal go anywhere? I'd be happy to see this move forward if you're also pushing it on the GCC side.

clang/lib/CodeGen/CGStmt.cpp
2077	I don't think the FIXME seems useful. I don't see any reason why anyone should validly use both at the same time.

Herald added a project: Restricted Project. · View Herald TranscriptApr 7 2022, 8:22 AM

Herald added a subscriber: StephenFan. · View Herald Transcript

Revision Contents

Path

Size

clang/

include/

clang/

Basic/

TargetInfo.h

7 lines

lib/

Basic/

TargetInfo.cpp

54 lines

CodeGen/

CGStmt.cpp

96 lines

test/

CodeGen/

SystemZ/

systemz-inline-asm-02.c

10 lines

19 lines

10 lines

14 lines

42 lines

x86-asm-register-constraint-mix.c

63 lines

z-hard-register-inline-asm.c

48 lines

Sema/

z-hard-register-inline-asm.c

49 lines

Diff 356968

clang/include/clang/Basic/TargetInfo.h

Show First 20 Lines • Show All 1,071 Lines • ▼ Show 20 Lines	validateConstraintModifier(StringRef /Constraint/,
unsigned /Size/,		unsigned /Size/,
std::string &/SuggestedModifier/) const {		std::string &/SuggestedModifier/) const {
return true;		return true;
}		}
virtual bool		virtual bool
validateAsmConstraint(const char *&Name,		validateAsmConstraint(const char *&Name,
TargetInfo::ConstraintInfo &info) const = 0;		TargetInfo::ConstraintInfo &info) const = 0;

		// Validate the "hard register" inline asm constraint. This constraint is
		// of the form {<reg-name>}. This constraint is meant to be used
		// as an alternative for the "register asm" construct to put inline
		// asm operands into specific registers.
		bool validateHardRegisterAsmConstraint(const char *&Name,
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - bool validateHardRegisterAsmConstraint(const char &Name, + bool + validateHardRegisterAsmConstraint(const char &Name, Lint: Pre-merge checks: clang-format: please reformat the code ``` - bool validateHardRegisterAsmConstraint(const char…
		TargetInfo::ConstraintInfo &info) const;

bool resolveSymbolicName(const char *&Name,		bool resolveSymbolicName(const char *&Name,
ArrayRef<ConstraintInfo> OutputConstraints,		ArrayRef<ConstraintInfo> OutputConstraints,
unsigned &Index) const;		unsigned &Index) const;

// Constraint parm will be left pointing at the last character of		// Constraint parm will be left pointing at the last character of
// the constraint. In practice, it won't be changed unless the		// the constraint. In practice, it won't be changed unless the
// constraint is longer than one character.		// constraint is longer than one character.
virtual std::string convertConstraint(const char *&Constraint) const {		virtual std::string convertConstraint(const char *&Constraint) const {
▲ Show 20 Lines • Show All 500 Lines • Show Last 20 Lines

clang/lib/Basic/TargetInfo.cpp

Show First 20 Lines • Show All 631 Lines • ▼ Show 20 Lines	while (*Name) {
case '!': // Disparage severely.		case '!': // Disparage severely.
case '*': // Ignore for choosing register preferences.		case '*': // Ignore for choosing register preferences.
case 'i': // Ignore i,n,E,F as output constraints (match from the other		case 'i': // Ignore i,n,E,F as output constraints (match from the other
// chars)		// chars)
case 'n':		case 'n':
case 'E':		case 'E':
case 'F':		case 'F':
break; // Pass them.		break; // Pass them.
		case '{': {
		// First, check the target parser in case it validates
		// the {...} constraint differently.
		if (validateAsmConstraint(Name, Info))
		return true;

		// If not, that's okay, we will try to validate it
		// using a target agnostic implementation.
		if (!validateHardRegisterAsmConstraint(Name, Info))
		return false;
		break;
		}
}		}

Name++;		Name++;
}		}

// Early clobber with a read-write constraint which doesn't permit registers		// Early clobber with a read-write constraint which doesn't permit registers
// is invalid.		// is invalid.
if (Info.earlyClobber() && Info.isReadWrite() && !Info.allowsRegister())		if (Info.earlyClobber() && Info.isReadWrite() && !Info.allowsRegister())
return false;		return false;

// If a constraint allows neither memory nor register operands it contains		// If a constraint allows neither memory nor register operands it contains
// only modifiers. Reject it.		// only modifiers. Reject it.
return Info.allowsMemory() \|\| Info.allowsRegister();		return Info.allowsMemory() \|\| Info.allowsRegister();
}		}

		bool TargetInfo::validateHardRegisterAsmConstraint(
		const char *&Name, TargetInfo::ConstraintInfo &Info) const {
		// First, swallow the '{'.
		Name++;

		// Mark the start of the possible register name.
		const char *Start = Name;

		// Loop through rest of "Name".
		// In this loop, we check whether we have a closing curly brace which
		// validates the constraint. Also, this allows us to get the correct bounds to
		// set our register name.
		while (Name && Name != '}')
		Name++;

		// Missing '}' or if there is anything after '}', return false.
		if (!Name \|\| (Name + 1))
		return false;

		// Now we set the register name.
		std::string Register(Start, Name - Start);

		// We validate whether its a valid register to be used.
		if (!isValidGCCRegisterName(Register))
		return false;

		Info.setAllowsRegister();
		return true;
		}

bool TargetInfo::resolveSymbolicName(const char *&Name,		bool TargetInfo::resolveSymbolicName(const char *&Name,
ArrayRef<ConstraintInfo> OutputConstraints,		ArrayRef<ConstraintInfo> OutputConstraints,
unsigned &Index) const {		unsigned &Index) const {
assert(*Name == '[' && "Symbolic name did not start with '['");		assert(*Name == '[' && "Symbolic name did not start with '['");
Name++;		Name++;
const char *Start = Name;		const char *Start = Name;
while (Name && Name != ']')		while (Name && Name != ']')
Name++;		Name++;
▲ Show 20 Lines • Show All 116 Lines • ▼ Show 20 Lines	while (*Name) {
case '#': // Ignore as constraint.		case '#': // Ignore as constraint.
while (Name[1] && Name[1] != ',')		while (Name[1] && Name[1] != ',')
Name++;		Name++;
break;		break;
case '?': // Disparage slightly code.		case '?': // Disparage slightly code.
case '!': // Disparage severely.		case '!': // Disparage severely.
case '*': // Ignore for choosing register preferences.		case '*': // Ignore for choosing register preferences.
break; // Pass them.		break; // Pass them.
		case '{': {
		// First, check the target parser in case it validates
		// the {...} constraint differently.
		if (validateAsmConstraint(Name, Info))
		return true;

		// If not, that's okay, we will try to validate it
		// using a target agnostic implementation.
		if (!validateHardRegisterAsmConstraint(Name, Info))
		return false;
		break;
		}
}		}

Name++;		Name++;
}		}

return true;		return true;
}		}

▲ Show 20 Lines • Show All 65 Lines • Show Last 20 Lines

clang/lib/CodeGen/CGStmt.cpp

Show First 20 Lines • Show All 1,960 Lines • ▼ Show 20 Lines	void CodeGenFunction::EmitSwitchStmt(const SwitchStmt &S) {
SwitchWeights = SavedSwitchWeights;		SwitchWeights = SavedSwitchWeights;
SwitchLikelihood = SavedSwitchLikelihood;		SwitchLikelihood = SavedSwitchLikelihood;
CaseRangeBlock = SavedCRBlock;		CaseRangeBlock = SavedCRBlock;
}		}

static std::string		static std::string
SimplifyConstraint(const char *Constraint, const TargetInfo &Target,		SimplifyConstraint(const char *Constraint, const TargetInfo &Target,
SmallVectorImpl<TargetInfo::ConstraintInfo> *OutCons=nullptr) {		SmallVectorImpl<TargetInfo::ConstraintInfo> *OutCons=nullptr) {
		// If we have only the {...} constraint, do not do any simplifications. This already
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - // If we have only the {...} constraint, do not do any simplifications. This already - // maps to the lower level LLVM inline assembly IR that tells the backend to allocate - // a specific register. Any validations would have already been done in the Sema stage - // or will be done in the AddVariableConstraints function. + // If we have only the {...} constraint, do not do any simplifications. This + // already maps to the lower level LLVM inline assembly IR that tells the + // backend to allocate a specific register. Any validations would have already + // been done in the Sema stage or will be done in the AddVariableConstraints + // function. Lint: Pre-merge checks: clang-format: please reformat the code ``` - // If we have only the {...} constraint, do not…
		// maps to the lower level LLVM inline assembly IR that tells the backend to allocate
		// a specific register. Any validations would have already been done in the Sema stage
		// or will be done in the AddVariableConstraints function.
		if (Constraint[0] == '{' \|\| (Constraint[0] == '&' && Constraint[1] == '{'))
		return std::string(Constraint);

std::string Result;		std::string Result;

while (*Constraint) {		while (*Constraint) {
switch (*Constraint) {		switch (*Constraint) {
default:		default:
Result += Target.convertConstraint(Constraint);		Result += Target.convertConstraint(Constraint);
break;		break;
// Ignore these		// Ignore these
Show All 31 Lines	while (*Constraint) {
}		}

Constraint++;		Constraint++;
}		}

return Result;		return Result;
}		}

/// AddVariableConstraints - Look at AsmExpr and if it is a variable declared		/// Is it valid to apply a register constraint for a variable marked with
/// as using a particular register add that as a constraint that will be used		/// the "register asm" construct?
/// in this asm stmt.		/// Optionally, if it is determined that we can, we set "Register" to the
static std::string		/// regiser name.
AddVariableConstraints(const std::string &Constraint, const Expr &AsmExpr,		static bool
const TargetInfo &Target, CodeGenModule &CGM,		ShouldApplyRegisterVariableConstraint(const Expr &AsmExpr,
const AsmStmt &Stmt, const bool EarlyClobber,		std::string *Register = nullptr) {
std::string *GCCReg = nullptr) {
const DeclRefExpr *AsmDeclRef = dyn_cast<DeclRefExpr>(&AsmExpr);		const DeclRefExpr *AsmDeclRef = dyn_cast<DeclRefExpr>(&AsmExpr);
if (!AsmDeclRef)		if (!AsmDeclRef)
return Constraint;		return false;
const ValueDecl &Value = *AsmDeclRef->getDecl();		const ValueDecl &Value = *AsmDeclRef->getDecl();
const VarDecl *Variable = dyn_cast<VarDecl>(&Value);		const VarDecl *Variable = dyn_cast<VarDecl>(&Value);
if (!Variable)		if (!Variable)
return Constraint;		return false;
if (Variable->getStorageClass() != SC_Register)		if (Variable->getStorageClass() != SC_Register)
return Constraint;		return false;
AsmLabelAttr *Attr = Variable->getAttr<AsmLabelAttr>();		AsmLabelAttr *Attr = Variable->getAttr<AsmLabelAttr>();
if (!Attr)		if (!Attr)
		return false;

		if (Register != nullptr)
		// Set the register to return from Attr.
		*Register = Attr->getLabel().str();
		return true;
		}

		/// AddVariableConstraints:
		/// Look at AsmExpr and if it is a variable declared as using a particular
		/// register add that as a constraint that will be used in this asm stmt.
		/// Whether it can be used or not is dependent on querying
		/// ShouldApplyRegisterVariableConstraint() Also check whether the "hard
		/// register" inline asm constraint (i.e. "{reg-name}") is specified. If so, add
		/// that as a constraint that will be used in this asm stmt.
		static std::string
		AddVariableConstraints(const std::string &Constraint, const Expr &AsmExpr,
		const TargetInfo &Target, CodeGenModule &CGM,
		const AsmStmt &Stmt, const bool EarlyClobber,
		std::string *GCCReg = nullptr) {
		// Do we have the "hard register" inline asm constraint.
		bool ApplyHardRegisterConstraint =
		Constraint[0] == '{' \|\| (EarlyClobber && Constraint[1] == '{');

		// Do we have "register asm" on a variable.
		std::string Reg = "";
		bool ApplyRegisterVariableConstraint =
		ShouldApplyRegisterVariableConstraint(AsmExpr, &Reg);

		// Diagnose the scenario where we apply both the register variable constraint
		// and a hard register variable constraint as an unsupported error.
		// Why? Because we could have a situation where the register passed in through
		// {...} and the register passed in through the "register asm" construct could
		// be different, and in this case, there's no way for the compiler to know
		// which one to emit.
		// FIXME: Should we add one additional level of granulariy, where the compiler
		jyknightUnsubmitted Not Done Reply Inline Actions I don't think the FIXME seems useful. I don't see any reason why anyone should validly use both at the same time. jyknight: I don't think the FIXME seems useful. I don't see any reason why anyone should validly use both…
		// can forgive the user, if they mention the same register to be used through
		// both "register asm" and the hard register inline asm constraint ({...})?
		if (ApplyHardRegisterConstraint && ApplyRegisterVariableConstraint) {
		CGM.ErrorUnsupported(&Stmt, "__asm__");
return Constraint;		return Constraint;
StringRef Register = Attr->getLabel();		}
assert(Target.isValidGCCRegisterName(Register));
		if (!ApplyHardRegisterConstraint && !ApplyRegisterVariableConstraint)
		return Constraint;

// We're using validateOutputConstraint here because we only care if		// We're using validateOutputConstraint here because we only care if
// this is a register constraint.		// this is a register constraint.
TargetInfo::ConstraintInfo Info(Constraint, "");		TargetInfo::ConstraintInfo Info(Constraint, "");
if (Target.validateOutputConstraint(Info) &&		if (Target.validateOutputConstraint(Info) && !Info.allowsRegister()) {
!Info.allowsRegister()) {
CGM.ErrorUnsupported(&Stmt, "__asm__");		CGM.ErrorUnsupported(&Stmt, "__asm__");
return Constraint;		return Constraint;
}		}

		if (ApplyHardRegisterConstraint) {
		int Start = EarlyClobber ? 2 : 1;
		int End = Constraint.find('}');
		Reg = Constraint.substr(Start, End - Start);
		// If we don't have a valid register name, simply return the constraint.
		// For example: There are some targets like X86 that use a constraint such
		// as "@cca", which is validated and then converted into {@cca}. Now this
		// isn't necessarily a "GCC Register", but in terms of emission, it is
		// valid since it lowered appropriately in the X86 backend. For the {..}
		// constraint, we shouldn't be too strict and error out if the register
		// itself isn't a valid "GCC register".
		if (!Target.isValidGCCRegisterName(Reg))
		return Constraint;
		}

		StringRef Register(Reg);
// Canonicalize the register here before returning it.		// Canonicalize the register here before returning it.
Register = Target.getNormalizedGCCRegisterName(Register);		Register = Target.getNormalizedGCCRegisterName(Register);
if (GCCReg != nullptr)		if (GCCReg != nullptr)
*GCCReg = Register.str();		*GCCReg = Register.str();
return (EarlyClobber ? "&{" : "{") + Register.str() + "}";		return (EarlyClobber ? "&{" : "{") + Register.str() + "}";
}		}

llvm::Value*		llvm::Value*
▲ Show 20 Lines • Show All 706 Lines • Show Last 20 Lines

clang/test/CodeGen/SystemZ/systemz-inline-asm-02.c

	// RUN: not %clang_cc1 -triple s390x-linux-gnu -O2 -emit-llvm -o - %s 2>&1 \			// RUN: not %clang_cc1 -triple s390x-linux-gnu -O2 -emit-llvm -o - %s 2>&1 \
	// RUN: \| FileCheck %s			// RUN: \| FileCheck %s
	// REQUIRES: systemz-registered-target			// REQUIRES: systemz-registered-target

	// Test that an error is given if a physreg is defined by multiple operands.			// Test that an error is given if a physreg is defined by multiple operands.
	int test_physreg_defs(void) {			int test_physreg_defs(void) {
	register int l __asm__("r7") = 0;			register int l __asm__("r7") = 0;
				int m;

	// CHECK: error: multiple outputs to hard register: r7			// CHECK: error: multiple outputs to hard register: r7
	__asm__("" : "+r"(l), "=r"(l));			__asm__(""
				: "+r"(l), "=r"(l));

	return l;			// CHECK: error: multiple outputs to hard register: r6
				__asm__(""
				: "+{r6}"(m), "={r6}"(m));

				return l + m;
	}			}

clang/test/CodeGen/SystemZ/systemz-inline-asm.c

	Show First 20 Lines • Show All 128 Lines • ▼ Show 20 Lines
	// CHECK: [[RESULT:%.*]] = tail call fp128 asm "axbr $0, $2", "=f,0,f"(fp128 %f, fp128 %g)			// CHECK: [[RESULT:%.*]] = tail call fp128 asm "axbr $0, $2", "=f,0,f"(fp128 %f, fp128 %g)
	// CHECK: store fp128 [[RESULT]], fp128* [[DEST]]			// CHECK: store fp128 [[RESULT]], fp128* [[DEST]]
	}			}

	// Test that there are no tied physreg uses. TwoAddress pass cannot deal with them.			// Test that there are no tied physreg uses. TwoAddress pass cannot deal with them.
	int test_physregs(void) {			int test_physregs(void) {
	// CHECK-LABEL: define{{.*}} signext i32 @test_physregs()			// CHECK-LABEL: define{{.*}} signext i32 @test_physregs()
	register int l __asm__("r7") = 0;			register int l __asm__("r7") = 0;
				int m = 0;

	// CHECK: call i32 asm "lr $0, $1", "={r7},{r7}"			// CHECK: call i32 asm "lr $0, $1", "={r7},{r7}"
	__asm__("lr %0, %1" : "+r"(l));			__asm__("lr %0, %1"
				: "+r"(l));

	// CHECK: call i32 asm "$0 $1 $2", "={r7},{r7},{r7}"			// CHECK: call i32 asm "$0 $1 $2", "={r7},{r7},{r7}"
	__asm__("%0 %1 %2" : "+r"(l) : "r"(l));			__asm__("%0 %1 %2"
				: "+r"(l)
				: "r"(l));

				// CHECK: call i32 asm "lr $0, $1", "={r6},{r6}"
				__asm__("lr %0, %1"
				: "+{r6}"(m));

				// CHECK: call i32 asm "$0 $1 $2", "={r6},{r6},{r6}"
				__asm__("%0 %1 %2"
				: "+{r6}"(m)
				: "{r6}"(m));

	return l;			return l + m;
	}			}

clang/test/CodeGen/aarch64-inline-asm.c

Show First 20 Lines • Show All 71 Lines • ▼ Show 20 Lines	void test_gcc_registers(void) {
// rN registers when used without register ... asm("rN") syntax		// rN registers when used without register ... asm("rN") syntax
// should not be remapped.		// should not be remapped.
asm volatile("mov r0, r1\n");		asm volatile("mov r0, r1\n");
// CHECK: call void asm sideeffect "mov r0, r1\0A", ""()		// CHECK: call void asm sideeffect "mov r0, r1\0A", ""()
}		}

void test_tied_earlyclobber(void) {		void test_tied_earlyclobber(void) {
register int a asm("x1");		register int a asm("x1");
asm("" : "+&r"(a));		asm(""
		: "+&r"(a));
		// CHECK: call i32 asm "", "=&{x1},0"(i32 %0)
		}

		void test_tied_earlyclobber2(void) {
		int a;
		asm(""
		: "+&{x1}"(a));
// CHECK: call i32 asm "", "=&{x1},0"(i32 %0)		// CHECK: call i32 asm "", "=&{x1},0"(i32 %0)
}		}

clang/test/CodeGen/asm-goto.c

Show First 20 Lines • Show All 49 Lines • ▼ Show 20 Lines
loop:		loop:
return 0;		return 0;
label_true:		label_true:
return 1;		return 1;
}		}

int test4(int out1, int out2) {		int test4(int out1, int out2) {
// CHECK-LABEL: define{{.*}} i32 @test4(		// CHECK-LABEL: define{{.*}} i32 @test4(
// CHECK: callbr { i32, i32 } asm sideeffect "jne ${3:l}", "={si},={di},r,X,X,0,1		// CHECK: callbr { i32, i32 } asm sideeffect "jne ${3:l}", "={si},={di},r,X,X,{si},{di}
// CHECK: to label %asm.fallthrough [label %label_true, label %loop]		// CHECK: to label %asm.fallthrough [label %label_true, label %loop]
// CHECK-LABEL: asm.fallthrough:		// CHECK-LABEL: asm.fallthrough:
if (out1 < out2)		if (out1 < out2)
asm volatile goto("jne %l3" : "+S"(out1), "+D"(out2) : "r"(out1) :: label_true, loop);		asm volatile goto("jne %l3"
		: "+S"(out1), "+D"(out2)
		: "r"(out1)::label_true, loop);
else		else
asm volatile goto("jne %l5" : "+S"(out1), "+D"(out2) : "r"(out1), "r"(out2) :: label_true, loop);		asm volatile goto("jne %l5"
// CHECK: callbr { i32, i32 } asm sideeffect "jne ${5:l}", "={si},={di},r,r,X,X,0,1		: "+S"(out1), "+D"(out2)
		: "r"(out1), "r"(out2)::label_true, loop);
		// CHECK: callbr { i32, i32 } asm sideeffect "jne ${5:l}", "={si},={di},r,r,X,X,{si},{di}
// CHECK: to label %asm.fallthrough2 [label %label_true, label %loop]		// CHECK: to label %asm.fallthrough2 [label %label_true, label %loop]
// CHECK-LABEL: asm.fallthrough2:		// CHECK-LABEL: asm.fallthrough2:
return out1 + out2;		return out1 + out2;
loop:		loop:
return -1;		return -1;
label_true:		label_true:
return -2;		return -2;
}		}
Show All 13 Lines	asm goto(
: : t_err);		: : t_err);
return 0;		return 0;
t_err:		t_err:
return 1;		return 1;
}		}

int test6(int out1) {		int test6(int out1) {
// CHECK-LABEL: define{{.*}} i32 @test6(		// CHECK-LABEL: define{{.*}} i32 @test6(
// CHECK: callbr i32 asm sideeffect "testl $0, $0; testl $1, $1; jne ${2:l}", "={si},r,X,X,0,{{.}} i8 blockaddress(@test6, %label_true), i8* blockaddress(@test6, %landing)		// CHECK: callbr i32 asm sideeffect "testl $0, $0; testl $1, $1; jne ${2:l}", "={si},r,X,X,{si},{{.}} i8 blockaddress(@test6, %label_true), i8* blockaddress(@test6, %landing)
// CHECK: to label %asm.fallthrough [label %label_true, label %landing]		// CHECK: to label %asm.fallthrough [label %label_true, label %landing]
// CHECK-LABEL: asm.fallthrough:		// CHECK-LABEL: asm.fallthrough:
// CHECK-LABEL: landing:		// CHECK-LABEL: landing:
int out2 = 42;		int out2 = 42;
asm volatile goto("testl %0, %0; testl %1, %1; jne %l2" : "+S"(out2) : "r"(out1) :: label_true, landing);		asm volatile goto("testl %0, %0; testl %1, %1; jne %l2" : "+S"(out2) : "r"(out1) :: label_true, landing);
landing:		landing:
return out1 + out2;		return out1 + out2;
label_true:		label_true:
return -2;		return -2;
}		}

clang/test/CodeGen/ms-intrinsics.c

	Show All 30 Lines
	// CHECK-X64: tail call void @llvm.memset.p0i8.i64(i8* align 1 %Dest, i8 %Data, i64 %Count, i1 true)			// CHECK-X64: tail call void @llvm.memset.p0i8.i64(i8* align 1 %Dest, i8 %Data, i64 %Count, i1 true)
	// CHECK-X64: ret void			// CHECK-X64: ret void
	// CHECK-X64: }			// CHECK-X64: }

	void test__movsb(unsigned char Dest, unsigned char Src, size_t Count) {			void test__movsb(unsigned char Dest, unsigned char Src, size_t Count) {
	return __movsb(Dest, Src, Count);			return __movsb(Dest, Src, Count);
	}			}
	// CHECK-I386-LABEL: define{{.*}} void @test__movsb			// CHECK-I386-LABEL: define{{.*}} void @test__movsb
	// CHECK-I386: call { i8, i8, i32 } asm sideeffect "rep movsb", "={di},={si},={cx},0,1,2,~{memory},~{dirflag},~{fpsr},~{flags}"(i8* %Dest, i8* %Src, i32 %Count)			// CHECK-I386: call { i8, i8, i32 } asm sideeffect "rep movsb", "={di},={si},={cx},{di},{si},{cx},~{memory},~{dirflag},~{fpsr},~{flags}"(i8* %Dest, i8* %Src, i32 %Count)
	// CHECK-I386: ret void			// CHECK-I386: ret void
	// CHECK-I386: }			// CHECK-I386: }

	// CHECK-X64-LABEL: define{{.*}} void @test__movsb			// CHECK-X64-LABEL: define{{.*}} void @test__movsb
	// CHECK-X64: call { i8, i8, i64 } asm sideeffect "rep movsb", "={di},={si},={cx},0,1,2,~{memory},~{dirflag},~{fpsr},~{flags}"(i8* %Dest, i8* %Src, i64 %Count)			// CHECK-X64: call { i8, i8, i64 } asm sideeffect "rep movsb", "={di},={si},={cx},{di},{si},{cx},~{memory},~{dirflag},~{fpsr},~{flags}"(i8* %Dest, i8* %Src, i64 %Count)
	// CHECK-X64: ret void			// CHECK-X64: ret void
	// CHECK-X64: }			// CHECK-X64: }

	void test__stosw(unsigned short *Dest, unsigned short Data, size_t Count) {			void test__stosw(unsigned short *Dest, unsigned short Data, size_t Count) {
	return __stosw(Dest, Data, Count);			return __stosw(Dest, Data, Count);
	}			}
	// CHECK-I386-LABEL: define{{.*}} void @test__stosw			// CHECK-I386-LABEL: define{{.*}} void @test__stosw
	// CHECK-I386: call { i16, i32 } asm sideeffect "rep stosw", "={di},={cx},{ax},0,1,~{memory},~{dirflag},~{fpsr},~{flags}"(i16 %Data, i16 %Dest, i32 %Count)			// CHECK-I386: call { i16, i32 } asm sideeffect "rep stosw", "={di},={cx},{ax},{di},{cx},~{memory},~{dirflag},~{fpsr},~{flags}"(i16 %Data, i16 %Dest, i32 %Count)
	// CHECK-I386: ret void			// CHECK-I386: ret void
	// CHECK-I386: }			// CHECK-I386: }

	// CHECK-X64-LABEL: define{{.*}} void @test__stosw			// CHECK-X64-LABEL: define{{.*}} void @test__stosw
	// CHECK-X64: call { i16, i64 } asm sideeffect "rep stosw", "={di},={cx},{ax},0,1,~{memory},~{dirflag},~{fpsr},~{flags}"(i16 %Data, i16 %Dest, i64 %Count)			// CHECK-X64: call { i16, i64 } asm sideeffect "rep stosw", "={di},={cx},{ax},{di},{cx},~{memory},~{dirflag},~{fpsr},~{flags}"(i16 %Data, i16 %Dest, i64 %Count)
	// CHECK-X64: ret void			// CHECK-X64: ret void
	// CHECK-X64: }			// CHECK-X64: }

	void test__movsw(unsigned short Dest, unsigned short Src, size_t Count) {			void test__movsw(unsigned short Dest, unsigned short Src, size_t Count) {
	return __movsw(Dest, Src, Count);			return __movsw(Dest, Src, Count);
	}			}
	// CHECK-I386-LABEL: define{{.*}} void @test__movsw			// CHECK-I386-LABEL: define{{.*}} void @test__movsw
	// CHECK-I386: call { i16, i16, i32 } asm sideeffect "rep movsw", "={di},={si},={cx},0,1,2,~{memory},~{dirflag},~{fpsr},~{flags}"(i16* %Dest, i16* %Src, i32 %Count)			// CHECK-I386: call { i16, i16, i32 } asm sideeffect "rep movsw", "={di},={si},={cx},{di},{si},{cx},~{memory},~{dirflag},~{fpsr},~{flags}"(i16* %Dest, i16* %Src, i32 %Count)
	// CHECK-I386: ret void			// CHECK-I386: ret void
	// CHECK-I386: }			// CHECK-I386: }

	// CHECK-X64-LABEL: define{{.*}} void @test__movsw			// CHECK-X64-LABEL: define{{.*}} void @test__movsw
	// CHECK-X64: call { i16, i16, i64 } asm sideeffect "rep movsw", "={di},={si},={cx},0,1,2,~{memory},~{dirflag},~{fpsr},~{flags}"(i16* %Dest, i16* %Src, i64 %Count)			// CHECK-X64: call { i16, i16, i64 } asm sideeffect "rep movsw", "={di},={si},={cx},{di},{si},{cx},~{memory},~{dirflag},~{fpsr},~{flags}"(i16* %Dest, i16* %Src, i64 %Count)
	// CHECK-X64: ret void			// CHECK-X64: ret void
	// CHECK-X64: }			// CHECK-X64: }

	void test__stosd(unsigned long *Dest, unsigned long Data, size_t Count) {			void test__stosd(unsigned long *Dest, unsigned long Data, size_t Count) {
	return __stosd(Dest, Data, Count);			return __stosd(Dest, Data, Count);
	}			}
	// CHECK-I386-LABEL: define{{.*}} void @test__stosd			// CHECK-I386-LABEL: define{{.*}} void @test__stosd
	// CHECK-I386: call { i32, i32 } asm sideeffect "rep stosl", "={di},={cx},{ax},0,1,~{memory},~{dirflag},~{fpsr},~{flags}"(i32 %Data, i32 %Dest, i32 %Count)			// CHECK-I386: call { i32, i32 } asm sideeffect "rep stosl", "={di},={cx},{ax},{di},{cx},~{memory},~{dirflag},~{fpsr},~{flags}"(i32 %Data, i32 %Dest, i32 %Count)
	// CHECK-I386: ret void			// CHECK-I386: ret void
	// CHECK-I386: }			// CHECK-I386: }

	// CHECK-X64-LABEL: define{{.*}} void @test__stosd			// CHECK-X64-LABEL: define{{.*}} void @test__stosd
	// CHECK-X64: call { i32, i64 } asm sideeffect "rep stosl", "={di},={cx},{ax},0,1,~{memory},~{dirflag},~{fpsr},~{flags}"(i32 %Data, i32 %Dest, i64 %Count)			// CHECK-X64: call { i32, i64 } asm sideeffect "rep stosl", "={di},={cx},{ax},{di},{cx},~{memory},~{dirflag},~{fpsr},~{flags}"(i32 %Data, i32 %Dest, i64 %Count)
	// CHECK-X64: ret void			// CHECK-X64: ret void
	// CHECK-X64: }			// CHECK-X64: }

	void test__movsd(unsigned long Dest, unsigned long Src, size_t Count) {			void test__movsd(unsigned long Dest, unsigned long Src, size_t Count) {
	return __movsd(Dest, Src, Count);			return __movsd(Dest, Src, Count);
	}			}
	// CHECK-I386-LABEL: define{{.*}} void @test__movsd			// CHECK-I386-LABEL: define{{.*}} void @test__movsd
	// CHECK-I386: call { i32, i32, i32 } asm sideeffect "rep movsl", "={di},={si},={cx},0,1,2,~{memory},~{dirflag},~{fpsr},~{flags}"(i32* %Dest, i32* %Src, i32 %Count)			// CHECK-I386: call { i32, i32, i32 } asm sideeffect "rep movsl", "={di},={si},={cx},{di},{si},{cx},~{memory},~{dirflag},~{fpsr},~{flags}"(i32* %Dest, i32* %Src, i32 %Count)
	// CHECK-I386: ret void			// CHECK-I386: ret void
	// CHECK-I386: }			// CHECK-I386: }

	// CHECK-X64-LABEL: define{{.*}} void @test__movsd			// CHECK-X64-LABEL: define{{.*}} void @test__movsd
	// CHECK-X64: call { i32, i32, i64 } asm sideeffect "rep movsl", "={di},={si},={cx},0,1,2,~{memory},~{dirflag},~{fpsr},~{flags}"(i32* %Dest, i32* %Src, i64 %Count)			// CHECK-X64: call { i32, i32, i64 } asm sideeffect "rep movsl", "={di},={si},={cx},{di},{si},{cx},~{memory},~{dirflag},~{fpsr},~{flags}"(i32* %Dest, i32* %Src, i64 %Count)
	// CHECK-X64: ret void			// CHECK-X64: ret void
	// CHECK-X64: }			// CHECK-X64: }

	#ifdef __x86_64__			#ifdef __x86_64__
	void test__stosq(unsigned __int64 *Dest, unsigned __int64 Data, size_t Count) {			void test__stosq(unsigned __int64 *Dest, unsigned __int64 Data, size_t Count) {
	return __stosq(Dest, Data, Count);			return __stosq(Dest, Data, Count);
	}			}
	// CHECK-X64-LABEL: define{{.*}} void @test__stosq			// CHECK-X64-LABEL: define{{.*}} void @test__stosq
	// CHECK-X64: call { i64, i64 } asm sideeffect "rep stosq", "={di},={cx},{ax},0,1,~{memory},~{dirflag},~{fpsr},~{flags}"(i64 %Data, i64 %Dest, i64 %Count)			// CHECK-X64: call { i64, i64 } asm sideeffect "rep stosq", "={di},={cx},{ax},{di},{cx},~{memory},~{dirflag},~{fpsr},~{flags}"(i64 %Data, i64 %Dest, i64 %Count)
	// CHECK-X64: ret void			// CHECK-X64: ret void
	// CHECK-X64: }			// CHECK-X64: }

	void test__movsq(unsigned __int64 Dest, unsigned __int64 Src, size_t Count) {			void test__movsq(unsigned __int64 Dest, unsigned __int64 Src, size_t Count) {
	return __movsq(Dest, Src, Count);			return __movsq(Dest, Src, Count);
	}			}
	// CHECK-X64-LABEL: define{{.*}} void @test__movsq			// CHECK-X64-LABEL: define{{.*}} void @test__movsq
	// CHECK-X64: call { i64, i64, i64 } asm sideeffect "rep movsq", "={di},={si},={cx},0,1,2,~{memory},~{dirflag},~{fpsr},~{flags}"(i64* %Dest, i64* %Src, i64 %Count)			// CHECK-X64: call { i64, i64, i64 } asm sideeffect "rep movsq", "={di},={si},={cx},{di},{si},{cx},~{memory},~{dirflag},~{fpsr},~{flags}"(i64* %Dest, i64* %Src, i64 %Count)
	// CHECK-X64: ret void			// CHECK-X64: ret void
	// CHECK-X64: }			// CHECK-X64: }
	#endif			#endif

	void test__ud2(void) {			void test__ud2(void) {
	__ud2();			__ud2();
	}			}
	// CHECK-INTEL-LABEL: define{{.*}} void @test__ud2()			// CHECK-INTEL-LABEL: define{{.*}} void @test__ud2()
	▲ Show 20 Lines • Show All 509 Lines • ▼ Show 20 Lines
	}			}
	long test_InterlockedExchange_HLERelease(long volatile *Target, long Value) {			long test_InterlockedExchange_HLERelease(long volatile *Target, long Value) {
	// CHECK-INTEL: define{{.}} i32 @test_InterlockedExchange_HLERelease(i32{{[a-z_ ]}}%Target, i32{{[a-z_ ]}}%Value)			// CHECK-INTEL: define{{.}} i32 @test_InterlockedExchange_HLERelease(i32{{[a-z_ ]}}%Target, i32{{[a-z_ ]}}%Value)
	// CHECK-INTEL: call i32 asm sideeffect ".byte 0xf3 ; lock ; xchg $0, $1", "=r,=m,0,m,~{memory},~{dirflag},~{fpsr},~{flags}"(i32* %Target, i32 %Value, i32* %Target)			// CHECK-INTEL: call i32 asm sideeffect ".byte 0xf3 ; lock ; xchg $0, $1", "=r,=m,0,m,~{memory},~{dirflag},~{fpsr},~{flags}"(i32* %Target, i32 %Value, i32* %Target)
	return _InterlockedExchange_HLERelease(Target, Value);			return _InterlockedExchange_HLERelease(Target, Value);
	}			}
	long test_InterlockedCompareExchange_HLEAcquire(long volatile *Destination,			long test_InterlockedCompareExchange_HLEAcquire(long volatile *Destination,
	long Exchange, long Comparand) {			long Exchange, long Comparand) {
	// CHECK-INTEL: define{{.}} i32 @test_InterlockedCompareExchange_HLEAcquire(i32{{[a-z_ ]}}%Destination, i32{{[a-z_ ]}}%Exchange, i32{{[a-z_ ]*}}%Comparand)			// CHECK-INTEL: define{{.}} i32 @test_InterlockedCompareExchange_HLEAcquire(i32{{[a-z_ ]}}%Destination, i32{{[a-z_ ]}}%Exchange, i32{{[a-z_ ]*}}%Comparand)
	// CHECK-INTEL: call i32 asm sideeffect ".byte 0xf2 ; lock ; cmpxchg $2, $1", "={ax},=m,r,0,m,~{memory},~{dirflag},~{fpsr},~{flags}"(i32* %Destination, i32 %Exchange, i32 %Comparand, i32* %Destination)			// CHECK-INTEL: call i32 asm sideeffect ".byte 0xf2 ; lock ; cmpxchg $2, $1", "={ax},=m,r,{ax},m,~{memory},~{dirflag},~{fpsr},~{flags}"(i32* %Destination, i32 %Exchange, i32 %Comparand, i32* %Destination)
	return _InterlockedCompareExchange_HLEAcquire(Destination, Exchange, Comparand);			return _InterlockedCompareExchange_HLEAcquire(Destination, Exchange, Comparand);
	}			}
	long test_InterlockedCompareExchange_HLERelease(long volatile *Destination,			long test_InterlockedCompareExchange_HLERelease(long volatile *Destination,
	long Exchange, long Comparand) {			long Exchange, long Comparand) {
	// CHECK-INTEL: define{{.}} i32 @test_InterlockedCompareExchange_HLERelease(i32{{[a-z_ ]}}%Destination, i32{{[a-z_ ]}}%Exchange, i32{{[a-z_ ]*}}%Comparand)			// CHECK-INTEL: define{{.}} i32 @test_InterlockedCompareExchange_HLERelease(i32{{[a-z_ ]}}%Destination, i32{{[a-z_ ]}}%Exchange, i32{{[a-z_ ]*}}%Comparand)
	// CHECK-INTEL: call i32 asm sideeffect ".byte 0xf3 ; lock ; cmpxchg $2, $1", "={ax},=m,r,0,m,~{memory},~{dirflag},~{fpsr},~{flags}"(i32* %Destination, i32 %Exchange, i32 %Comparand, i32* %Destination)			// CHECK-INTEL: call i32 asm sideeffect ".byte 0xf3 ; lock ; cmpxchg $2, $1", "={ax},=m,r,{ax},m,~{memory},~{dirflag},~{fpsr},~{flags}"(i32* %Destination, i32 %Exchange, i32 %Comparand, i32* %Destination)
	return _InterlockedCompareExchange_HLERelease(Destination, Exchange, Comparand);			return _InterlockedCompareExchange_HLERelease(Destination, Exchange, Comparand);
	}			}
	#endif			#endif
	#if defined(__x86_64__)			#if defined(__x86_64__)
	__int64 test_InterlockedExchange64_HLEAcquire(__int64 volatile *Target, __int64 Value) {			__int64 test_InterlockedExchange64_HLEAcquire(__int64 volatile *Target, __int64 Value) {
	// CHECK-X64: define{{.}} i64 @test_InterlockedExchange64_HLEAcquire(i64{{[a-z_ ]}}%Target, i64{{[a-z_ ]}}%Value)			// CHECK-X64: define{{.}} i64 @test_InterlockedExchange64_HLEAcquire(i64{{[a-z_ ]}}%Target, i64{{[a-z_ ]}}%Value)
	// CHECK-X64: call i64 asm sideeffect ".byte 0xf2 ; lock ; xchg $0, $1", "=r,=m,0,m,~{memory},~{dirflag},~{fpsr},~{flags}"(i64* %Target, i64 %Value, i64* %Target)			// CHECK-X64: call i64 asm sideeffect ".byte 0xf2 ; lock ; xchg $0, $1", "=r,=m,0,m,~{memory},~{dirflag},~{fpsr},~{flags}"(i64* %Target, i64 %Value, i64* %Target)
	return _InterlockedExchange64_HLEAcquire(Target, Value);			return _InterlockedExchange64_HLEAcquire(Target, Value);
	}			}
	__int64 test_InterlockedExchange64_HLERelease(__int64 volatile *Target, __int64 Value) {			__int64 test_InterlockedExchange64_HLERelease(__int64 volatile *Target, __int64 Value) {
	// CHECK-X64: define{{.}} i64 @test_InterlockedExchange64_HLERelease(i64{{[a-z_ ]}}%Target, i64{{[a-z_ ]}}%Value)			// CHECK-X64: define{{.}} i64 @test_InterlockedExchange64_HLERelease(i64{{[a-z_ ]}}%Target, i64{{[a-z_ ]}}%Value)
	// CHECK-X64: call i64 asm sideeffect ".byte 0xf3 ; lock ; xchg $0, $1", "=r,=m,0,m,~{memory},~{dirflag},~{fpsr},~{flags}"(i64* %Target, i64 %Value, i64* %Target)			// CHECK-X64: call i64 asm sideeffect ".byte 0xf3 ; lock ; xchg $0, $1", "=r,=m,0,m,~{memory},~{dirflag},~{fpsr},~{flags}"(i64* %Target, i64 %Value, i64* %Target)
	return _InterlockedExchange64_HLERelease(Target, Value);			return _InterlockedExchange64_HLERelease(Target, Value);
	}			}
	__int64 test_InterlockedCompareExchange64_HLEAcquire(__int64 volatile *Destination,			__int64 test_InterlockedCompareExchange64_HLEAcquire(__int64 volatile *Destination,
	__int64 Exchange, __int64 Comparand) {			__int64 Exchange, __int64 Comparand) {
	// CHECK-X64: define{{.}} i64 @test_InterlockedCompareExchange64_HLEAcquire(i64{{[a-z_ ]}}%Destination, i64{{[a-z_ ]}}%Exchange, i64{{[a-z_ ]*}}%Comparand)			// CHECK-X64: define{{.}} i64 @test_InterlockedCompareExchange64_HLEAcquire(i64{{[a-z_ ]}}%Destination, i64{{[a-z_ ]}}%Exchange, i64{{[a-z_ ]*}}%Comparand)
	// CHECK-X64: call i64 asm sideeffect ".byte 0xf2 ; lock ; cmpxchg $2, $1", "={ax},=m,r,0,m,~{memory},~{dirflag},~{fpsr},~{flags}"(i64* %Destination, i64 %Exchange, i64 %Comparand, i64* %Destination)			// CHECK-X64: call i64 asm sideeffect ".byte 0xf2 ; lock ; cmpxchg $2, $1", "={ax},=m,r,{ax},m,~{memory},~{dirflag},~{fpsr},~{flags}"(i64* %Destination, i64 %Exchange, i64 %Comparand, i64* %Destination)
	return _InterlockedCompareExchange64_HLEAcquire(Destination, Exchange, Comparand);			return _InterlockedCompareExchange64_HLEAcquire(Destination, Exchange, Comparand);
	}			}
	__int64 test_InterlockedCompareExchange64_HLERelease(__int64 volatile *Destination,			__int64 test_InterlockedCompareExchange64_HLERelease(__int64 volatile *Destination,
	__int64 Exchange, __int64 Comparand) {			__int64 Exchange, __int64 Comparand) {
	// CHECK-X64: define{{.}} i64 @test_InterlockedCompareExchange64_HLERelease(i64{{[a-z_ ]}}%Destination, i64{{[a-z_ ]}}%Exchange, i64{{[a-z_ ]*}}%Comparand)			// CHECK-X64: define{{.}} i64 @test_InterlockedCompareExchange64_HLERelease(i64{{[a-z_ ]}}%Destination, i64{{[a-z_ ]}}%Exchange, i64{{[a-z_ ]*}}%Comparand)
	// CHECK-X64: call i64 asm sideeffect ".byte 0xf3 ; lock ; cmpxchg $2, $1", "={ax},=m,r,0,m,~{memory},~{dirflag},~{fpsr},~{flags}"(i64* %Destination, i64 %Exchange, i64 %Comparand, i64* %Destination)			// CHECK-X64: call i64 asm sideeffect ".byte 0xf3 ; lock ; cmpxchg $2, $1", "={ax},=m,r,{ax},m,~{memory},~{dirflag},~{fpsr},~{flags}"(i64* %Destination, i64 %Exchange, i64 %Comparand, i64* %Destination)
	return _InterlockedCompareExchange64_HLERelease(Destination, Exchange, Comparand);			return _InterlockedCompareExchange64_HLERelease(Destination, Exchange, Comparand);
	}			}
	#endif			#endif

	#if defined(__arm__) \|\| defined(__aarch64__)			#if defined(__arm__) \|\| defined(__aarch64__)
	char test_InterlockedExchangeAdd8_acq(char volatile *value, char mask) {			char test_InterlockedExchangeAdd8_acq(char volatile *value, char mask) {
	return _InterlockedExchangeAdd8_acq(value, mask);			return _InterlockedExchangeAdd8_acq(value, mask);
	}			}
	▲ Show 20 Lines • Show All 737 Lines • Show Last 20 Lines

clang/test/CodeGen/x86-asm-register-constraint-mix.c

This file was added.

				// REQUIRES: x86-registered-target
				// RUN: %clang_cc1 -triple x86_64-pc-linux-gnu -O2 -emit-llvm %s -o - \| FileCheck %s

				unsigned long foo(unsigned long addr, unsigned long a0,
				unsigned long a1, unsigned long a2,
				unsigned long a3, unsigned long a4,
				unsigned long a5) {
				register unsigned long result asm("rax");
				register unsigned long addr1 asm("rax") = addr;
				register unsigned long b0 asm("rdi") = a0;
				register unsigned long b1 asm("rsi") = a1;
				register unsigned long b2 asm("rdx") = a2;
				register unsigned long b3 asm("rcx") = a3;
				register unsigned long b4 asm("r8") = a4;
				register unsigned long b5 asm("r9") = a5;

				// CHECK: tail call i64 asm "call *$1", "={rax},{rax},{rdi},{rsi},{rdx},{rcx},{r8},{r9},{rax},~{dirflag},~{fpsr},~{flags}"(i64 %addr, i64 %a0, i64 %a1, i64 %a2, i64 %a3, i64 %a4, i64 %a5, i64 undef)
				asm("call *%1"
				: "+r" (result)
				: "r"(addr1), "r"(b0), "r"(b1), "r"(b2), "r"(b3), "r"(b4), "r"(b5));
				return result;
				}

				unsigned long foo1(unsigned long addr, unsigned long a0,
				unsigned long a1, unsigned long a2,
				unsigned long a3, unsigned long a4,
				unsigned long a5) {
				unsigned long result;
				unsigned long addr1 = addr;
				unsigned long b0 = a0;
				unsigned long b1 = a1;
				unsigned long b2 = a2;
				unsigned long b3 = a3;
				unsigned long b4 = a4;
				unsigned long b5 = a5;

				// CHECK: tail call i64 asm "call *$1", "={rax},{rax},{rdi},{rsi},{rdx},{rcx},{r8},{r9},{rax},~{dirflag},~{fpsr},~{flags}"(i64 %addr, i64 %a0, i64 %a1, i64 %a2, i64 %a3, i64 %a4, i64 %a5, i64 undef)
				asm("call *%1"
				: "+{rax}" (result)
				: "{rax}"(addr1), "{rdi}"(b0), "{rsi}"(b1), "{rdx}"(b2), "{rcx}"(b3), "{r8}"(b4), "{r9}"(b5));
				return result;
				}

				unsigned long foo2(unsigned long addr, unsigned long a0,
				unsigned long a1, unsigned long a2,
				unsigned long a3, unsigned long a4,
				unsigned long a5) {
				register unsigned long result asm("rax");
				unsigned long addr1 = addr;
				unsigned long b0 = a0;
				register unsigned long b1 asm ("rsi") = a1;
				unsigned long b2 = a2;
				unsigned long b3 = a3;
				register unsigned long b4 asm ("r8") = a4;
				unsigned long b5 = a5;

				// CHECK: tail call i64 asm "call *$1", "={rax},{rax},{rdi},{rsi},{rdx},{rcx},{r8},{r9},{rax},~{dirflag},~{fpsr},~{flags}"(i64 %addr, i64 %a0, i64 %a1, i64 %a2, i64 %a3, i64 %a4, i64 %a5, i64 undef)
				asm("call *%1"
				: "+r" (result)
				: "{rax}"(addr1), "{rdi}"(b0), "r"(b1), "{rdx}"(b2), "{rcx}"(b3), "r"(b4), "{r9}"(b5));
				return result;
				}

clang/test/CodeGen/z-hard-register-inline-asm.c

This file was added.

				// RUN: %clang_cc1 -triple s390x-ibm-linux -emit-llvm -o - %s \| FileCheck %s
				// RUN: %clang_cc1 -triple s390x-ibm-zos -emit-llvm -o - %s \| FileCheck %s

				void f1() {
				int a, b;
				register int c asm("r1");
				register int d asm("r2");

				// CHECK-COUNT-2: call i32 asm "lhi $0,5\0A", "={r1}"
				__asm("lhi %0,5\n"
				: "={r1}"(a)
				:
				:);
				__asm("lhi %0,5\n"
				: "=r"(c)
				:
				:);

				// CHECK-COUNT-2: call i32 asm "lgr $0,$1\0A", "={r1},{r2}"
				__asm("lgr %0,%1\n"
				: "={r1}"(a)
				: "{r2}"(b)
				:);
				__asm("lgr %0,%1\n"
				: "=r"(c)
				: "r"(d)
				:);

				// CHECK-COUNT-2: call i32 asm "lgr $0,$1\0A", "={r1},{r2}"
				__asm("lgr %0,%1\n"
				: "={%r1}"(a)
				: "{%r2}"(b)
				:);
				__asm("lgr %0,%1\n"
				: "={r1}"(a)
				: "{%r2}"(b)
				:);

				// CHECK-COUNT-2: call i32 asm "lgr $0,$1\0A", "=&{r1},{r2}"
				__asm("lgr %0,%1\n"
				: "=&{r1}"(a)
				: "{%r2}"(b)
				:);
				__asm("lgr %0,%1\n"
				: "=&r"(c)
				: "r"(d)
				:);
				}

clang/test/Sema/z-hard-register-inline-asm.c

This file was added.

				// RUN: %clang_cc1 %s -triple s390x-ibm-linux -fsyntax-only -verify
				// RUN: %clang_cc1 %s -triple s390x-ibm-zos -fsyntax-only -verify

				void f1() {
				int a, b;
				__asm("lhi %0,5\n"
				: "={r2}"(a)
				:);

				__asm("lgr %0,%1\n"
				: "={r2}"(a)
				: "{r1}"(b));

				__asm("lgr %0,%1\n"
				: "={r2}"(a)
				: "{%r1}"(b));

				__asm("lgr %0,%1\n"
				: "=&{r1}"(a)
				: "{r2}"(b));

				__asm("lhi %0,5\n"
				: "={r2"(a) // expected-error {{invalid output constraint '={r2' in asm}}
				:);

				__asm("lhi %0,5\n"
				: "={r17}"(a) // expected-error {{invalid output constraint '={r17}' in asm}}
				:);

				__asm("lhi %0,5\n"
				: "={}"(a) // expected-error {{invalid output constraint '={}' in asm}}
				:);

				__asm("lhi %0,5\n"
				: "=&{r2"(a) // expected-error {{invalid output constraint '=&{r2' in asm}}
				:);

				__asm("lgr %0,%1\n"
				: "=r"(a)
				: "{r1"(b)); // expected-error {{invalid input constraint '{r1' in asm}}

				__asm("lgr %0,%1\n"
				: "=r"(a)
				: "{}"(b)); // expected-error {{invalid input constraint '{}' in asm}}

				__asm("lgr %0,%1\n"
				: "={r1}"(a)
				: "{r17}"(b)); // expected-error {{invalid input constraint '{r17}' in asm}}
				}