Add just fadd/fsub for now.
Is this to support wg21.link/p0020 ?
Can you do fsub while you're there? It should be exactly the same thing.
AFAIK we don't care about FP flags at all here, correct? Even strictfp is irrelevant.
Yes
> Can you do fsub while you're there? It should be exactly the same thing.
AMDGPU has hardware for fadd, fmin, and fmax, so I was focusing on those first, but it's trivial to add fsub. fmin/fmax are problematic since they require deciding NaN behavior (so I fear we'll end up with two or three versions of each).
> AFAIK we don't care about FP flags at all here, correct? Even strictfp is irrelevant.
We can't algebraically combine this, so the flags shouldn't matter.
> AFAIK we don't care about FP flags at all here, correct? Even strictfp is irrelevant.
> We can't algebraically combine this, so the flags shouldn't matter.
To clarify my concern: were LLVM to support fenv properly, this instruction could spuriously set FP exception flags if, say, the fadd executes in a loop that can fail and retry. The final store could hold a result whose computation raised no FP exception, but discarded intermediate results could have raised one. But LLVM doesn't support fenv properly yet, so that's not an issue (I think!).
fenv support requires using the new constrained-operation intrinsics, but the support there is incomplete (and there's still some confusion about how they should interact with the regular IR operations, which are what the cmpxchg loop expansion uses).
I'd note that this change isn't a necessary prerequisite to implementing the C++ feature: atomic FP math is already available in C11, and clang already implements it by lowering to a cmpxchg loop. I'd be very skeptical of this addition if you hadn't mentioned that AMDGPU supports it in hardware. AFAIK, there are no other ISAs that support atomic fadd in hardware. However, that AMDGPU does have hardware support seems like a sufficient reason to add support for it to LLVM.
On LL/SC architectures, while you could theoretically execute a floating-point instruction inside the LL/SC loop, there's often a bunch of complexity regarding the potential for a trap to be raised (e.g. for software FPU emulation, or for edge cases like denormals), which may deterministically cancel the reservation, and create an infinite loop. So I'm pretty wary of any proposal to lower to an LL/SC loop containing FP instructions -- I think even on LL/SC architectures, this should generally be lowered to a cmpxchg loop.
On a general note -- this patch doesn't seem complete. I think this should be a complete implementation of the feature -- specifying an "atomicrmw fadd" should function on all architectures after this, even if not most optimally on AMDGPU (since it's not possible to lower into SelectionDAG until after the next patch).
That means pulling in the AtomicExpand changes, and also updating every architecture's shouldExpandAtomicRMWInIR to return CmpXchg for these operations.
docs/LangRef.rst, line 8443: Lost some words.

include/llvm/IR/Instructions.h, line 813: Unused?

lib/CodeGen/AtomicExpandPass.cpp, lines 1559-1560: This isn't great. And I see in your next patch you fix it, but splitting that into another patch doesn't make sense.
include/llvm/IR/Instructions.h, line 813: It's not used here, but included for completeness since all of the other instructions do this (e.g. ICmpInst::getUnsignedPredicate).

lib/CodeGen/AtomicExpandPass.cpp, lines 1559-1560: I'd disagree, since with IR changes there's usually a split to add the minimum assembler and bitcode support before making them functional in a separate commit. This is just needed to quiet the covered-switch warning.
include/llvm/IR/Instructions.h, line 813: I found a use for it when updating all the targets anyway.
LGTM
docs/LangRef.rst, line 8470: "using floating-point rules" -> "using floating-point arithmetic"