This is an archive of the discontinued LLVM Phabricator instance.


Differential D141189

[Mips] Set setMaxAtomicSizeInBitsSupported
ClosedPublic

Authored by brad on Jan 7 2023, 12:56 AM.


Details

Reviewers

atanasyan
sdardis
MaskRay
wzssyqa
efriedma

Commits

rG7973d5196597: [Mips] Set setMaxAtomicSizeInBitsSupported

Summary

Set setMaxAtomicSizeInBitsSupported for Mips. Set the value as appropriate for 64-bit MIPS vs 32-bit and take MIPS-I into consideration.

I am not quite sure if I used the appropriate Subtarget checks.

Diff Detail

Repository: rG LLVM Github Monorepo

Unit Tests: Failed

	Time	Test
	60,250 ms	x64 debian > Clang.CodeGen/RISCV/rvv-intrinsics-autogenerated/policy/non-overloaded::vloxseg.c
	60,320 ms	x64 debian > Clang.CodeGen/RISCV/rvv-intrinsics-autogenerated/policy/non-overloaded::vluxseg.c
	60,300 ms	x64 debian > Clang.CodeGen/RISCV/rvv-intrinsics-autogenerated/policy/overloaded::vloxseg.c
	60,250 ms	x64 debian > Clang.CodeGen/RISCV/rvv-intrinsics-autogenerated/policy/overloaded::vluxseg.c

Event Timeline

brad created this revision. Jan 7 2023, 12:56 AM

Herald added a project: Restricted Project. Jan 7 2023, 12:56 AM

Herald added subscribers: jrtc27, hiraditya, arichardson.

brad requested review of this revision. Jan 7 2023, 12:56 AM

Harbormaster completed remote builds in B206261: Diff 487067. Jan 7 2023, 2:48 AM

Use update_llc_test_checks.py to update test.

Harbormaster completed remote builds in B207648: Diff 489012. Jan 13 2023, 10:12 AM

brad added a reviewer: MaskRay. Jan 13 2023, 3:30 PM

ping.

arsenm added a subscriber: arsenm. Jan 17 2023, 12:48 PM

arsenm added inline comments.

llvm/lib/Target/Mips/MipsISelLowering.cpp
500–501	Else case doesn't make sense to me but I know nothing about mips

brad added inline comments. Jan 17 2023, 1:02 PM

llvm/lib/Target/Mips/MipsISelLowering.cpp
500–501	> Else case doesn't make sense to me but I know nothing about mips

	To deal with MIPS-I not having support for atomics.

ping.

MaskRay added a reviewer: wzssyqa. Feb 3 2023, 12:16 AM

wzssyqa added inline comments. Feb 3 2023, 2:06 AM

llvm/lib/Target/Mips/MipsISelLowering.cpp
500–501	Sorry, I don't know what the value passed to `setMaxAtomicSizeInBitsSupported` means. Maybe one of the following:

	- Load/store instructions have no quirky behavior, that is, a single instruction is never split into several accesses. If so, MIPS has never had this problem, since MIPS I.
	- Normal load/store instructions guarantee that data is synchronized in a multicore system. If so, MIPS has never achieved this.
	- The architecture has some instructions to help with atomic operations. If so, MIPS has had LL/SC since MIPS II.

brad added inline comments. Feb 3 2023, 12:15 PM

llvm/lib/Target/Mips/MipsISelLowering.cpp
500–501	setMaxAtomicSizeInBitsSupported() indicates the maximum size of supported atomic operations that are all lock-free. 32-bit MIPS does not have hardware support for lock-free 64-bit atomics.
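As an illustration of the rule described above, the size selection in Diff 489012 can be sketched as a free function outside of LLVM. `SubtargetInfo`, `maxLockFreeAtomicBits` and `needsLibcall` are hypothetical names invented for this sketch, not LLVM APIs; only the branch structure is taken from the patch.

```cpp
// Hypothetical stand-in for the two MipsSubtarget queries used in the patch.
struct SubtargetInfo {
  bool IsGP64bit; // 64-bit general-purpose registers
  bool HasMips2;  // MIPS II or later, so LL/SC is available
};

// Largest atomic width (in bits) that would be handled inline; anything
// wider is expected to become an __atomic_* library call.
inline unsigned maxLockFreeAtomicBits(const SubtargetInfo &ST) {
  if (ST.IsGP64bit)
    return 64;
  if (ST.HasMips2)
    return 32;
  return 0; // MIPS I: no LL/SC, so nothing is treated as lock-free
}

// True when an atomic access of SizeInBits exceeds the supported maximum.
inline bool needsLibcall(const SubtargetInfo &ST, unsigned SizeInBits) {
  return SizeInBits > maxLockFreeAtomicBits(ST);
}
```

Under this sketch, a 64-bit atomic load on a 32-bit MIPS II+ subtarget reports that it needs a libcall, which is consistent with the `__atomic_load_8` calls in the updated test checks.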

wzssyqa added inline comments. Feb 6 2023, 12:34 AM

llvm/lib/Target/Mips/MipsISelLowering.cpp
500–501	Oh, yes, ldc1 has been available since MIPS II. And

	  ldc1 $f0, 0($4)
	  mfc1 $2, $f0
	  mfc1 $3, $f1

	is wrong for the MIPS32 FP64 ABI. For the MIPS32 FP64 ABI (which supports MIPS32R2+ only), it should look like this:

	  ldc1 $f0, 0($4)
	  mfc1 $2, $f0
	  mfhc1 $3, $f0

Tazdevil971 added a subscriber: Tazdevil971. Feb 6 2023, 6:21 AM

Makes sense to me.

For MIPS 1, we have essentially two choices: either we use setMaxAtomicSizeInBitsSupported(0), and generate __atomic_* calls, or we set setMaxAtomicSizeInBitsSupported(32), and generate native load/store with __sync_* calls for cmpxchg/atomicrmw. Either way, we can't actually provide implementations; the user has to make it work somehow. See D137980 for a similar discussion... but generating __sync_* calls seems to cause less trouble. (Not sure if anyone cares about MIPS 1 these days, though.)

Hi! Sorry to jump in randomly, but it just so happens that I am working on a very similar patch right now. Funnily enough, I need to make everything work on MIPS I, because there are still people who care about that target, mainly in the homebrew scene: the CPU of the PlayStation 1 and the coprocessor of the PlayStation 2 are both MIPS I targets.

Wouldn't it be more correct to set atomic widths to 0 and generate __atomic_* calls? Just emitting sync fences on CAS/RMW operations breaks with multithreading or preemption. I'm not so sure about the PlayStation 1, but I'm pretty sure the PlayStation 2 has multithreading support on the coprocessor. Also, as has already been pointed out, MIPS I doesn't even support sync operations, so relying on some implementation of them seems more complicated to me.

> Wouldn't it be more correct to set atomic widths to 0 and generate __atomic_* calls?

You need some sort of operating-system level magic to make atomicrmw/cmpxchg work without ll/sc. Disabling interrupts, or a restartable sequence, or something along those lines. Most of the ways you'd implement that end up being compatible with lowering atomic load/store to plain load/store operations. Granted, other implementations are possible.
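To make the "disabling interrupts" option concrete, here is a minimal host-runnable sketch of a compare-and-swap for a single-core target without ll/sc. `disableInterrupts`, `restoreInterrupts`, `compareExchange32` and `atomicLoad32` are hypothetical names; on real hardware the two hooks would mask and restore interrupts, whereas here they only count nesting so the code can run anywhere.

```cpp
#include <cstdint>

// Hypothetical platform hooks (assumption): stand-ins for masking and
// restoring interrupts. They only track nesting depth in this sketch.
static int InterruptsDisabledDepth = 0;
inline void disableInterrupts() { ++InterruptsDisabledDepth; }
inline void restoreInterrupts() { --InterruptsDisabledDepth; }

// Compare-and-swap built from plain loads and stores. On a single core,
// nothing can preempt the sequence while interrupts are masked.
inline bool compareExchange32(volatile uint32_t *Ptr, uint32_t &Expected,
                              uint32_t Desired) {
  disableInterrupts();
  uint32_t Old = *Ptr;
  bool Success = (Old == Expected);
  if (Success)
    *Ptr = Desired;
  else
    Expected = Old; // report the value actually observed
  restoreInterrupts();
  return Success;
}

// An aligned 32-bit load is a single access, so it needs no protection here.
inline uint32_t atomicLoad32(const volatile uint32_t *Ptr) { return *Ptr; }
```

The unprotected `atomicLoad32` is the point of the comment above: with this kind of implementation, atomic load/store can stay plain load/store while only cmpxchg/atomicrmw need the helper.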

Yes exactly, what I'm saying is that if we just emit sync fences (in the form of libcalls) for CAS/RMW operations it might become impossible to target systems that have some sort of preemption. So I personally would prefer to set atomic width to 0 and generate libcalls for all atomic operations, so implementors for specific environments can do the right thing for the platform.
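As a rough sketch of what "implementors doing the right thing for the platform" could look like, the following emulates 64-bit atomic load and store behind one global lock. `envAtomicLoad8` and `envAtomicStore8` are hypothetical stand-ins for the `__atomic_load_8` / `__atomic_store_8` libcalls, and `std::atomic_flag` stands in for whatever lock primitive the environment really offers (an assumption; a MIPS I runtime would need its own).

```cpp
#include <atomic>
#include <cstdint>

// One global spinlock guarding every emulated 64-bit atomic access.
static std::atomic_flag EmulationLock = ATOMIC_FLAG_INIT;

// RAII guard: spins until the lock is taken, releases it on scope exit.
struct EmulationLockGuard {
  EmulationLockGuard() {
    while (EmulationLock.test_and_set(std::memory_order_acquire)) {
    }
  }
  ~EmulationLockGuard() { EmulationLock.clear(std::memory_order_release); }
};

// Hypothetical libcall stand-ins. The memory-order argument is accepted
// but ignored in this sketch; every access simply takes the lock.
inline uint64_t envAtomicLoad8(const uint64_t *Ptr, int /*MemOrder*/) {
  EmulationLockGuard Guard;
  return *Ptr;
}

inline void envAtomicStore8(uint64_t *Ptr, uint64_t Val, int /*MemOrder*/) {
  EmulationLockGuard Guard;
  *Ptr = Val;
}
```

This only works if every access to the location goes through the same helpers, which is the main argument for routing all atomic operations through libcalls rather than mixing them with inline loads and stores.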

Baremetal Playstation 1 is likely the only MIPS1 target anyone will ever use with LLVM, so we should do whatever makes sense there. I don't think it makes sense to theorycraft configurations that don't exist. (If MIPS1 Linux were a thing, it would probably do something like https://www.kernel.org/doc/Documentation/arm/kernel_user_helpers.txt .)

Fair enough. Looking more in detail at Playstation 1 and 2, they both use cooperative multithreading so preemption is not an issue. So for these targets it doesn't really make a difference.

Just wanted to point out this recent GitHub bug report:

https://github.com/llvm/llvm-project/issues/61166

In fact, I don't think it is a good idea to support MIPS I at the cost of degrading the performance of MIPS II and later.

In fact, we have more than one problem here:

clang --target=mips-gnu-linux -march=mips1 fails due to the lack of sync: https://github.com/llvm/llvm-project/issues/61166

Floating-point instructions are always emitted, even when the +softfloat option is given.

./bin/llc -global-isel -mtriple=mips64el-linux-gnu -verify-machineinstrs ../llvm/test/CodeGen/Mips/GlobalISel/llvm-ir/load_atomic.ll -o - always fails.

To work with MIPS I, I guess

if (!Subtarget.hasMips2())
  setMaxAtomicSizeInBitsSupported(0);

should be enough.

wzssyqa mentioned this in D154589: MIPS: setMaxAtomicSizeInBitsSupported to 32 for MIPS I. Jul 6 2023, 3:02 AM

brad updated this revision to Diff 540614. Jul 14 2023, 7:05 PM

LGTM. (I was sort of hoping for someone with Mips expertise to show up to approve this, but lacking that, I don't see any issues here.)

This revision is now accepted and ready to land. Jul 14 2023, 7:30 PM

Harbormaster completed remote builds in B245527: Diff 540614. Jul 14 2023, 7:53 PM

wzssyqa added a comment. Jul 14 2023, 9:51 PM

This comment was removed by wzssyqa.

wzssyqa added a comment. Jul 15 2023, 2:06 AM

This comment was removed by wzssyqa.

Sorry, that was my mistake. After some digging, I learned that some microarchitectures may split a single ldc1/sdc1 into two micro-ops, so we cannot be sure that it is atomic.

So, I agree with Brad's patch now.

Maybe in the future we can determine which cores do not split it.
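The hazard with a split ldc1 can be shown with a small deterministic simulation. This is not a model of any particular core: `Word64`, `splitLoad64` and `tornRead` are hypothetical names, and the "other core" is just a callback that runs between the two half-loads.

```cpp
#include <cstdint>

// Memory modelled as two 32-bit words, as seen by a core that performs a
// 64-bit access as two separate 32-bit micro-ops.
struct Word64 {
  uint32_t Lo;
  uint32_t Hi;
};

inline void store64(Word64 &M, uint64_t V) {
  M.Lo = static_cast<uint32_t>(V);
  M.Hi = static_cast<uint32_t>(V >> 32);
}

// Reads the two halves separately. Between() stands for whatever another
// core does after the first micro-op and before the second.
template <typename Fn>
uint64_t splitLoad64(const Word64 &M, Fn Between) {
  uint32_t Lo = M.Lo; // first micro-op
  Between();          // a store from elsewhere lands here
  uint32_t Hi = M.Hi; // second micro-op
  return (static_cast<uint64_t>(Hi) << 32) | Lo;
}

// Value observed when a store of New interleaves with a split load of a
// location that previously held Old.
inline uint64_t tornRead(uint64_t Old, uint64_t New) {
  Word64 M;
  store64(M, Old);
  return splitLoad64(M, [&] { store64(M, New); });
}
```

For example, with Old = 0x00000000FFFFFFFF and New = 0x0000000100000000 the load returns 0x00000001FFFFFFFF, a value that was never stored. That is the outcome an atomic load must rule out, and why the patch falls back to `__atomic_load_8` on 32-bit MIPS.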

This revision was landed with ongoing or failed builds. Jul 15 2023, 2:29 PM

Closed by commit rG7973d5196597: [Mips] Set setMaxAtomicSizeInBitsSupported (authored by brad).

This revision was automatically updated to reflect the committed changes.

brad added a commit: rG7973d5196597: [Mips] Set setMaxAtomicSizeInBitsSupported.

Revision Contents

Path	Size
llvm/lib/Target/Mips/MipsISelLowering.cpp	7 lines
llvm/test/CodeGen/Mips/GlobalISel/llvm-ir/load_atomic.ll	114 lines

Diff 489012

llvm/lib/Target/Mips/MipsISelLowering.cpp

[... 487 lines not shown ...]
   if (ABI.IsO32()) {
     setLibcallName(RTLIB::SHL_I128, nullptr);
     setLibcallName(RTLIB::SRL_I128, nullptr);
     setLibcallName(RTLIB::SRA_I128, nullptr);
     setLibcallName(RTLIB::MUL_I128, nullptr);
     setLibcallName(RTLIB::MULO_I64, nullptr);
     setLibcallName(RTLIB::MULO_I128, nullptr);
   }

+  if (Subtarget.isGP64bit())
+    setMaxAtomicSizeInBitsSupported(64);
+  else if (Subtarget.hasMips2())
+    setMaxAtomicSizeInBitsSupported(32);
+  else
+    setMaxAtomicSizeInBitsSupported(0);
+
   setMinFunctionAlignment(Subtarget.isGP64bit() ? Align(8) : Align(4));

   // The arguments on the stack are defined in terms of 4-byte slots on O32
   // and 8-byte slots on N32/N64.
   setMinStackArgumentAlignment((ABI.IsN32() || ABI.IsN64()) ? Align(8)
                                                             : Align(4));

   setStackPointerRegisterToSaveRestore(ABI.IsN64() ? Mips::SP_64 : Mips::SP);
[... 991 lines not shown ...]

llvm/test/CodeGen/Mips/GlobalISel/llvm-ir/load_atomic.ll

[... 151 lines not shown ...]
 ; MIPS32-NEXT: nop
   %load = load atomic i32, ptr %ptr unordered, align 4
   ret i32 %load
 }

 define i64 @atomic_load_unordered_i64(ptr %ptr) {
 ; MIPS32-LABEL: atomic_load_unordered_i64:
 ; MIPS32: # %bb.0:
-; MIPS32-NEXT: ldc1 $f0, 0($4)
-; MIPS32-NEXT: mfc1 $2, $f0
+; MIPS32-NEXT: addiu $sp, $sp, -24
+; MIPS32-NEXT: .cfi_def_cfa_offset 24
+; MIPS32-NEXT: sw $ra, 20($sp) # 4-byte Folded Spill
+; MIPS32-NEXT: .cfi_offset 31, -4
+; MIPS32-NEXT: jal __atomic_load_8
+; MIPS32-NEXT: ori $5, $zero, 0
+; MIPS32-NEXT: lw $ra, 20($sp) # 4-byte Folded Reload
 ; MIPS32-NEXT: jr $ra
-; MIPS32-NEXT: mfc1 $3, $f1
+; MIPS32-NEXT: addiu $sp, $sp, 24
   %load = load atomic i64, ptr %ptr unordered, align 8
   ret i64 %load
 }

 define float @atomic_load_unordered_f32(ptr %ptr) {
 ; MIPS32-LABEL: atomic_load_unordered_f32:
 ; MIPS32: # %bb.0:
 ; MIPS32-NEXT: lwc1 $f0, 64($4)
 ; MIPS32-NEXT: jr $ra
 ; MIPS32-NEXT: nop
   %gep = getelementptr inbounds float, ptr %ptr, i32 16
   %load = load atomic float, ptr %gep unordered, align 4
   ret float %load
 }

 define double @atomic_load_unordered_f64(ptr %ptr) {
 ; MIPS32-LABEL: atomic_load_unordered_f64:
 ; MIPS32: # %bb.0:
-; MIPS32-NEXT: ldc1 $f0, 128($4)
+; MIPS32-NEXT: addiu $sp, $sp, -24
+; MIPS32-NEXT: .cfi_def_cfa_offset 24
+; MIPS32-NEXT: sw $ra, 20($sp) # 4-byte Folded Spill
+; MIPS32-NEXT: .cfi_offset 31, -4
+; MIPS32-NEXT: ori $5, $zero, 0
+; MIPS32-NEXT: ori $1, $zero, 128
+; MIPS32-NEXT: jal __atomic_load_8
+; MIPS32-NEXT: addu $4, $4, $1
+; MIPS32-NEXT: mtc1 $2, $f0
+; MIPS32-NEXT: mtc1 $3, $f1
+; MIPS32-NEXT: lw $ra, 20($sp) # 4-byte Folded Reload
 ; MIPS32-NEXT: jr $ra
-; MIPS32-NEXT: nop
+; MIPS32-NEXT: addiu $sp, $sp, 24
   %gep = getelementptr inbounds double, ptr %ptr, i32 16
   %load = load atomic double, ptr %gep unordered, align 8
   ret double %load
 }

 define ptr @atomic_load_unordered_p0i8(ptr %ptr) {
 ; MIPS32-LABEL: atomic_load_unordered_p0i8:
 ; MIPS32: # %bb.0:
[... 156 lines not shown ...]
 ; MIPS32-NEXT: nop
   %load = load atomic i32, ptr %ptr monotonic, align 4
   ret i32 %load
 }

 define i64 @atomic_load_monotonic_i64(ptr %ptr) {
 ; MIPS32-LABEL: atomic_load_monotonic_i64:
 ; MIPS32: # %bb.0:
-; MIPS32-NEXT: ldc1 $f0, 0($4)
-; MIPS32-NEXT: mfc1 $2, $f0
+; MIPS32-NEXT: addiu $sp, $sp, -24
+; MIPS32-NEXT: .cfi_def_cfa_offset 24
+; MIPS32-NEXT: sw $ra, 20($sp) # 4-byte Folded Spill
+; MIPS32-NEXT: .cfi_offset 31, -4
+; MIPS32-NEXT: jal __atomic_load_8
+; MIPS32-NEXT: ori $5, $zero, 0
+; MIPS32-NEXT: lw $ra, 20($sp) # 4-byte Folded Reload
 ; MIPS32-NEXT: jr $ra
-; MIPS32-NEXT: mfc1 $3, $f1
+; MIPS32-NEXT: addiu $sp, $sp, 24
   %load = load atomic i64, ptr %ptr monotonic, align 8
   ret i64 %load
 }

 define float @atomic_load_monotonic_f32(ptr %ptr) {
 ; MIPS32-LABEL: atomic_load_monotonic_f32:
 ; MIPS32: # %bb.0:
 ; MIPS32-NEXT: lwc1 $f0, 64($4)
 ; MIPS32-NEXT: jr $ra
 ; MIPS32-NEXT: nop
   %gep = getelementptr inbounds float, ptr %ptr, i32 16
   %load = load atomic float, ptr %gep monotonic, align 4
   ret float %load
 }

 define double @atomic_load_monotonic_f64(ptr %ptr) {
 ; MIPS32-LABEL: atomic_load_monotonic_f64:
 ; MIPS32: # %bb.0:
-; MIPS32-NEXT: ldc1 $f0, 128($4)
+; MIPS32-NEXT: addiu $sp, $sp, -24
+; MIPS32-NEXT: .cfi_def_cfa_offset 24
+; MIPS32-NEXT: sw $ra, 20($sp) # 4-byte Folded Spill
+; MIPS32-NEXT: .cfi_offset 31, -4
+; MIPS32-NEXT: ori $5, $zero, 0
+; MIPS32-NEXT: ori $1, $zero, 128
+; MIPS32-NEXT: jal __atomic_load_8
+; MIPS32-NEXT: addu $4, $4, $1
+; MIPS32-NEXT: mtc1 $2, $f0
+; MIPS32-NEXT: mtc1 $3, $f1
+; MIPS32-NEXT: lw $ra, 20($sp) # 4-byte Folded Reload
 ; MIPS32-NEXT: jr $ra
-; MIPS32-NEXT: nop
+; MIPS32-NEXT: addiu $sp, $sp, 24
   %gep = getelementptr inbounds double, ptr %ptr, i32 16
   %load = load atomic double, ptr %gep monotonic, align 8
   ret double %load
 }

 define ptr @atomic_load_monotonic_p0i8(ptr %ptr) {
 ; MIPS32-LABEL: atomic_load_monotonic_p0i8:
 ; MIPS32: # %bb.0:
[... 169 lines not shown ...]
 ; MIPS32-NEXT: nop
   %load = load atomic i32, ptr %ptr acquire, align 4
   ret i32 %load
 }

 define i64 @atomic_load_acquire_i64(ptr %ptr) {
 ; MIPS32-LABEL: atomic_load_acquire_i64:
 ; MIPS32: # %bb.0:
-; MIPS32-NEXT: ldc1 $f0, 0($4)
-; MIPS32-NEXT: sync
-; MIPS32-NEXT: mfc1 $2, $f0
+; MIPS32-NEXT: addiu $sp, $sp, -24
+; MIPS32-NEXT: .cfi_def_cfa_offset 24
+; MIPS32-NEXT: sw $ra, 20($sp) # 4-byte Folded Spill
+; MIPS32-NEXT: .cfi_offset 31, -4
+; MIPS32-NEXT: jal __atomic_load_8
+; MIPS32-NEXT: ori $5, $zero, 2
+; MIPS32-NEXT: lw $ra, 20($sp) # 4-byte Folded Reload
 ; MIPS32-NEXT: jr $ra
-; MIPS32-NEXT: mfc1 $3, $f1
+; MIPS32-NEXT: addiu $sp, $sp, 24
   %load = load atomic i64, ptr %ptr acquire, align 8
   ret i64 %load
 }

 define float @atomic_load_acquire_f32(ptr %ptr) {
 ; MIPS32-LABEL: atomic_load_acquire_f32:
 ; MIPS32: # %bb.0:
 ; MIPS32-NEXT: lwc1 $f0, 64($4)
 ; MIPS32-NEXT: sync
 ; MIPS32-NEXT: jr $ra
 ; MIPS32-NEXT: nop
   %gep = getelementptr inbounds float, ptr %ptr, i32 16
   %load = load atomic float, ptr %gep acquire, align 4
   ret float %load
 }

 define double @atomic_load_acquire_f64(ptr %ptr) {
 ; MIPS32-LABEL: atomic_load_acquire_f64:
 ; MIPS32: # %bb.0:
-; MIPS32-NEXT: ldc1 $f0, 128($4)
-; MIPS32-NEXT: sync
+; MIPS32-NEXT: addiu $sp, $sp, -24
+; MIPS32-NEXT: .cfi_def_cfa_offset 24
+; MIPS32-NEXT: sw $ra, 20($sp) # 4-byte Folded Spill
+; MIPS32-NEXT: .cfi_offset 31, -4
+; MIPS32-NEXT: ori $5, $zero, 2
+; MIPS32-NEXT: ori $1, $zero, 128
+; MIPS32-NEXT: jal __atomic_load_8
+; MIPS32-NEXT: addu $4, $4, $1
+; MIPS32-NEXT: mtc1 $2, $f0
+; MIPS32-NEXT: mtc1 $3, $f1
+; MIPS32-NEXT: lw $ra, 20($sp) # 4-byte Folded Reload
 ; MIPS32-NEXT: jr $ra
-; MIPS32-NEXT: nop
+; MIPS32-NEXT: addiu $sp, $sp, 24
   %gep = getelementptr inbounds double, ptr %ptr, i32 16
   %load = load atomic double, ptr %gep acquire, align 8
   ret double %load
 }

 define ptr @atomic_load_acquire_p0i8(ptr %ptr) {
 ; MIPS32-LABEL: atomic_load_acquire_p0i8:
 ; MIPS32: # %bb.0:
[... 170 lines not shown ...]
 ; MIPS32-NEXT: nop
   %load = load atomic i32, ptr %ptr seq_cst, align 4
   ret i32 %load
 }

 define i64 @atomic_load_seq_cst_i64(ptr %ptr) {
 ; MIPS32-LABEL: atomic_load_seq_cst_i64:
 ; MIPS32: # %bb.0:
-; MIPS32-NEXT: ldc1 $f0, 0($4)
-; MIPS32-NEXT: sync
-; MIPS32-NEXT: mfc1 $2, $f0
+; MIPS32-NEXT: addiu $sp, $sp, -24
+; MIPS32-NEXT: .cfi_def_cfa_offset 24
+; MIPS32-NEXT: sw $ra, 20($sp) # 4-byte Folded Spill
+; MIPS32-NEXT: .cfi_offset 31, -4
+; MIPS32-NEXT: jal __atomic_load_8
+; MIPS32-NEXT: ori $5, $zero, 5
+; MIPS32-NEXT: lw $ra, 20($sp) # 4-byte Folded Reload
 ; MIPS32-NEXT: jr $ra
-; MIPS32-NEXT: mfc1 $3, $f1
+; MIPS32-NEXT: addiu $sp, $sp, 24
   %load = load atomic i64, ptr %ptr seq_cst, align 8
   ret i64 %load
 }

 define float @atomic_load_seq_cst_f32(ptr %ptr) {
 ; MIPS32-LABEL: atomic_load_seq_cst_f32:
 ; MIPS32: # %bb.0:
 ; MIPS32-NEXT: lwc1 $f0, 64($4)
 ; MIPS32-NEXT: sync
 ; MIPS32-NEXT: jr $ra
 ; MIPS32-NEXT: nop
   %gep = getelementptr inbounds float, ptr %ptr, i32 16
   %load = load atomic float, ptr %gep seq_cst, align 4
   ret float %load
 }

 define double @atomic_load_seq_cst_f64(ptr %ptr) {
 ; MIPS32-LABEL: atomic_load_seq_cst_f64:
 ; MIPS32: # %bb.0:
-; MIPS32-NEXT: ldc1 $f0, 128($4)
-; MIPS32-NEXT: sync
+; MIPS32-NEXT: addiu $sp, $sp, -24
+; MIPS32-NEXT: .cfi_def_cfa_offset 24
+; MIPS32-NEXT: sw $ra, 20($sp) # 4-byte Folded Spill
+; MIPS32-NEXT: .cfi_offset 31, -4
+; MIPS32-NEXT: ori $5, $zero, 5
+; MIPS32-NEXT: ori $1, $zero, 128
+; MIPS32-NEXT: jal __atomic_load_8
+; MIPS32-NEXT: addu $4, $4, $1
+; MIPS32-NEXT: mtc1 $2, $f0
+; MIPS32-NEXT: mtc1 $3, $f1
+; MIPS32-NEXT: lw $ra, 20($sp) # 4-byte Folded Reload
 ; MIPS32-NEXT: jr $ra
-; MIPS32-NEXT: nop
+; MIPS32-NEXT: addiu $sp, $sp, 24
   %gep = getelementptr inbounds double, ptr %ptr, i32 16
   %load = load atomic double, ptr %gep seq_cst, align 8
   ret double %load
 }

 define ptr @atomic_load_seq_cst_p0i8(ptr %ptr) {
 ; MIPS32-LABEL: atomic_load_seq_cst_p0i8:
 ; MIPS32: # %bb.0:
 ; MIPS32-NEXT: lw $2, 64($4)
 ; MIPS32-NEXT: sync
 ; MIPS32-NEXT: jr $ra
 ; MIPS32-NEXT: nop
   %gep = getelementptr inbounds ptr, ptr %ptr, i32 16
   %load = load atomic ptr, ptr %gep seq_cst, align 4
   ret ptr %load
 }