This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/Transforms/Scalar/
-
Transforms/
-
Scalar/
2/2
SROA.cpp
-
test/Transforms/SROA/
-
Transforms/
-
SROA/
1/2
alignment.ll

Differential D75266

SROA: Don't drop atomic load/store alignments (PR45010)
ClosedPublic

Authored by hans on Feb 27 2020, 8:47 AM.

Download Raw Diff

Details

Reviewers

bkramer
chandlerc
gchatelet
rnk
efriedma

Commits

rG7cb682929128: SROA: Don't drop atomic load/store alignments (PR45010)
rGd48c981697a4: SROA: Don't drop atomic load/store alignments (PR45010)

Summary

SROA will drop the explicit alignment on allocas when the ABI guarantees enough alignment. Because the alignment on new load/store instructions are set base on the alloca's alignment, that means SROA would end up dropping the alignment from atomic loads and stores, which is not allowed (see bug). For those, make sure to always carry over the alignment from the previous instruction.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

hans created this revision.Feb 27 2020, 8:47 AM

Herald added a project: Restricted Project. · View Herald TranscriptFeb 27 2020, 8:47 AM

Herald added subscribers: jfb, hiraditya. · View Herald Transcript

aganea added a subscriber: aganea.Feb 27 2020, 8:49 AM

Thank you for taking a look @hans !
Only two nits on my side.

llvm/lib/Transforms/Scalar/SROA.cpp
2523	`NewLI->setAlignment(LI.getAlign());`
2715	`NewSI->setAlignment(SI.getAlign());`

This revision now requires changes to proceed.Feb 27 2020, 8:56 AM

Using getAlign().

I would prefer to just avoid creating unaligned allocas in the first place. https://github.com/llvm/llvm-project/blob/740ed617f7d4d16e7883636c5eff994f8be7eba4/llvm/lib/Transforms/Scalar/SROA.cpp#L4163 has some code which intentionally uses an unspecified alignment... but really, there's no reason to do that. allocas with no specified alignment are implicitly aligned using the ABI type alignment. So it's semantically equivalent to std::max(Alignment, DL.getABITypeAlignment(SliceTy), just more annoying to deal with.

Really, the alignment of load/store/alloca shouldn't be a MaybeAlign in the first place, but someone needs to go through and clean up a bunch of places to make that happen.

makes sense to me

In D75266#1895826, @efriedma wrote:

I would prefer to just avoid creating unaligned allocas in the first place. https://github.com/llvm/llvm-project/blob/740ed617f7d4d16e7883636c5eff994f8be7eba4/llvm/lib/Transforms/Scalar/SROA.cpp#L4163 has some code which intentionally uses an unspecified alignment... but really, there's no reason to do that. allocas with no specified alignment are implicitly aligned using the ABI type alignment. So it's semantically equivalent to std::max(Alignment, DL.getABITypeAlignment(SliceTy), just more annoying to deal with.

The current behaviour is from https://github.com/llvm/llvm-project/commit/903790eff54f473fca8fbf6915713751a070b5df

Changing that seems like a riskier change (I want this for 10.x) whereas copying over the alignment from the store/load seems straight-forward to me.

jfb added inline comments.Feb 27 2020, 9:28 AM

llvm/test/Transforms/SROA/alignment.ll
242	From the code, it looks like you want a test where the load is volatile only as well? Should probably test stores too? Does this affect other memory operations, i.e. the atomic ones?

Changing that seems like a riskier change (I want this for 10.x) whereas copying over the alignment from the store/load seems straight-forward to me.

It's not completely obvious at first glance that the alloca is actually sufficiently aligned. But looking a little more carefully, it should be; we're preserving the alignment of the original alloca. So this is fine, I guess. LGTM

Please address @jfb's comment but otherwise LGTM.

This revision is now accepted and ready to land.Feb 27 2020, 12:06 PM

hans marked an inline comment as done.Feb 28 2020, 1:38 AM

hans added inline comments.

llvm/test/Transforms/SROA/alignment.ll
242	I don't think the volatile-only case is interesting. My code doesn't change anything for that case. I've made the test cover stores too. I'm not sure what you mean by "other memory operations, i.e. the atomic ones". This change affects exactly the atomic memory operations..

Closed by commit rGd48c981697a4: SROA: Don't drop atomic load/store alignments (PR45010) (authored by hans). · Explain WhyFeb 28 2020, 2:09 AM

This revision was automatically updated to reflect the committed changes.

I meant compare and exchange, as well as RMW.

In D75266#1898238, @jfb wrote:

I meant compare and exchange, as well as RMW.

It doesn't look like SROA handles cmpxchg or atomicrmw instructions.

In D75266#1900542, @hans wrote:

In D75266#1898238, @jfb wrote:

I meant compare and exchange, as well as RMW.

It doesn't look like SROA handles cmpxchg or atomicrmw instructions.

I know, but if it grows that capability then the author will thank you for adding a test that prevents them from regressing correctness.

Revision Contents

Path

Size

llvm/

lib/

Transforms/

Scalar/

SROA.cpp

4 lines

test/

Transforms/

SROA/

alignment.ll

15 lines

Diff 247190

llvm/lib/Transforms/Scalar/SROA.cpp

Show First 20 Lines • Show All 2,513 Lines • ▼ Show 20 Lines	if (VecTy) {
TargetTy->isIntegerTy()))) {		TargetTy->isIntegerTy()))) {
LoadInst *NewLI = IRB.CreateAlignedLoad(NewAI.getAllocatedType(), &NewAI,		LoadInst *NewLI = IRB.CreateAlignedLoad(NewAI.getAllocatedType(), &NewAI,
NewAI.getAlign(), LI.isVolatile(),		NewAI.getAlign(), LI.isVolatile(),
LI.getName());		LI.getName());
if (AATags)		if (AATags)
NewLI->setAAMetadata(AATags);		NewLI->setAAMetadata(AATags);
if (LI.isVolatile())		if (LI.isVolatile())
NewLI->setAtomic(LI.getOrdering(), LI.getSyncScopeID());		NewLI->setAtomic(LI.getOrdering(), LI.getSyncScopeID());
		if (NewLI->isAtomic())
		NewLI->setAlignment(LI.getAlign());
		gchateletUnsubmitted Done Reply Inline Actions `NewLI->setAlignment(LI.getAlign());` gchatelet: `NewLI->setAlignment(LI.getAlign());`

// Any !nonnull metadata or !range metadata on the old load is also valid		// Any !nonnull metadata or !range metadata on the old load is also valid
// on the new load. This is even true in some cases even when the loads		// on the new load. This is even true in some cases even when the loads
// are different types, for example by mapping !nonnull metadata to		// are different types, for example by mapping !nonnull metadata to
// !range metadata by modeling the null pointer constant converted to the		// !range metadata by modeling the null pointer constant converted to the
// integer type.		// integer type.
// FIXME: Add support for range metadata here. Currently the utilities		// FIXME: Add support for range metadata here. Currently the utilities
// for this don't propagate range metadata in trivial cases from one		// for this don't propagate range metadata in trivial cases from one
▲ Show 20 Lines • Show All 174 Lines • ▼ Show 20 Lines	if (NewBeginOffset == NewAllocaBeginOffset &&
SI.isVolatile());		SI.isVolatile());
}		}
NewSI->copyMetadata(SI, {LLVMContext::MD_mem_parallel_loop_access,		NewSI->copyMetadata(SI, {LLVMContext::MD_mem_parallel_loop_access,
LLVMContext::MD_access_group});		LLVMContext::MD_access_group});
if (AATags)		if (AATags)
NewSI->setAAMetadata(AATags);		NewSI->setAAMetadata(AATags);
if (SI.isVolatile())		if (SI.isVolatile())
NewSI->setAtomic(SI.getOrdering(), SI.getSyncScopeID());		NewSI->setAtomic(SI.getOrdering(), SI.getSyncScopeID());
		if (NewSI->isAtomic())
		NewSI->setAlignment(SI.getAlign());
		gchateletUnsubmitted Done Reply Inline Actions `NewSI->setAlignment(SI.getAlign());` gchatelet: `NewSI->setAlignment(SI.getAlign());`
Pass.DeadInsts.insert(&SI);		Pass.DeadInsts.insert(&SI);
deleteIfTriviallyDead(OldOp);		deleteIfTriviallyDead(OldOp);

LLVM_DEBUG(dbgs() << " to: " << *NewSI << "\n");		LLVM_DEBUG(dbgs() << " to: " << *NewSI << "\n");
return NewSI->getPointerOperand() == &NewAI && !SI.isVolatile();		return NewSI->getPointerOperand() == &NewAI && !SI.isVolatile();
}		}

/// Compute an integer value from splatting an i8 across the given		/// Compute an integer value from splatting an i8 across the given
▲ Show 20 Lines • Show All 1,937 Lines • Show Last 20 Lines

llvm/test/Transforms/SROA/alignment.ll

Show First 20 Lines • Show All 222 Lines • ▼ Show 20 Lines	; CHECK: load i16, {{.*}}, align 2

%ptr = alloca {i32, i8, i8, {i8, i16}}, align 2		%ptr = alloca {i32, i8, i8, {i8, i16}}, align 2
%ptr.8 = bitcast {i32, i8, i8, {i8, i16}}* %ptr to i8*		%ptr.8 = bitcast {i32, i8, i8, {i8, i16}}* %ptr to i8*
call void @populate(i8* %ptr.8)		call void @populate(i8* %ptr.8)
%val = load {i32, i8, i8, {i8, i16}}, {i32, i8, i8, {i8, i16}}* %ptr, align 2		%val = load {i32, i8, i8, {i8, i16}}, {i32, i8, i8, {i8, i16}}* %ptr, align 2
ret void		ret void
}		}

		%struct = type { i32, i32 }
		define dso_local i32 @pr45010(%struct* %A) {
		; CHECK-LABEL: @pr45010
		; CHECK: load atomic volatile i32, {{.*}}, align 4

		%B = alloca %struct, align 4
		%A.i = getelementptr inbounds %struct, %struct* %A, i32 0, i32 0
		%B.i = getelementptr inbounds %struct, %struct* %B, i32 0, i32 0
		%1 = load i32, i32* %A.i, align 4
		store atomic volatile i32 %1, i32* %B.i release, align 4
		%2 = bitcast %struct* %B to i32*
		%x = load atomic volatile i32, i32* %2 acquire, align 4
		jfbUnsubmitted Not Done Reply Inline Actions From the code, it looks like you want a test where the load is volatile only as well? Should probably test stores too? Does this affect other memory operations, i.e. the atomic ones? jfb: From the code, it looks like you want a test where the load is volatile only as well? Should…
		hansAuthorUnsubmitted Done Reply Inline Actions I don't think the volatile-only case is interesting. My code doesn't change anything for that case. I've made the test cover stores too. I'm not sure what you mean by "other memory operations, i.e. the atomic ones". This change affects exactly the atomic memory operations.. hans: I don't think the volatile-only case is interesting. My code doesn't change anything for that…
		ret i32 %x
		}

declare void @populate(i8*)		declare void @populate(i8*)