Download Raw Diff

Details

Reviewers

gchatelet
spatel
lebedev.ri
hans
hfinkel
bjope

Commits

rG15488ff24b4a: [CodeGen] Fix the computation of the alignment of split stores.

Summary

Right now the alignment of the lower half of a store is computed as
align/2, which fails for unaligned stores (align = 1), and is overly
pessimitic for, e.g. a 8 byte store aligned to 4 bytes.
Fixes PR44851

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

courbet created this revision.Feb 10 2020, 5:33 AM

Herald added a project: Restricted Project. · View Herald TranscriptFeb 10 2020, 5:33 AM

Herald added a subscriber: hiraditya. · View Herald Transcript

Harbormaster completed remote builds in B46074: Diff 243521.Feb 10 2020, 5:34 AM

remove dplicate test

Harbormaster completed remote builds in B46075: Diff 243522.Feb 10 2020, 5:38 AM

gchatelet added inline comments.Feb 10 2020, 5:47 AM

llvm/lib/CodeGen/CodeGenPrepare.cpp
6873	Can you reuse `Alignment` here?

lebedev.ri added inline comments.Feb 10 2020, 5:50 AM

llvm/lib/CodeGen/CodeGenPrepare.cpp
6874	On a first read, this is very confusing. When splitting store in half, naturally one half will retain the alignment of the original wider store, regardless of whether it was over-aligned or not, while other will require adjustment.
6876	Can the no-longer-pessimism of the computation be demonstrated by the tests?

use opt test to explicitly show align

Add a test with align=2 to show "no-longer-pessimism".

Harbormaster completed remote builds in B46077: Diff 243524.Feb 10 2020, 6:02 AM

Harbormaster completed remote builds in B46078: Diff 243526.

Improve documentation as suggested during the review.

Harbormaster completed remote builds in B46079: Diff 243529.Feb 10 2020, 6:03 AM

Thanks

lebedev.ri added inline comments.Feb 10 2020, 6:14 AM

llvm/test/CodeGen/X86/split-store-unaligned.ll
30 ↗	(On Diff #243529)	Hmm. Could you please add a similar test (but with wide store being 64-bit aligned) with big-endian triple please.

rename test file

Harbormaster completed remote builds in B46081: Diff 243538.Feb 10 2020, 6:21 AM

add PowerPC test

Herald added subscribers: jsji, nemanjai. · View Herald TranscriptFeb 10 2020, 6:49 AM

Harbormaster completed remote builds in B46084: Diff 243549.Feb 10 2020, 6:53 AM

Herald added a subscriber: • wuzish. · View Herald TranscriptFeb 10 2020, 6:53 AM

LG to me.

llvm/test/CodeGen/PowerPC/split-store-alignment.ll

2–3 ↗

(On Diff #243549)

Please unique checklines

; RUN: opt -S -mtriple=powerpc64le -codegenprepare -force-split-store < %s  | FileCheck --check-prefixes=ALL,PPC64LE %s
; RUN: opt -S -mtriple=powerpc64 -codegenprepare -force-split-store < %s  | FileCheck --check-prefixes=ALL,PPC64 %s

101–102 ↗

(On Diff #243549)

Okay, i guess there is no big-endian issue after all.

This revision is now accepted and ready to land.Feb 10 2020, 7:10 AM

unique CHECK lines

Harbormaster completed remote builds in B46089: Diff 243565.Feb 10 2020, 8:01 AM

spatel added inline comments.Feb 10 2020, 8:05 AM

llvm/test/CodeGen/PowerPC/split-store-alignment.ll
101–102 ↗	(On Diff #243549)	Wait - why is there no difference between LE and BE here? We may need to explicitly specify the datalayout on the command-line or in the test file.

CGP is not creating an appropriate default datalayout based on triple, so we need to specify it explicitly:

$ opt -S splitstore.ll -mtriple=powerpc64 -data-layout="E" -force-split-store -codegenprepare
; ModuleID = 'splitstore.ll'
source_filename = "splitstore.ll"
target datalayout = "E"
target triple = "powerpc64"

define void @split_store_align8(i32 %x, i64* %p) {
  %z = zext i32 43 to i64
  %s = shl nuw nsw i64 %z, 32
  %z2 = zext i32 %x to i64
  %o = or i64 %s, %z2
  %1 = bitcast i64* %p to i32*
  %2 = getelementptr i32, i32* %1, i32 1
  store i32 %x, i32* %2, align 8
  %3 = bitcast i64* %p to i32*
  store i32 43, i32* %3, align 4
  ret void
}

$ opt -S splitstore.ll -mtriple=powerpc64le -data-layout="e" -force-split-store -codegenprepare
; ModuleID = 'splitstore.ll'
source_filename = "splitstore.ll"
target datalayout = "e"
target triple = "powerpc64le"

define void @split_store_align8(i32 %x, i64* %p) {
  %z = zext i32 43 to i64
  %s = shl nuw nsw i64 %z, 32
  %z2 = zext i32 %x to i64
  %o = or i64 %s, %z2
  %1 = bitcast i64* %p to i32*
  store i32 %x, i32* %1, align 8
  %2 = bitcast i64* %p to i32*
  %3 = getelementptr i32, i32* %2, i32 1
  store i32 43, i32* %3, align 4
  ret void
}

This revision now requires changes to proceed.Feb 10 2020, 8:12 AM

Add target triple in module in tests, which shows the brokenness of alignment
for BE targets. Fit it.

CGP is not creating an appropriate default datalayout based on triple, so we need to specify it explicitly:

Thanks !

Harbormaster completed remote builds in B46181: Diff 243750.Feb 11 2020, 12:00 AM

LG for real now :)

llvm/test/CodeGen/PowerPC/split-store-alignment.ll
101–102 ↗	(On Diff #243549)	Oops, thanks for catching that. So there was an issue after all.

LGTM - see inline for a comment nit and some potential test changes.

llvm/lib/CodeGen/CodeGenPrepare.cpp
6874	The suggested "one half" is better than the current "lower half" - eliminate the endian-specific inaccuracy.
llvm/test/CodeGen/PowerPC/split-store-alignment-le.ll
2 ↗	(On Diff #243750)	Unless I'm missing some advantage of the way of the tests are arranged currently, I prefer that: The tests live under llvm/test/Transforms/CodeGenPrepare/<target>. I know there are exceptions, but that is where I first look for IR --> IR tests. If we include the layout specifiers as parameters in the RUN line, then it saves some duplication, and it's easier to spot the endian differences. If we can push the non-crashing tests as a preliminary commit, that would be nicer, so we can see the current buggy code (and the tests will remain just in case this patch ever gets reverted).

This revision is now accepted and ready to land.Feb 11 2020, 10:19 AM

We probably want to backport the big-endian fix into 10.0 in some form?
Please do file a bug

rebase on submitted base tests.

Harbormaster completed remote builds in B46306: Diff 244088.Feb 12 2020, 12:52 AM

Thanks for the review !

Closed by commit rG15488ff24b4a: [CodeGen] Fix the computation of the alignment of split stores. (authored by courbet). · Explain WhyFeb 12 2020, 1:38 AM

This revision was automatically updated to reflect the committed changes.

In D74311#1870555, @lebedev.ri wrote:

We probably want to backport the big-endian fix into 10.0 in some form?
Please do file a bug

Was there a bug filed for this yet?

In D74311#1871706, @hans wrote:

In D74311#1870555, @lebedev.ri wrote:

We probably want to backport the big-endian fix into 10.0 in some form?
Please do file a bug

Was there a bug filed for this yet?

#44877

In D74311#1871709, @lebedev.ri wrote:

In D74311#1871706, @hans wrote:

In D74311#1870555, @lebedev.ri wrote:

We probably want to backport the big-endian fix into 10.0 in some form?
Please do file a bug

Was there a bug filed for this yet?

#44877

Thanks! I'll follow up there.

Diff 244102

llvm/lib/CodeGen/CodeGenPrepare.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 6,858 Lines • ▼ Show 20 Lines	if (HBC && HBC->getParent() != SI.getParent())
HValue = Builder.CreateBitCast(HBC->getOperand(0), HBC->getType());		HValue = Builder.CreateBitCast(HBC->getOperand(0), HBC->getType());

bool IsLE = SI.getModule()->getDataLayout().isLittleEndian();		bool IsLE = SI.getModule()->getDataLayout().isLittleEndian();
auto CreateSplitStore = [&](Value *V, bool Upper) {		auto CreateSplitStore = [&](Value *V, bool Upper) {
V = Builder.CreateZExtOrBitCast(V, SplitStoreType);		V = Builder.CreateZExtOrBitCast(V, SplitStoreType);
Value *Addr = Builder.CreateBitCast(		Value *Addr = Builder.CreateBitCast(
SI.getOperand(1),		SI.getOperand(1),
SplitStoreType->getPointerTo(SI.getPointerAddressSpace()));		SplitStoreType->getPointerTo(SI.getPointerAddressSpace()));
if ((IsLE && Upper) \|\| (!IsLE && !Upper))		const bool IsOffsetStore = (IsLE && Upper) \|\| (!IsLE && !Upper);
		if (IsOffsetStore)
Addr = Builder.CreateGEP(		Addr = Builder.CreateGEP(
SplitStoreType, Addr,		SplitStoreType, Addr,
ConstantInt::get(Type::getInt32Ty(SI.getContext()), 1));		ConstantInt::get(Type::getInt32Ty(SI.getContext()), 1));
Builder.CreateAlignedStore(V, Addr,		MaybeAlign Alignment = SI.getAlign();
Upper ? SI.getAlign() / 2 : SI.getAlign());		if (IsOffsetStore && Alignment) {
		gchateletUnsubmitted Done Reply Inline Actions Can you reuse `Alignment` here? gchatelet: Can you reuse `Alignment` here?
		// When splitting the store in half, naturally one half will retain the
		lebedev.riUnsubmitted Done Reply Inline Actions On a first read, this is very confusing. When splitting store in half, naturally one half will retain the alignment of the original wider store, regardless of whether it was over-aligned or not, while other will require adjustment. lebedev.ri: On a first read, this is very confusing. When splitting store in half, naturally one half will…
		spatelUnsubmitted Not Done Reply Inline Actions The suggested "one half" is better than the current "lower half" - eliminate the endian-specific inaccuracy. spatel: The suggested "one half" is better than the current "lower half" - eliminate the endian…
		// alignment of the original wider store, regardless of whether it was
		// over-aligned or not, while the other will require adjustment.
		lebedev.riUnsubmitted Done Reply Inline Actions Can the no-longer-pessimism of the computation be demonstrated by the tests? lebedev.ri: Can the no-longer-pessimism of the computation be demonstrated by the tests?
		Alignment = commonAlignment(Alignment, HalfValBitSize / 8);
		}
		Builder.CreateAlignedStore(V, Addr, Alignment);
};		};

CreateSplitStore(LValue, false);		CreateSplitStore(LValue, false);
CreateSplitStore(HValue, true);		CreateSplitStore(HValue, true);

// Delete the old store.		// Delete the old store.
SI.eraseFromParent();		SI.eraseFromParent();
return true;		return true;
▲ Show 20 Lines • Show All 612 Lines • Show Last 20 Lines

llvm/test/Transforms/CodeGenPrepare/PowerPC/split-store-alignment.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt -S -codegenprepare -mtriple=powerpc64-unknown-linux-gnu -data-layout="E-m:e-i64:64-n32:64" -force-split-store < %s \| FileCheck --check-prefixes=ALL,BE %s			; RUN: opt -S -codegenprepare -mtriple=powerpc64-unknown-linux-gnu -data-layout="E-m:e-i64:64-n32:64" -force-split-store < %s \| FileCheck --check-prefixes=ALL,BE %s
	; RUN: opt -S -codegenprepare -mtriple=powerpc64le-unknown-linux-gnu -data-layout="e-m:e-i64:64-n32:64" -force-split-store < %s \| FileCheck --check-prefixes=ALL,LE %s			; RUN: opt -S -codegenprepare -mtriple=powerpc64le-unknown-linux-gnu -data-layout="e-m:e-i64:64-n32:64" -force-split-store < %s \| FileCheck --check-prefixes=ALL,LE %s

				define void @split_store_align1(float %x, i64* %p) {
				; BE-LABEL: @split_store_align1(
				; BE-NEXT: [[B:%.]] = bitcast float [[X:%.]] to i32
				; BE-NEXT: [[Z:%.*]] = zext i32 0 to i64
				; BE-NEXT: [[S:%.*]] = shl nuw nsw i64 [[Z]], 32
				; BE-NEXT: [[Z2:%.*]] = zext i32 [[B]] to i64
				; BE-NEXT: [[O:%.*]] = or i64 [[S]], [[Z2]]
				; BE-NEXT: [[TMP1:%.]] = bitcast i64 [[P:%.]] to i32
				; BE-NEXT: [[TMP2:%.]] = getelementptr i32, i32 [[TMP1]], i32 1
				; BE-NEXT: store i32 [[B]], i32* [[TMP2]], align 1
				; BE-NEXT: [[TMP3:%.]] = bitcast i64 [[P]] to i32*
				; BE-NEXT: store i32 0, i32* [[TMP3]], align 1
				; BE-NEXT: ret void
				;
				; LE-LABEL: @split_store_align1(
				; LE-NEXT: [[B:%.]] = bitcast float [[X:%.]] to i32
				; LE-NEXT: [[Z:%.*]] = zext i32 0 to i64
				; LE-NEXT: [[S:%.*]] = shl nuw nsw i64 [[Z]], 32
				; LE-NEXT: [[Z2:%.*]] = zext i32 [[B]] to i64
				; LE-NEXT: [[O:%.*]] = or i64 [[S]], [[Z2]]
				; LE-NEXT: [[TMP1:%.]] = bitcast i64 [[P:%.]] to i32
				; LE-NEXT: store i32 [[B]], i32* [[TMP1]], align 1
				; LE-NEXT: [[TMP2:%.]] = bitcast i64 [[P]] to i32*
				; LE-NEXT: [[TMP3:%.]] = getelementptr i32, i32 [[TMP2]], i32 1
				; LE-NEXT: store i32 0, i32* [[TMP3]], align 1
				; LE-NEXT: ret void
				;
				%b = bitcast float %x to i32
				%z = zext i32 0 to i64
				%s = shl nuw nsw i64 %z, 32
				%z2 = zext i32 %b to i64
				%o = or i64 %s, %z2
				store i64 %o, i64* %p, align 1
				ret void
				}

	define void @split_store_align2(float %x, i64* %p) {			define void @split_store_align2(float %x, i64* %p) {
	; BE-LABEL: @split_store_align2(			; BE-LABEL: @split_store_align2(
	; BE-NEXT: [[B:%.]] = bitcast float [[X:%.]] to i32			; BE-NEXT: [[B:%.]] = bitcast float [[X:%.]] to i32
	; BE-NEXT: [[Z:%.*]] = zext i32 0 to i64			; BE-NEXT: [[Z:%.*]] = zext i32 0 to i64
	; BE-NEXT: [[S:%.*]] = shl nuw nsw i64 [[Z]], 32			; BE-NEXT: [[S:%.*]] = shl nuw nsw i64 [[Z]], 32
	; BE-NEXT: [[Z2:%.*]] = zext i32 [[B]] to i64			; BE-NEXT: [[Z2:%.*]] = zext i32 [[B]] to i64
	; BE-NEXT: [[O:%.*]] = or i64 [[S]], [[Z2]]			; BE-NEXT: [[O:%.*]] = or i64 [[S]], [[Z2]]
	; BE-NEXT: [[TMP1:%.]] = bitcast i64 [[P:%.]] to i32			; BE-NEXT: [[TMP1:%.]] = bitcast i64 [[P:%.]] to i32
	; BE-NEXT: [[TMP2:%.]] = getelementptr i32, i32 [[TMP1]], i32 1			; BE-NEXT: [[TMP2:%.]] = getelementptr i32, i32 [[TMP1]], i32 1
	; BE-NEXT: store i32 [[B]], i32* [[TMP2]], align 2			; BE-NEXT: store i32 [[B]], i32* [[TMP2]], align 2
	; BE-NEXT: [[TMP3:%.]] = bitcast i64 [[P]] to i32*			; BE-NEXT: [[TMP3:%.]] = bitcast i64 [[P]] to i32*
	; BE-NEXT: store i32 0, i32* [[TMP3]], align 1			; BE-NEXT: store i32 0, i32* [[TMP3]], align 2
	; BE-NEXT: ret void			; BE-NEXT: ret void
	;			;
	; LE-LABEL: @split_store_align2(			; LE-LABEL: @split_store_align2(
	; LE-NEXT: [[B:%.]] = bitcast float [[X:%.]] to i32			; LE-NEXT: [[B:%.]] = bitcast float [[X:%.]] to i32
	; LE-NEXT: [[Z:%.*]] = zext i32 0 to i64			; LE-NEXT: [[Z:%.*]] = zext i32 0 to i64
	; LE-NEXT: [[S:%.*]] = shl nuw nsw i64 [[Z]], 32			; LE-NEXT: [[S:%.*]] = shl nuw nsw i64 [[Z]], 32
	; LE-NEXT: [[Z2:%.*]] = zext i32 [[B]] to i64			; LE-NEXT: [[Z2:%.*]] = zext i32 [[B]] to i64
	; LE-NEXT: [[O:%.*]] = or i64 [[S]], [[Z2]]			; LE-NEXT: [[O:%.*]] = or i64 [[S]], [[Z2]]
	; LE-NEXT: [[TMP1:%.]] = bitcast i64 [[P:%.]] to i32			; LE-NEXT: [[TMP1:%.]] = bitcast i64 [[P:%.]] to i32
	; LE-NEXT: store i32 [[B]], i32* [[TMP1]], align 2			; LE-NEXT: store i32 [[B]], i32* [[TMP1]], align 2
	; LE-NEXT: [[TMP2:%.]] = bitcast i64 [[P]] to i32*			; LE-NEXT: [[TMP2:%.]] = bitcast i64 [[P]] to i32*
	; LE-NEXT: [[TMP3:%.]] = getelementptr i32, i32 [[TMP2]], i32 1			; LE-NEXT: [[TMP3:%.]] = getelementptr i32, i32 [[TMP2]], i32 1
	; LE-NEXT: store i32 0, i32* [[TMP3]], align 1			; LE-NEXT: store i32 0, i32* [[TMP3]], align 2
	; LE-NEXT: ret void			; LE-NEXT: ret void
	;			;
	%b = bitcast float %x to i32			%b = bitcast float %x to i32
	%z = zext i32 0 to i64			%z = zext i32 0 to i64
	%s = shl nuw nsw i64 %z, 32			%s = shl nuw nsw i64 %z, 32
	%z2 = zext i32 %b to i64			%z2 = zext i32 %b to i64
	%o = or i64 %s, %z2			%o = or i64 %s, %z2
	store i64 %o, i64* %p, align 2			store i64 %o, i64* %p, align 2
	ret void			ret void
	}			}

	define void @split_store_align8(float %x, i64* %p) {			define void @split_store_align8(float %x, i64* %p) {
	; BE-LABEL: @split_store_align8(			; BE-LABEL: @split_store_align8(
	; BE-NEXT: [[B:%.]] = bitcast float [[X:%.]] to i32			; BE-NEXT: [[B:%.]] = bitcast float [[X:%.]] to i32
	; BE-NEXT: [[Z:%.*]] = zext i32 0 to i64			; BE-NEXT: [[Z:%.*]] = zext i32 0 to i64
	; BE-NEXT: [[S:%.*]] = shl nuw nsw i64 [[Z]], 32			; BE-NEXT: [[S:%.*]] = shl nuw nsw i64 [[Z]], 32
	; BE-NEXT: [[Z2:%.*]] = zext i32 [[B]] to i64			; BE-NEXT: [[Z2:%.*]] = zext i32 [[B]] to i64
	; BE-NEXT: [[O:%.*]] = or i64 [[S]], [[Z2]]			; BE-NEXT: [[O:%.*]] = or i64 [[S]], [[Z2]]
	; BE-NEXT: [[TMP1:%.]] = bitcast i64 [[P:%.]] to i32			; BE-NEXT: [[TMP1:%.]] = bitcast i64 [[P:%.]] to i32
	; BE-NEXT: [[TMP2:%.]] = getelementptr i32, i32 [[TMP1]], i32 1			; BE-NEXT: [[TMP2:%.]] = getelementptr i32, i32 [[TMP1]], i32 1
	; BE-NEXT: store i32 [[B]], i32* [[TMP2]], align 8			; BE-NEXT: store i32 [[B]], i32* [[TMP2]], align 4
	; BE-NEXT: [[TMP3:%.]] = bitcast i64 [[P]] to i32*			; BE-NEXT: [[TMP3:%.]] = bitcast i64 [[P]] to i32*
	; BE-NEXT: store i32 0, i32* [[TMP3]], align 4			; BE-NEXT: store i32 0, i32* [[TMP3]], align 8
	; BE-NEXT: ret void			; BE-NEXT: ret void
	;			;
	; LE-LABEL: @split_store_align8(			; LE-LABEL: @split_store_align8(
	; LE-NEXT: [[B:%.]] = bitcast float [[X:%.]] to i32			; LE-NEXT: [[B:%.]] = bitcast float [[X:%.]] to i32
	; LE-NEXT: [[Z:%.*]] = zext i32 0 to i64			; LE-NEXT: [[Z:%.*]] = zext i32 0 to i64
	; LE-NEXT: [[S:%.*]] = shl nuw nsw i64 [[Z]], 32			; LE-NEXT: [[S:%.*]] = shl nuw nsw i64 [[Z]], 32
	; LE-NEXT: [[Z2:%.*]] = zext i32 [[B]] to i64			; LE-NEXT: [[Z2:%.*]] = zext i32 [[B]] to i64
	; LE-NEXT: [[O:%.*]] = or i64 [[S]], [[Z2]]			; LE-NEXT: [[O:%.*]] = or i64 [[S]], [[Z2]]
	Show All 15 Lines

llvm/test/Transforms/CodeGenPrepare/X86/split-store-alignment.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt -codegenprepare -mtriple=x86_64-unknown-unknown -force-split-store -S < %s \| FileCheck %s			; RUN: opt -codegenprepare -mtriple=x86_64-unknown-unknown -force-split-store -S < %s \| FileCheck %s

	target datalayout = "e-m:x-p:32:32-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:32-n8:16:32-a:0:32-S32"			target datalayout = "e-m:x-p:32:32-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:32-n8:16:32-a:0:32-S32"
	target triple = "i686-w64-windows-gnu"			target triple = "i686-w64-windows-gnu"

				define void @split_store_align1(float %x, i64* %p) {
				; CHECK-LABEL: @split_store_align1(
				; CHECK-NEXT: [[B:%.]] = bitcast float [[X:%.]] to i32
				; CHECK-NEXT: [[Z:%.*]] = zext i32 0 to i64
				; CHECK-NEXT: [[S:%.*]] = shl nuw nsw i64 [[Z]], 32
				; CHECK-NEXT: [[Z2:%.*]] = zext i32 [[B]] to i64
				; CHECK-NEXT: [[O:%.*]] = or i64 [[S]], [[Z2]]
				; CHECK-NEXT: [[TMP1:%.]] = bitcast i64 [[P:%.]] to i32
				; CHECK-NEXT: store i32 [[B]], i32* [[TMP1]], align 1
				; CHECK-NEXT: [[TMP2:%.]] = bitcast i64 [[P]] to i32*
				; CHECK-NEXT: [[TMP3:%.]] = getelementptr i32, i32 [[TMP2]], i32 1
				; CHECK-NEXT: store i32 0, i32* [[TMP3]], align 1
				; CHECK-NEXT: ret void
				;
				%b = bitcast float %x to i32
				%z = zext i32 0 to i64
				%s = shl nuw nsw i64 %z, 32
				%z2 = zext i32 %b to i64
				%o = or i64 %s, %z2
				store i64 %o, i64* %p, align 1
				ret void
				}

	define void @split_store_align2(float %x, i64* %p) {			define void @split_store_align2(float %x, i64* %p) {
	; CHECK-LABEL: @split_store_align2(			; CHECK-LABEL: @split_store_align2(
	; CHECK-NEXT: [[B:%.]] = bitcast float [[X:%.]] to i32			; CHECK-NEXT: [[B:%.]] = bitcast float [[X:%.]] to i32
	; CHECK-NEXT: [[Z:%.*]] = zext i32 0 to i64			; CHECK-NEXT: [[Z:%.*]] = zext i32 0 to i64
	; CHECK-NEXT: [[S:%.*]] = shl nuw nsw i64 [[Z]], 32			; CHECK-NEXT: [[S:%.*]] = shl nuw nsw i64 [[Z]], 32
	; CHECK-NEXT: [[Z2:%.*]] = zext i32 [[B]] to i64			; CHECK-NEXT: [[Z2:%.*]] = zext i32 [[B]] to i64
	; CHECK-NEXT: [[O:%.*]] = or i64 [[S]], [[Z2]]			; CHECK-NEXT: [[O:%.*]] = or i64 [[S]], [[Z2]]
	; CHECK-NEXT: [[TMP1:%.]] = bitcast i64 [[P:%.]] to i32			; CHECK-NEXT: [[TMP1:%.]] = bitcast i64 [[P:%.]] to i32
	; CHECK-NEXT: store i32 [[B]], i32* [[TMP1]], align 2			; CHECK-NEXT: store i32 [[B]], i32* [[TMP1]], align 2
	; CHECK-NEXT: [[TMP2:%.]] = bitcast i64 [[P]] to i32*			; CHECK-NEXT: [[TMP2:%.]] = bitcast i64 [[P]] to i32*
	; CHECK-NEXT: [[TMP3:%.]] = getelementptr i32, i32 [[TMP2]], i32 1			; CHECK-NEXT: [[TMP3:%.]] = getelementptr i32, i32 [[TMP2]], i32 1
	; CHECK-NEXT: store i32 0, i32* [[TMP3]], align 1			; CHECK-NEXT: store i32 0, i32* [[TMP3]], align 2
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	%b = bitcast float %x to i32			%b = bitcast float %x to i32
	%z = zext i32 0 to i64			%z = zext i32 0 to i64
	%s = shl nuw nsw i64 %z, 32			%s = shl nuw nsw i64 %z, 32
	%z2 = zext i32 %b to i64			%z2 = zext i32 %b to i64
	%o = or i64 %s, %z2			%o = or i64 %s, %z2
	store i64 %o, i64* %p, align 2			store i64 %o, i64* %p, align 2
	Show All 25 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[CodeGen] Fix the computation of the alignment of split stores.
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 244102

llvm/lib/CodeGen/CodeGenPrepare.cpp

llvm/test/Transforms/CodeGenPrepare/PowerPC/split-store-alignment.ll

llvm/test/Transforms/CodeGenPrepare/X86/split-store-alignment.ll

This is an archive of the discontinued LLVM Phabricator instance.

[CodeGen] Fix the computation of the alignment of split stores.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 244102

llvm/lib/CodeGen/CodeGenPrepare.cpp

llvm/test/Transforms/CodeGenPrepare/PowerPC/split-store-alignment.ll

llvm/test/Transforms/CodeGenPrepare/X86/split-store-alignment.ll

[CodeGen] Fix the computation of the alignment of split stores.
ClosedPublic