Page MenuHomePhabricator

[DAGCombiner] Fix load-store forwarding of indexed loads.

Authored by niravd on Nov 8 2018, 8:40 AM.



Handle extra output from index loads in cases where we wish to
forward a load value directly from a preceeding store.

Fixes PR39571.

Diff Detail


Event Timeline

niravd created this revision.Nov 8 2018, 8:40 AM

Thanks very much for the quick update. I can confirm that the original test file compiles successfully. I'll take a look at the change tomorrow (need to leave office).

This is looking good. I've got one style nit that I'll leave to your discretion. I think it is probably worth tidying up the test case a little more though. I've made a proposal in the comment.

12865 ↗(On Diff #173171)

Taking up one of the comments from the original review for D49200. The use of auto could be considered a bit excessive here.

For example use bool for IsSub, ISD::NodeType for Opc and SDValue for Idx. I don't have a strong opinion about this personally but it might be more consistent with the other code in the file.

1 ↗(On Diff #173171)

I've made an attempt to strip down the test case a bit. I've also upped the target Arm architecture to a more recent one. It seems like the original test passed with armv7a due to the availability of unaligned access, but it will fail if -mno-unaligned-access is used.

I've changed the variable names to remove the association with the original program.

; RUN: llc < %s -mtriple armv7a-unknown-linux-gnueabi -mattr=+strict-align

; Avoid crash from forwarding indexed-loads back to store.
%struct.anon = type {*, %struct.mb } = type { i8 }
%struct.mb = type { i8, i8 }
%struct.anon.0 = type { %struct.anon.1 }
%struct.anon.1 = type { %struct.ds }
%struct.ds = type <{ i8, }> = type { %struct.ib }
%struct.ib = type { i8, i8, i16 }

@a = common dso_local local_unnamed_addr global %struct.anon* null, align 4
@b = common dso_local local_unnamed_addr global %struct.anon.0 zeroinitializer, align 1

; Function Attrs: norecurse nounwind
define dso_local void @func() local_unnamed_addr {
  %0 = load %struct.anon*, %struct.anon** @a, align 4
  %ad = getelementptr inbounds %struct.anon, %struct.anon* %0, i32 0, i32 0
  %1 = load*,** %ad, align 4
  %c.sroa.0.0..sroa_idx = getelementptr inbounds,* %1, i32 0, i32 0
  %c.sroa.0.0.copyload = load i8, i8* %c.sroa.0.0..sroa_idx, align 1
  %cb = getelementptr inbounds %struct.anon, %struct.anon* %0, i32 0, i32 1
  %band = getelementptr inbounds %struct.anon, %struct.anon* %0, i32 0, i32 1, i32 1
  store i8 %c.sroa.0.0.copyload, i8* %band, align 4
  store i8 6, i8* getelementptr inbounds (%struct.anon.0, %struct.anon.0* @b, i32 0, i32 0, i32 0, i32 1, i32 0, i32 0), align 1
  store i8 2, i8* getelementptr inbounds (%struct.anon.0, %struct.anon.0* @b, i32 0, i32 0, i32 0, i32 1, i32 0, i32 1), align 1
  %2 = bitcast %struct.mb* %cb to i32*
  %3 = load i32, i32* bitcast (i8* getelementptr inbounds (%struct.anon.0, %struct.anon.0* @b, i32 0, i32 0, i32 0, i32 1, i32 0, i32 0) to i32*), align 1
  store i32 %3, i32* %2, align 1
  ret void
niravd updated this revision to Diff 173420.Nov 9 2018, 12:43 PM

Fold in Peter's comments.

niravd marked 2 inline comments as done.Nov 9 2018, 12:44 PM
peter.smith accepted this revision.Nov 12 2018, 3:20 AM

Thanks for the update. Looks good to me.

This revision is now accepted and ready to land.Nov 12 2018, 3:20 AM
This revision was automatically updated to reflect the committed changes.