Download Raw Diff

Details

Reviewers

reames
fhahn

Commits

rG4277d932ef18: [LV] Use speculatability within entire loop to avoid strided load predication

Summary

Use existing functionality for identifying total access size by strided
loads. If we can speculate the load across all vector iterations, we can
avoid predication for these strided loads (or masked gathers in
architectures which support it).

Diff Detail

Repository: rG LLVM Github Monorepo

Unit TestsFailed

	Time	Test
	190 ms	x64 debian > LLVM.Transforms/LoopVectorize/X86::load-deref-pred.ll

Event Timeline

anna created this revision.Mar 8 2023, 2:26 PM

Herald added a project: Restricted Project. · View Herald TranscriptMar 8 2023, 2:26 PM

Herald added a subscriber: hiraditya. · View Herald Transcript

anna requested review of this revision.Mar 8 2023, 2:26 PM

Herald added a project: Restricted Project. · View Herald TranscriptMar 8 2023, 2:26 PM

Herald added subscribers: llvm-commits, • pcwang-thead. · View Herald Transcript

anna mentioned this in rGac4c0ea73b11: [Tests] Precommit tests for D145616.Mar 8 2023, 2:31 PM

Harbormaster completed remote builds in B218215: Diff 503523.Mar 8 2023, 3:59 PM

updated tests

anna edited the summary of this revision. (Show Details)Mar 9 2023, 8:33 AM

anna added reviewers: reames, fhahn.

Herald added a subscriber: StephenFan. · View Herald TranscriptMar 9 2023, 8:33 AM

AFAICT, the only thing which prevented us from figuring out dereferencability for strided loads (i.e. accesses with gaps) was identifying the correct AccessSize. So, that's basically the patch.

llvm/test/Transforms/LoopVectorize/X86/load-deref-pred.ll
1023 ↗	(On Diff #503790)	One thing I noticed is that we drop the `inbounds` on GEPs when we converted the masked loads to unmasked versions (perhaps because we cannot prove if the `inbounds` is correct without the predication?). We do not do the same "dropping of inbounds" when we removed predication for the strided case. Any idea why is that? It looks like we should be dropping on the strided case, but I don't know the LV code well enough to see where this is done and what is missing.

reames added inline comments.Mar 9 2023, 9:44 AM

llvm/lib/Analysis/Loads.cpp
294–304	Ignore is confusing here. "ignore" sounds like we might have a latent correctness issue here. What I think you mean is that we're being conservative on overlapping accesses. Also, your TODO doesn't sound right to me. You'd want something along the lines of TC * Step + EltSize - Step.

anna added inline comments.Mar 9 2023, 10:02 AM

llvm/lib/Analysis/Loads.cpp
294–304	Good catch. TC * max(Step, EltSize) gets extra bytes without accounting for overlapping access.

Harbormaster completed remote builds in B218410: Diff 503790.Mar 9 2023, 10:23 AM

updated comment.

anna marked an inline comment as done.Mar 15 2023, 7:43 AM

Harbormaster completed remote builds in B219632: Diff 505486.Mar 15 2023, 9:17 AM

LGTM w/minor comment

llvm/test/Transforms/LoopVectorize/X86/load-deref-pred.ll
1158 ↗	(On Diff #505486)	Remove TODO

This revision is now accepted and ready to land.Mar 15 2023, 2:41 PM

LGTM with the TODO in the test removed, thanks!

anna marked an inline comment as done.Mar 21 2023, 7:32 AM

anna added inline comments.

llvm/test/Transforms/LoopVectorize/X86/load-deref-pred.ll
1023 ↗	(On Diff #503790)	Just to loop back on this: I did some digging into history of where this inbounds drop was introduced. It was here: https://reviews.llvm.org/D111846. Also, there is a specific comment stating we do not need to drop inbounds (and other poison generating flags) when the original instructions are gather/scatter. If backends convert the gather/scatter into use "base + offsets", those backends need fixing (just paraphrasing from the comment here: https://reviews.llvm.org/D111846#3098547). So, I'll go ahead and land this change.

removed todo

This revision was landed with ongoing or failed builds.Mar 21 2023, 9:08 AM

Closed by commit rG4277d932ef18: [LV] Use speculatability within entire loop to avoid strided load predication (authored by anna). · Explain Why

This revision was automatically updated to reflect the committed changes.

anna added a commit: rG4277d932ef18: [LV] Use speculatability within entire loop to avoid strided load predication.

Harbormaster completed remote builds in B220721: Diff 506975.Mar 21 2023, 9:46 AM

This is an archive of the discontinued LLVM Phabricator instance.

[LV] Use speculatability within entire loop to avoid strided load predication
ClosedPublic

Details

Diff Detail

Unit TestsFailed

Event Timeline

Revision Contents

Diff 503523

llvm/lib/Analysis/Loads.cpp

This is an archive of the discontinued LLVM Phabricator instance.

[LV] Use speculatability within entire loop to avoid strided load predicationClosedPublic

Details

Diff Detail

Unit TestsFailed

Event Timeline

Revision Contents

Diff 503523

llvm/lib/Analysis/Loads.cpp

[LV] Use speculatability within entire loop to avoid strided load predication
ClosedPublic