dorit (Dorit Nuzman)
User

Projects

User does not belong to any projects.

User Details

User Since
Dec 20 2015, 4:54 AM (404 w, 4 d)

Recent Activity

Aug 8 2021

dorit committed rG67278b8a908e: [LV] Support Interleaved Store Group With Gaps (authored by dorit).
[LV] Support Interleaved Store Group With Gaps
Aug 8 2021, 1:26 AM
dorit closed D104750: [LV] Support Interleaved Store Group With Gaps.
Aug 8 2021, 1:26 AM · Restricted Project
dorit updated the diff for D104750: [LV] Support Interleaved Store Group With Gaps.

addressed clang-format issue

Aug 8 2021, 12:35 AM · Restricted Project

Aug 7 2021

dorit updated the diff for D104750: [LV] Support Interleaved Store Group With Gaps.

Addressed tidy warnings

Aug 7 2021, 11:46 PM · Restricted Project

Aug 5 2021

dorit updated the diff for D104750: [LV] Support Interleaved Store Group With Gaps.

rebased

Aug 5 2021, 5:31 AM · Restricted Project

Jun 28 2021

dorit added a comment to D104750: [LV] Support Interleaved Store Group With Gaps.

Will get back to pushing this patch in a few weeks (unfortunately didn't make it before going away on vacation).

Jun 28 2021, 3:18 AM · Restricted Project

Jun 27 2021

dorit updated the diff for D104750: [LV] Support Interleaved Store Group With Gaps.

updated a comment, as pointed out in the review

Jun 27 2021, 7:02 AM · Restricted Project

Jun 24 2021

dorit added a comment to D104750: [LV] Support Interleaved Store Group With Gaps.

Thanks, Ayal! Incorporated all your comments.

Jun 24 2021, 6:33 AM · Restricted Project
dorit updated the diff for D104750: [LV] Support Interleaved Store Group With Gaps.

Addressed review comments.

Jun 24 2021, 6:31 AM · Restricted Project

Jun 23 2021

dorit added a reviewer for D104750: [LV] Support Interleaved Store Group With Gaps: gilr.
Jun 23 2021, 3:19 AM · Restricted Project
dorit updated the diff for D104750: [LV] Support Interleaved Store Group With Gaps.

(accidentally uploaded without context)

Jun 23 2021, 12:31 AM · Restricted Project
dorit updated the diff for D104750: [LV] Support Interleaved Store Group With Gaps.

updated formatting

Jun 23 2021, 12:22 AM · Restricted Project

Jun 22 2021

dorit added a comment to D104750: [LV] Support Interleaved Store Group With Gaps.

Costmodel is wrong: if there are gaps, you need to load the vector of original values, and insert the non-gap elements into it.
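
To make the sequence described in this comment concrete, here is a minimal C++ sketch of a store group with a gap, and of the "load the original values, insert the non-gap elements, store back" approach the comment refers to. It is only an emulation in scalar code: the interleave factor of 3, the vectorization factor of 4, and the function names are assumptions chosen for illustration, and this is not the LoopVectorizer's actual code generation.

```cpp
#include <array>
#include <cstddef>

constexpr std::size_t Factor = 3; // assumed interleave factor (members 0, 1, 2)
constexpr std::size_t VF = 4;     // assumed vectorization factor

// Scalar reference: each group of 3 elements has members 0 and 2 written;
// member 1 is a gap and must keep its original value.
void storeWithGapScalar(int *Dst, const int *A, const int *B, std::size_t N) {
  for (std::size_t I = 0; I < N; ++I) {
    Dst[Factor * I] = A[I];
    Dst[Factor * I + 2] = B[I];
  }
}

// Emulates one wide iteration covering VF groups (hypothetical helper):
// 1. load the wide vector of original values,
// 2. insert the non-gap elements into it,
// 3. store the whole wide vector back.
void storeWithGapWideChunk(int *Dst, const int *A, const int *B) {
  std::array<int, Factor * VF> Wide;
  for (std::size_t L = 0; L < Factor * VF; ++L)
    Wide[L] = Dst[L]; // step 1: original values, including the gaps
  for (std::size_t I = 0; I < VF; ++I) {
    Wide[Factor * I] = A[I];     // step 2: member 0
    Wide[Factor * I + 2] = B[I]; // step 2: member 2
  }
  for (std::size_t L = 0; L < Factor * VF; ++L)
    Dst[L] = Wide[L]; // step 3: gaps are rewritten with their old values
}
```

The extra wide load and the inserts are the operations the comment says a cost model for gapped store groups would need to account for under this approach.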

Jun 22 2021, 2:48 PM · Restricted Project
dorit requested review of D104750: [LV] Support Interleaved Store Group With Gaps.
Jun 22 2021, 2:35 PM · Restricted Project

Oct 17 2019

dorit committed rL375203: Request commit access for dorit.
Request commit access for dorit
Oct 17 2019, 10:33 PM

Aug 15 2019

dorit committed rGd57d73daed30: [LV] fold-tail predication should be respected even with assume_safety (authored by dorit).
[LV] fold-tail predication should be respected even with assume_safety
Aug 15 2019, 12:13 AM
dorit committed rL368973: [LV] fold-tail predication should be respected even with assume_safety.
[LV] fold-tail predication should be respected even with assume_safety
Aug 15 2019, 12:11 AM
dorit closed D66106: [LV] fold-tail predication should be respected even with assume_safety.
Aug 15 2019, 12:11 AM · Restricted Project

Aug 13 2019

dorit committed rG491ca2425d4a: [LV] Fold-tail flag (authored by dorit).
[LV] Fold-tail flag
Aug 13 2019, 10:25 PM
dorit committed rL368801: [LV] Fold-tail flag.
[LV] Fold-tail flag
Aug 13 2019, 10:21 PM
dorit closed D66108: [LV] fold-tail flag.
Aug 13 2019, 10:21 PM · Restricted Project
dorit added a comment to D66108: [LV] fold-tail flag.

Thanks for taking a look! Please see responses below.

Aug 13 2019, 2:14 AM · Restricted Project

Aug 12 2019

dorit created D66108: [LV] fold-tail flag.
Aug 12 2019, 1:11 PM · Restricted Project
dorit created D66106: [LV] fold-tail predication should be respected even with assume_safety.
Aug 12 2019, 1:04 PM · Restricted Project

Jan 31 2019

Herald added a project to D57180: [LV] Avoid adding into interleaved group in presence of WAW dependency: Restricted Project.

I plan on having a look later this week. I am a little worried that the checks in-line here are already quite complex and I would like to have a think if that could be improved in some way.

Jan 31 2019, 11:19 PM · Restricted Project

Oct 31 2018

dorit committed rL345705: [LV] Support vectorization of interleave-groups that require an epilog under.
[LV] Support vectorization of interleave-groups that require an epilog under
Oct 31 2018, 3:01 AM
dorit closed D53668: [LV] Support vectorization of interleave-groups that require an epilog under optsize using masked wide loads.
Oct 31 2018, 3:01 AM

Oct 29 2018

dorit added a comment to D53668: [LV] Support vectorization of interleave-groups that require an epilog under optsize using masked wide loads.

Comments addressed. Thanks!

Oct 29 2018, 7:17 AM
dorit updated the diff for D53668: [LV] Support vectorization of interleave-groups that require an epilog under optsize using masked wide loads.

Addressed comments.
Also added a test with stride 3.

Oct 29 2018, 7:16 AM

Oct 24 2018

dorit accepted D53612: [LV] Avoid vectorizing loops under opt for size that involve SCEV checks.

Just minor comments on the tests.
LGTM.

Oct 24 2018, 1:56 PM
dorit created D53668: [LV] Support vectorization of interleave-groups that require an epilog under optsize using masked wide loads.
Oct 24 2018, 1:29 PM
dorit committed rL345115: [LV] Don't have fold-tail under optsize invalidate interleave-groups when.
[LV] Don't have fold-tail under optsize invalidate interleave-groups when
Oct 24 2018, 12:13 AM
dorit closed D53559: [LV] Don't have fold-tail under optsize invalidate interleave-groups when masked-interleaving is enabled.
Oct 24 2018, 12:13 AM

Oct 23 2018

dorit committed rL345021: Leftover bits from https://reviews.llvm.org/D53420 that were accidentally left.
Leftover bits from https://reviews.llvm.org/D53420 that were accidentally left
Oct 23 2018, 4:54 AM
dorit created D53559: [LV] Don't have fold-tail under optsize invalidate interleave-groups when masked-interleaving is enabled.
Oct 23 2018, 1:43 AM

Oct 21 2018

dorit added inline comments to D53420: [IAI,LV] Avoid creating a scalar epilogue due to gaps in interleave-groups when optimizing for size.
Oct 21 2018, 11:25 PM
dorit committed rL344883: [IAI,LV] Avoid creating a scalar epilogue due to gaps in interleave-groups when .
[IAI,LV] Avoid creating a scalar epilogue due to gaps in interleave-groups when
Oct 21 2018, 11:19 PM
dorit closed D53420: [IAI,LV] Avoid creating a scalar epilogue due to gaps in interleave-groups when optimizing for size.
Oct 21 2018, 11:19 PM
dorit added a comment to D53420: [IAI,LV] Avoid creating a scalar epilogue due to gaps in interleave-groups when optimizing for size.

Addressed comments. See a couple of responses below. Thanks!

Oct 21 2018, 8:24 PM

Oct 20 2018

dorit updated the diff for D53420: [IAI,LV] Avoid creating a scalar epilogue due to gaps in interleave-groups when optimizing for size.

updated to top of trunk.

Oct 20 2018, 11:34 AM

Oct 19 2018

dorit created D53420: [IAI,LV] Avoid creating a scalar epilogue due to gaps in interleave-groups when optimizing for size.
Oct 19 2018, 1:17 AM

Oct 14 2018

dorit committed rL344475: recommit 344472 after fixing build failure on ARM and PPC..
recommit 344472 after fixing build failure on ARM and PPC.
Oct 14 2018, 1:52 AM
dorit committed rL344473: revert 344472 due to failures..
revert 344472 due to failures.
Oct 14 2018, 12:23 AM
dorit added a reverting change for rL344472: [IAI,LV] Add support for vectorizing predicated strided accesses using masked: rL344473: revert 344472 due to failures..
Oct 14 2018, 12:23 AM
dorit committed rL344472: [IAI,LV] Add support for vectorizing predicated strided accesses using masked.
[IAI,LV] Add support for vectorizing predicated strided accesses using masked
Oct 14 2018, 12:08 AM
dorit closed D53011: [LV] Add support for vectorizing predicated strided accesses using masked interleave-group.
Oct 14 2018, 12:08 AM

Oct 10 2018

dorit added a comment to D53011: [LV] Add support for vectorizing predicated strided accesses using masked interleave-group.

Comment addressed, thanks.

Oct 10 2018, 11:40 PM
dorit updated the diff for D53011: [LV] Add support for vectorizing predicated strided accesses using masked interleave-group.
Oct 10 2018, 11:34 PM

Oct 9 2018

dorit added inline comments to D53011: [LV] Add support for vectorizing predicated strided accesses using masked interleave-group.
Oct 9 2018, 5:35 AM
dorit updated the diff for D53011: [LV] Add support for vectorizing predicated strided accesses using masked interleave-group.

Addressed Ayal's comments. Thanks!

Oct 9 2018, 5:31 AM
dorit created D53011: [LV] Add support for vectorizing predicated strided accesses using masked interleave-group.
Oct 9 2018, 12:41 AM

Oct 6 2018

dorit committed rL343931: [IAI,LV] Avoid creating interleave-groups for predicated accesse.
[IAI,LV] Avoid creating interleave-groups for predicated accesse
Oct 6 2018, 11:59 PM
dorit closed D52682: [IAI,LV] Avoid creating interleave-groups for predicated accesses.
Oct 6 2018, 11:59 PM
dorit added a comment to D52682: [IAI,LV] Avoid creating interleave-groups for predicated accesses.

Thanks!

Oct 6 2018, 11:56 PM

Sep 28 2018

dorit created D52682: [IAI,LV] Avoid creating interleave-groups for predicated accesses.
Sep 28 2018, 10:31 PM

Mar 7 2018

dorit added a reviewer for D43812: [LV] Let recordVectorLoopValueForInductionCast to check if IV was created from the cast.: dcaballe.

Hopefully I can delegate the review to Diego...
Thanks for the fix, Andrei

Mar 7 2018, 12:31 PM

Jan 14 2018

dorit accepted D41913: [LV] Don't call recordVectorLoopValueForInductionCast for newly-created IV from a trunc..

LGTM. Thanks for the fix.

Jan 14 2018, 2:46 AM

Dec 14 2017

dorit closed D38948: [LV] Support efficient vectorization of an induction with redundant casts.
Dec 14 2017, 10:54 PM
dorit added a comment to D38948: [LV] Support efficient vectorization of an induction with redundant casts.

Thanks so much for all your help with this work!

Dec 14 2017, 6:21 AM

Dec 13 2017

dorit committed rL320672: [LV] Support efficient vectorization of an induction with redundant casts.
[LV] Support efficient vectorization of an induction with redundant casts
Dec 13 2017, 11:57 PM

Dec 12 2017

dorit added inline comments to D38948: [LV] Support efficient vectorization of an induction with redundant casts.
Dec 12 2017, 12:46 PM
dorit updated the diff for D38948: [LV] Support efficient vectorization of an induction with redundant casts.

Addressed Ayal's and Silviu's comments.

Dec 12 2017, 12:40 PM
dorit committed rL320463: [LV] Ignore the cost of values that will not appear in the vectorized loop.
[LV] Ignore the cost of values that will not appear in the vectorized loop
Dec 12 2017, 12:58 AM
dorit closed D40883: [LV] Ignore the cost of values that will not appear in the vectorized loop by committing rL320463: [LV] Ignore the cost of values that will not appear in the vectorized loop.
Dec 12 2017, 12:58 AM

Dec 10 2017

dorit committed rL320298: [SCEV] Fix wrong Equal predicate created in getAddRecForPhiWithCasts.
[SCEV] Fix wrong Equal predicate created in getAddRecForPhiWithCasts
Dec 10 2017, 3:14 AM
dorit closed D40641: [SCEV] Fix wrong Equal predicate created in getAddRecForPhiWithCasts by committing rL320298: [SCEV] Fix wrong Equal predicate created in getAddRecForPhiWithCasts.
Dec 10 2017, 3:14 AM

Dec 7 2017

dorit updated the diff for D38948: [LV] Support efficient vectorization of an induction with redundant casts.

Dropped the parts that are uploaded for review separately (D40641, D40883), and hopefully addressed Silviu's last comments.

Dec 7 2017, 2:00 PM
dorit added a comment to D40641: [SCEV] Fix wrong Equal predicate created in getAddRecForPhiWithCasts.

ping

Dec 7 2017, 1:29 PM

Dec 6 2017

dorit updated the diff for D40883: [LV] Ignore the cost of values that will not appear in the vectorized loop.

Thanks Florian. Uploaded the formatting fix.

Dec 6 2017, 2:37 AM
dorit added a comment to D38948: [LV] Support efficient vectorization of an induction with redundant casts.

Hi Ayal,

Dec 6 2017, 12:50 AM
dorit created D40883: [LV] Ignore the cost of values that will not appear in the vectorized loop.
Dec 6 2017, 12:43 AM

Nov 30 2017

dorit added inline comments to D38948: [LV] Support efficient vectorization of an induction with redundant casts.
Nov 30 2017, 2:00 AM
dorit created D40641: [SCEV] Fix wrong Equal predicate created in getAddRecForPhiWithCasts.
Nov 30 2017, 1:43 AM

Nov 28 2017

dorit updated the diff for D38948: [LV] Support efficient vectorization of an induction with redundant casts.

(uploaded a fix to LoopUtils:getCastsForInductionPHI())

Nov 28 2017, 5:54 AM
dorit updated the diff for D38948: [LV] Support efficient vectorization of an induction with redundant casts.

Hi Silviu,

Nov 28 2017, 1:55 AM

Nov 22 2017

dorit added a comment to D38948: [LV] Support efficient vectorization of an induction with redundant casts.

Hi Silviu,

Nov 22 2017, 2:03 PM

Nov 20 2017

dorit added a comment to D38948: [LV] Support efficient vectorization of an induction with redundant casts.

Hi Silviu,
I started to try out the approach you suggested, and I realized that our assumption doesn't hold... (see response to inlined comment).
Thanks,
Dorit

Nov 20 2017, 12:59 PM
dorit added inline comments to D38948: [LV] Support efficient vectorization of an induction with redundant casts.
Nov 20 2017, 1:29 AM

Nov 19 2017

dorit updated the diff for D38948: [LV] Support efficient vectorization of an induction with redundant casts.

Addressed Ayal's comments.
Have yet to address Silviu's comments.

Nov 19 2017, 2:49 PM
dorit added a comment to D38948: [LV] Support efficient vectorization of an induction with redundant casts.

Yes, IndVarSimplify wouldn't fix this issue, but I was thinking more of using the techniques there that use the SCEV expressions to find these cases instead of doing the pattern matching (see the inline comment).

Nov 19 2017, 12:40 PM
dorit added a comment to D38948: [LV] Support efficient vectorization of an induction with redundant casts.

Hi Silviu,

Nov 19 2017, 5:24 AM

Nov 16 2017

dorit updated the diff for D38948: [LV] Support efficient vectorization of an induction with redundant casts.

Thanks Ayal. Incorporated your suggestions.

Nov 16 2017, 9:58 AM
dorit added a comment to D38948: [LV] Support efficient vectorization of an induction with redundant casts.

ping^2

Nov 16 2017, 1:58 AM

Nov 15 2017

dorit accepted D40008: [X86][TTI] update costs of interleaved load\store of i64\double.

You missed just one mcpu=skylake :)
LGTM with this change

Nov 15 2017, 11:57 PM · Restricted Project
dorit added a comment to D40008: [X86][TTI] update costs of interleaved load\store of i64\double.

I think it would be nice to make the testcases smaller. Right now you have something like this:
for (…) {
  Dst[2*i] = Dst[2*i] + Src[2*i] * k;
  Dst[2*i+1] = Dst[2*i+1] + Src[2*i+1] * k;
}
...which actually tests both strided loads and strided stores.
So you could either use one test to check both store and load costs (and even then you probably don't need both a mul and an add just to check memop costs).
Or, if you want to separate the load and store cases, the Load test could be something like:
for (…) {
  s += Src[2*i];
  s += Src[2*i+1];
}
The Store test could be something like:
for (…) {
  Dst[2*i] = k1;
  Dst[2*i+1] = k2;
}
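
As a rough illustration of the split suggested above, the two reduced kernels might look like this as standalone C++ functions. The function names, the `double` element type (picked because the review concerns i64/double), and the explicit trip-count parameter are assumptions for illustration; they are not taken from the actual test files.

```cpp
#include <cstddef>

// Hypothetical load-only kernel: two stride-2 loads per iteration feed a
// reduction, so only the interleaved-load cost is exercised.
double sumInterleavedPairs(const double *Src, std::size_t N) {
  double S = 0.0;
  for (std::size_t I = 0; I < N; ++I) {
    S += Src[2 * I];
    S += Src[2 * I + 1];
  }
  return S;
}

// Hypothetical store-only kernel: two stride-2 stores of loop-invariant
// values, so only the interleaved-store cost is exercised.
void storeInterleavedPairs(double *Dst, std::size_t N, double K1, double K2) {
  for (std::size_t I = 0; I < N; ++I) {
    Dst[2 * I] = K1;
    Dst[2 * I + 1] = K2;
  }
}
```

Keeping the arithmetic out of each kernel means any change in the reported cost can be attributed to the memory operations alone.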

Nov 15 2017, 10:28 AM · Restricted Project

Nov 7 2017

dorit added a comment to D38948: [LV] Support efficient vectorization of an induction with redundant casts.

ping :)

Nov 7 2017, 1:04 PM

Nov 5 2017

dorit updated the diff for D38948: [LV] Support efficient vectorization of an induction with redundant casts.

Incorporated Ayal's comments. Thanks!

Nov 5 2017, 10:23 AM
dorit committed rL317438.
Nov 5 2017, 8:53 AM
dorit closed D38785: [LV/LAA] Avoid specializing a loop for stride=1 when this predicate implies a single-iteration loop by committing rL317438.
Nov 5 2017, 8:53 AM

Nov 2 2017

dorit added a comment to D38785: [LV/LAA] Avoid specializing a loop for stride=1 when this predicate implies a single-iteration loop.

Hi Silviu,

Nov 2 2017, 4:50 AM

Oct 18 2017

dorit retitled D38785: [LV/LAA] Avoid specializing a loop for stride=1 when this predicate implies a single-iteration loop from [LV/LAA] Avoid secializing a loop for stride=1 when this predicate implies a single-iteration loop to [LV/LAA] Avoid specializing a loop for stride=1 when this predicate implies a single-iteration loop.
Oct 18 2017, 1:37 AM

Oct 17 2017

dorit accepted D38762: Cost calculation for interleave load/store patterns {v8i8,v16i8,v32i8,v64i8}.

LGTM with the last couple of comments.

Oct 17 2017, 3:47 AM
dorit added a comment to D38762: Cost calculation for interleave load/store patterns {v8i8,v16i8,v32i8,v64i8}.

AVX512 side of things now also looks good to me (with the tiny comments below).

Oct 17 2017, 3:32 AM

Oct 16 2017

dorit added a comment to D38762: Cost calculation for interleave load/store patterns {v8i8,v16i8,v32i8,v64i8}.

The AVX2 changes look ok to me now.
A couple comments about the AVX512 changes below.

Oct 16 2017, 5:57 AM
dorit created D38948: [LV] Support efficient vectorization of an induction with redundant casts.
Oct 16 2017, 5:29 AM

Oct 11 2017

dorit added a comment to D38785: [LV/LAA] Avoid specializing a loop for stride=1 when this predicate implies a single-iteration loop.

Hi Silviu,

Oct 11 2017, 5:10 AM

Oct 10 2017

dorit added reviewers for D38785: [LV/LAA] Avoid specializing a loop for stride=1 when this predicate implies a single-iteration loop: Ayal, hfinkel, silviu.baranga.
Oct 10 2017, 11:27 PM
dorit created D38785: [LV/LAA] Avoid specializing a loop for stride=1 when this predicate implies a single-iteration loop.
Oct 10 2017, 11:25 PM
dorit added a comment to D38762: Cost calculation for interleave load/store patterns {v8i8,v16i8,v32i8,v64i8}.

My only main concerns are with respect to interleave-group with gaps (see below), and the fact that we don't distinguish the AVX2 from the AVX512 case (also see below). Just minor comments beyond that.

Oct 10 2017, 11:01 PM

Sep 9 2017

dorit added a comment to D37507: Fix maximum legal VF calculation.

Looks fine to me (with a couple very minor dbg reports improvements).
@Ayal/@hfinkel - what do you say?

Sep 9 2017, 11:43 PM