david-arm (David Sherwood)
User

Projects

User does not belong to any projects.

User Details

User Since
Nov 20 2019, 6:41 AM (175 w, 1 d)

Recent Activity

Wed, Mar 29

david-arm accepted D147136: [AArch64][SME] Fix broken intrinsics for ZA STR (vector).

LGTM! Thanks for this fix @sdesmalen.

Wed, Mar 29, 6:44 AM · Restricted Project, Restricted Project
david-arm requested review of D147152: [LoopVectorize] Fix zext/sext cost calculations when types are shrunk.
Wed, Mar 29, 6:33 AM · Restricted Project, Restricted Project
david-arm requested review of D147151: [NFC][LoopVectorize] Add zext/sext cost tests when there is type shrinkage.
Wed, Mar 29, 6:31 AM · Restricted Project, Restricted Project
david-arm accepted D136251: [LoopVectorize] Use available masked vector functions when required.

LGTM! Thanks for all the good work on this so far @huntergr. :)

Wed, Mar 29, 6:07 AM · Restricted Project, Restricted Project
david-arm accepted D147040: [AArch64][CodeGen] Use interleave store for streaming compatible functions.

LGTM! Excellent! Thanks for making the changes @CarolineConcatto.

Wed, Mar 29, 6:06 AM · Restricted Project, Restricted Project
david-arm added a comment to D147040: [AArch64][CodeGen] Use interleave store for streaming compatible functions.

Thanks for the new tests @CarolineConcatto! I just had a couple more suggestions on possibly improving the tests a bit more ...

Wed, Mar 29, 5:49 AM · Restricted Project, Restricted Project
david-arm committed rG0ef8a79b12f6: [LoopVectorize] Add non-zero check for MaxPowerOf2RuntimeVF in computeMaxVF (authored by david-arm).
Wed, Mar 29, 3:09 AM · Restricted Project, Restricted Project
david-arm added inline comments to D147040: [AArch64][CodeGen] Use interleave store for streaming compatible functions.
Wed, Mar 29, 1:51 AM · Restricted Project, Restricted Project
david-arm added inline comments to D146199: [LoopVectorize] Don't tail-fold for scalable VFs when there is no scalar tail.
Wed, Mar 29, 1:18 AM · Restricted Project, Restricted Project

Mon, Mar 27

david-arm committed rG636efd2e3508: [SVE][LoopVectorize] Add option to disable tail-folding for reverse loops (authored by david-arm).
Mon, Mar 27, 7:10 AM · Restricted Project, Restricted Project
david-arm closed D146128: [SVE][LoopVectorize] Add option to disable tail-folding for reverse loops.
Mon, Mar 27, 7:10 AM · Restricted Project, Restricted Project
david-arm updated the diff for D146128: [SVE][LoopVectorize] Add option to disable tail-folding for reverse loops.
  • Changed patch to remove dependence on D146127
Mon, Mar 27, 5:52 AM · Restricted Project, Restricted Project
david-arm committed rG1c4fedfa35ae: [LoopVectorize] Don't tail-fold for scalable VFs when there is no scalar tail (authored by david-arm).
Mon, Mar 27, 1:35 AM · Restricted Project, Restricted Project
david-arm closed D146199: [LoopVectorize] Don't tail-fold for scalable VFs when there is no scalar tail.
Mon, Mar 27, 1:34 AM · Restricted Project, Restricted Project
david-arm added inline comments to D146128: [SVE][LoopVectorize] Add option to disable tail-folding for reverse loops.
Mon, Mar 27, 1:07 AM · Restricted Project, Restricted Project

Fri, Mar 24

david-arm added a comment to D144274: [InstCombine] use loop info when running the pass after loop vectorization.

I've put up https://reviews.llvm.org/D146813 to move the loop invariant GEP reassociation into LICM, which should allow us to drop the LoopInfo dependency from InstCombine.

Fri, Mar 24, 9:36 AM · Restricted Project, Restricted Project
david-arm updated the diff for D146127: [NFC][LoopVectorize] Simplify preferPredicateOverEpilogue interface.
  • Introduced a new TailFoldingInfo structure at the suggestion of @paulwalker-arm on D146128 as I think this makes sense.
Fri, Mar 24, 9:29 AM · Restricted Project, Restricted Project
david-arm added a reviewer for D146127: [NFC][LoopVectorize] Simplify preferPredicateOverEpilogue interface: paulwalker-arm.
Fri, Mar 24, 7:55 AM · Restricted Project, Restricted Project
david-arm added inline comments to D136251: [LoopVectorize] Use available masked vector functions when required.
Fri, Mar 24, 7:01 AM · Restricted Project, Restricted Project
david-arm added a comment to D144274: [InstCombine] use loop info when running the pass after loop vectorization.

Is there no room for an InstCombineWithLoopInfo to avoid all the phase ordering issues?

Fri, Mar 24, 6:59 AM · Restricted Project, Restricted Project
david-arm added a comment to D144274: [InstCombine] use loop info when running the pass after loop vectorization.

I think I've found the problem - the patch was missing another case that needs the loop info option.

Great, would it be possible to add a phase ordering test that demonstrates the issue and shows that it is fixed with the latest version?

Fri, Mar 24, 6:58 AM · Restricted Project, Restricted Project
david-arm added inline comments to D146128: [SVE][LoopVectorize] Add option to disable tail-folding for reverse loops.
Fri, Mar 24, 6:27 AM · Restricted Project, Restricted Project
david-arm updated the diff for D146128: [SVE][LoopVectorize] Add option to disable tail-folding for reverse loops.
  • Addressed review comments
Fri, Mar 24, 6:26 AM · Restricted Project, Restricted Project
david-arm updated the diff for D146199: [LoopVectorize] Don't tail-fold for scalable VFs when there is no scalar tail.
  • Addressed review comments.
Fri, Mar 24, 5:57 AM · Restricted Project, Restricted Project
david-arm committed rGbd0c281fcdcb: [NFC][LoopVectorize] Change trip counts for some tests to guarantee a scalar… (authored by david-arm).
Fri, Mar 24, 2:44 AM · Restricted Project, Restricted Project
david-arm closed D146219: [NFC][LoopVectorize] Change trip counts for some tests to guarantee a scalar tail.
Fri, Mar 24, 2:44 AM · Restricted Project, Restricted Project

Thu, Mar 23

david-arm updated the summary of D144274: [InstCombine] use loop info when running the pass after loop vectorization.
Thu, Mar 23, 6:27 AM · Restricted Project, Restricted Project
david-arm added a reviewer for D144274: [InstCombine] use loop info when running the pass after loop vectorization: sdesmalen.
Thu, Mar 23, 6:24 AM · Restricted Project, Restricted Project
david-arm updated the diff for D144274: [InstCombine] use loop info when running the pass after loop vectorization.
  • Added a missing place where we require the loop info.
Thu, Mar 23, 6:23 AM · Restricted Project, Restricted Project
david-arm added inline comments to D145301: Add more efficient vector bitcast for AArch64.
Thu, Mar 23, 6:20 AM · Restricted Project, Restricted Project
david-arm commandeered D144274: [InstCombine] use loop info when running the pass after loop vectorization.

I have spoken with @spatel who said he is unlikely to have much time to progress this patch and he's happy for me to commandeer it.

Thu, Mar 23, 4:58 AM · Restricted Project, Restricted Project
david-arm reopened D144274: [InstCombine] use loop info when running the pass after loop vectorization.

I think I've found the problem - the patch was missing another case that needs the loop info option.

Thu, Mar 23, 4:57 AM · Restricted Project, Restricted Project

Wed, Mar 22

david-arm commandeered D144045: [InstCombine] avoid sinking fdiv into a loop.

I have spoken with @spatel who said he is unlikely to have much time to progress this patch and he's happy for me to commandeer it. I would like to make progress on this because it is an important fix on AArch64 for the SPEC2017 benchmark parest.

Wed, Mar 22, 2:30 AM · Restricted Project, Restricted Project

Tue, Mar 21

david-arm added a comment to D146199: [LoopVectorize] Don't tail-fold for scalable VFs when there is no scalar tail.

I had written a very similar patch recently, but it would only use the fixed length if the scalable was unknown. The performance of it was pretty bad though, so I ended up dropping it. I had noticed that there is an xfail in llvm/test/Transforms/LoopVectorize/AArch64/eliminate-tail-predication.ll at the moment. Can it now be replaced with a check for store <vscale x 4 x i32>?

TargetTransformInfo::isVScaleKnownToBeAPowerOfTwo isn't going to be usable from all the places that need it, like InstCombine. It might be best to add it to somewhere like vscale_range in the long run?

Tue, Mar 21, 10:15 AM · Restricted Project, Restricted Project
david-arm updated the diff for D146199: [LoopVectorize] Don't tail-fold for scalable VFs when there is no scalar tail.
  • Addressed review comments about using std::optional
Tue, Mar 21, 9:49 AM · Restricted Project, Restricted Project
david-arm added a comment to D146128: [SVE][LoopVectorize] Add option to disable tail-folding for reverse loops.

LoopVectorizationLegality::containsDecreasingPointers seems to loop over all the instructions and call isConsecutivePtr, which just calls getPtrStride. Could that logic just be placed in AArch64TTIImpl::preferPredicateOverEpilogue? That is how it has worked in ARMTTIImpl::preferPredicateOverEpilogue via canTailPredicateLoop. Otherwise the code in containsDecreasingPointers is run for every architecture, but only used by AArch64.

Tue, Mar 21, 6:05 AM · Restricted Project, Restricted Project
david-arm updated the diff for D146128: [SVE][LoopVectorize] Add option to disable tail-folding for reverse loops.
  • Moved containsDecreasingPointers to AArch64TargetTransformInfo.cpp
Tue, Mar 21, 6:04 AM · Restricted Project, Restricted Project

Thu, Mar 16

david-arm updated subscribers of D146199: [LoopVectorize] Don't tail-fold for scalable VFs when there is no scalar tail.

Hi @paulwalker-arm,

Thu, Mar 16, 10:25 AM · Restricted Project, Restricted Project
david-arm updated the diff for D146199: [LoopVectorize] Don't tail-fold for scalable VFs when there is no scalar tail.
  • Rebased on top of NFC test patch.
Thu, Mar 16, 4:59 AM · Restricted Project, Restricted Project
david-arm requested review of D146219: [NFC][LoopVectorize] Change trip counts for some tests to guarantee a scalar tail.
Thu, Mar 16, 4:57 AM · Restricted Project, Restricted Project
david-arm added a comment to D146128: [SVE][LoopVectorize] Add option to disable tail-folding for reverse loops.

LoopVectorizationLegality::containsDecreasingPointers seems to loop over all the instructions and call isConsecutivePtr, which just calls getPtrStride. Could that logic just be placed in AArch64TTIImpl::preferPredicateOverEpilogue? That is how it has worked in ARMTTIImpl::preferPredicateOverEpilogue via canTailPredicateLoop. Otherwise the code in containsDecreasingPointers is run for every architecture, but only used by AArch64.

Thu, Mar 16, 3:40 AM · Restricted Project, Restricted Project
david-arm requested review of D146199: [LoopVectorize] Don't tail-fold for scalable VFs when there is no scalar tail.
Thu, Mar 16, 1:50 AM · Restricted Project, Restricted Project

Wed, Mar 15

david-arm added a reviewer for D145583: [AArch64][SME] Fix an infinite loop in DAGCombine related to adding -force-streaming-compatible-sve flag.: hassnaa-arm.

Adding @hassnaa-arm as a reviewer too as she has been doing a lot of work on the streaming compatible code generation.

Wed, Mar 15, 9:11 AM · Restricted Project, Restricted Project
david-arm added inline comments to D145583: [AArch64][SME] Fix an infinite loop in DAGCombine related to adding -force-streaming-compatible-sve flag..
Wed, Mar 15, 9:08 AM · Restricted Project, Restricted Project
david-arm requested review of D146128: [SVE][LoopVectorize] Add option to disable tail-folding for reverse loops.
Wed, Mar 15, 4:45 AM · Restricted Project, Restricted Project
david-arm requested review of D146127: [NFC][LoopVectorize] Simplify preferPredicateOverEpilogue interface.
Wed, Mar 15, 4:38 AM · Restricted Project, Restricted Project

Mon, Mar 13

david-arm added a comment to D144274: [InstCombine] use loop info when running the pass after loop vectorization.

Hm, it looks like this ended up causing non-trivial codegen changes: https://llvm-compile-time-tracker.com/compare.php?from=772aa05452f8ff90a47168e6801cda2acb5a1873&to=43ae4b62b2671cf73e691c0b53324cd39405cd51&stat=size-text So we're not preserving previous behavior in some significant way(s).

Thanks - I reverted with 43ae4b62b267.
It's unlikely that I will be able to investigate those diffs in the near term, so hopefully someone else can commandeer/edit this patch and update D144045.

Hi @sanjay, thanks for landing the patch anyway!

Mon, Mar 13, 1:51 AM · Restricted Project, Restricted Project

Fri, Mar 10

david-arm accepted D145755: [SVE] Restrict cmp+and->pred_cmp isel to instances where the and is the sole user of the compare..

LGTM!

Fri, Mar 10, 5:17 AM · Restricted Project, Restricted Project

Mon, Mar 6

david-arm accepted D145356: [LoopVectorize][NFC] Fix incorrect comment in LoopVectorize.h.
Mon, Mar 6, 6:01 AM · Restricted Project, Restricted Project
david-arm accepted D144274: [InstCombine] use loop info when running the pass after loop vectorization.

From what I can see the patch LGTM! It seems like all review comments have been addressed and this patch then unblocks D144045.

Mon, Mar 6, 5:56 AM · Restricted Project, Restricted Project

Fri, Mar 3

david-arm updated the summary of D145236: [AArch64] Improve codegen for vectorised loops with two active lane masks.
Fri, Mar 3, 5:36 AM · Restricted Project, Restricted Project
david-arm updated the summary of D145236: [AArch64] Improve codegen for vectorised loops with two active lane masks.
Fri, Mar 3, 5:35 AM · Restricted Project, Restricted Project
david-arm requested review of D145236: [AArch64] Improve codegen for vectorised loops with two active lane masks.
Fri, Mar 3, 5:34 AM · Restricted Project, Restricted Project

Feb 28 2023

david-arm committed rG6333779ac082: [AArch64][SME2] Add multi-vector uunpk and sunpk intrinsics (authored by david-arm).
Feb 28 2023, 6:02 AM · Restricted Project, Restricted Project
david-arm closed D142964: [AArch64][SME2] Add multi-vector uunpk and sunpk intrinsics.
Feb 28 2023, 6:02 AM · Restricted Project, Restricted Project

Feb 27 2023

david-arm accepted D143278: [SME2][AArch64] Add multi-indexed multiply-add long long intrinsics.

LGTM!

Feb 27 2023, 3:36 AM · Restricted Project, Restricted Project

Feb 22 2023

david-arm resigned from D120234: [ARM] Make i32 ISD::ABS Legal instead of pattern matching during isel..

This patch hasn't moved for a long time. Trying to clean up my review list in Phabricator!

Feb 22 2023, 3:58 AM · Restricted Project, Restricted Project
david-arm resigned from D94444: [RFC][Scalable] Add scalable shuffle intrinsic to extract evens from a pair of vectors.

This patch hasn't moved for a long time. Trying to clean up my review list in Phabricator!

Feb 22 2023, 3:58 AM · Restricted Project, Restricted Project
david-arm added inline comments to D136251: [LoopVectorize] Use available masked vector functions when required.
Feb 22 2023, 3:44 AM · Restricted Project, Restricted Project

Feb 20 2023

david-arm accepted D144118: [SME2][AArch64] Add multi-vector rounding shift left intrinsics.

LGTM! Excellent. :)

Feb 20 2023, 5:26 AM · Restricted Project, Restricted Project
david-arm accepted D144018: [AArch64] More consistently use buildvector for zero and all-ones constants.

LGTM! Thanks for making the changes @dmgreen. :)

Feb 20 2023, 3:48 AM · Restricted Project, Restricted Project

Feb 16 2023

david-arm accepted D143278: [SME2][AArch64] Add multi-indexed multiply-add long long intrinsics.

LGTM!

Feb 16 2023, 5:19 AM · Restricted Project, Restricted Project
david-arm accepted D143277: [SME2][AArch64] Add multi-multi multiply-add long long intrinsics.

LGTM!

Feb 16 2023, 5:16 AM · Restricted Project, Restricted Project

Feb 15 2023

david-arm added a comment to D144018: [AArch64] More consistently use buildvector for zero and all-ones constants.

Looks sensible to me! I just had one minor comment ...

Feb 15 2023, 9:10 AM · Restricted Project, Restricted Project
david-arm accepted D143276: [SME2][AArch64] Add multi-single multiply-add long long intrinsics.

LGTM! Very good!

Feb 15 2023, 8:59 AM · Restricted Project, Restricted Project

Feb 14 2023

david-arm added a comment to D142894: [LoopVectorize] Use overflow-check analysis to improve tail-folding..

Nice! I had a few minor comments, except for a possible issue with the use of getSmallBestKnownTC.

Feb 14 2023, 9:29 AM · Restricted Project, Restricted Project
david-arm added a comment to D143631: [LTO] Don't let InstCombine re-sink the vastly more expensive fdiv.

Hi @spatel @craig.topper, I did think about trying to change InstCombine to avoid sinking, but given resistance to this approach in the past I wasn't sure if it was the right one. However, it's certainly unfortunate that we run InstCombine so many times in the LTO pipeline that we end up with this problem. I agree that adding another LICM pass at the end increases compilation time too, which isn't great. I think ultimately we have to decide between 1) increasing compilation time without changing InstCombine, and 2) reducing compilation time and increasing performance by not sinking the fdiv in InstCombine. I'm not particularly attached to either approach, but I definitely do want to fix it. :)

Feb 14 2023, 1:16 AM · Restricted Project, Restricted Project

Feb 9 2023

david-arm added inline comments to D143278: [SME2][AArch64] Add multi-indexed multiply-add long long intrinsics.
Feb 9 2023, 5:58 AM · Restricted Project, Restricted Project
david-arm added inline comments to D143277: [SME2][AArch64] Add multi-multi multiply-add long long intrinsics.
Feb 9 2023, 5:25 AM · Restricted Project, Restricted Project
david-arm added a comment to D127910: [Clang][AArch64][SME] Add vector load/store (ld1/st1) intrinsics.

Thanks a lot for making all the changes @bryanpkc - it's looking really good now! I just have a few minor comments/suggestions and then I think it looks good to go.

Feb 9 2023, 4:00 AM · Restricted Project, Restricted Project
david-arm updated the summary of D143631: [LTO] Don't let InstCombine re-sink the vastly more expensive fdiv.
Feb 9 2023, 1:21 AM · Restricted Project, Restricted Project
david-arm accepted D132458: [LoopVectorize] Synthesize mask operands for vector variants as needed.

LGTM! Thanks for making all the changes @huntergr. :)

Feb 9 2023, 1:18 AM · Restricted Project, Restricted Project
david-arm requested review of D143631: [LTO] Don't let InstCombine re-sink the vastly more expensive fdiv.
Feb 9 2023, 1:12 AM · Restricted Project, Restricted Project

Feb 8 2023

david-arm added inline comments to D143276: [SME2][AArch64] Add multi-single multiply-add long long intrinsics.
Feb 8 2023, 8:47 AM · Restricted Project, Restricted Project

Feb 6 2023

david-arm accepted D142109: [LoopVectorize] Remove runtime check and scalar tail loop when tail-folding..

LGTM!

Feb 6 2023, 6:48 AM · Restricted Project, Restricted Project

Feb 3 2023

david-arm committed rG9d19bbe6e0c9: [AArch64][SVE2p1] Add IntrNoMem to int_aarch64_sve_psel intrinsic (authored by david-arm).
Feb 3 2023, 8:17 AM · Restricted Project, Restricted Project
david-arm closed D143262: [AArch64][SVE2p1] Add IntrNoMem to int_aarch64_sve_psel intrinsic.
Feb 3 2023, 8:17 AM · Restricted Project, Restricted Project
david-arm accepted D143058: [AArch64][SME2] Add multi-vector frint intrinsics.

LGTM!

Feb 3 2023, 6:09 AM · Restricted Project, Restricted Project
david-arm requested review of D143262: [AArch64][SVE2p1] Add IntrNoMem to int_aarch64_sve_psel intrinsic.
Feb 3 2023, 5:38 AM · Restricted Project, Restricted Project
david-arm committed rG6c1842845fdc: [AArch64][SVE2p1] Add 2-way SVE2p1 dot product intrinsics (authored by david-arm).
Feb 3 2023, 4:08 AM · Restricted Project, Restricted Project
david-arm closed D143066: [AArch64][SVE2p1] Add 2-way SVE2p1 dot product intrinsics.
Feb 3 2023, 4:08 AM · Restricted Project, Restricted Project
david-arm committed rG6e1ebb916e46: Fix up tests committed in 57a6bb34729df30df18a133ee2083b3323a936dc (authored by david-arm).
Feb 3 2023, 3:19 AM · Restricted Project, Restricted Project
david-arm committed rG57a6bb34729d: [AArch64][SME2] Add multi-vector zip/uzp intrinsics (authored by david-arm).
Feb 3 2023, 2:50 AM · Restricted Project, Restricted Project
david-arm closed D142904: [AArch64][SME2] Add multi-vector zip/uzp intrinsics.
Feb 3 2023, 2:50 AM · Restricted Project, Restricted Project
david-arm committed rGd54d99748db6: [AArch64][SME2] Add LLVM IR intrinsics for multi-indexed dots (authored by david-arm).
Feb 3 2023, 1:58 AM · Restricted Project, Restricted Project
david-arm closed D142533: [AArch64][SME2] Add LLVM IR intrinsics for multi-indexed dots.
Feb 3 2023, 1:58 AM · Restricted Project, Restricted Project
david-arm accepted D142732: [AArch64][SME2] Add multi-vector floating point min/max number intrinsics.

LGTM!

Feb 3 2023, 1:42 AM · Restricted Project, Restricted Project

Feb 1 2023

david-arm requested review of D143066: [AArch64][SVE2p1] Add 2-way SVE2p1 dot product intrinsics.
Feb 1 2023, 6:14 AM · Restricted Project, Restricted Project
david-arm updated the diff for D142904: [AArch64][SME2] Add multi-vector zip/uzp intrinsics.
  • Added unused parameters to the x4 tests and changed x2 tests to ensure we're testing the destination register starts at a multiple of 2.
Feb 1 2023, 4:22 AM · Restricted Project, Restricted Project
david-arm added a comment to D142998: [SVE][codegen] Add few more tests for MUL followed by ADD/SUB (NFC).

Thanks for putting these tests in a precommit patch @sushgokh!

Feb 1 2023, 3:25 AM · Restricted Project, Restricted Project
david-arm added a comment to D142656: [SVE][codegen] Add pattern for SVE multiply-add accumulate.

Hi @SjoerdMeijer, in @sushgokh's defence there is precedent for some of the changes in this patch - by changing from SVE_4_Op_Pat to SVE_4_Mad_Op_Pat we are able to set the AddedComplexity field to the pattern, which is not dissimilar to SVE_3_Op_Pat_SelZero or SVE_3_Op_Pat_Shift_Imm_SelZero, i.e.

Feb 1 2023, 3:10 AM · Restricted Project, Restricted Project
david-arm committed rGcdcd43c87e9b: [AArch64][SME2] Add LLVM IR intrinsics for multi-multi dots (authored by david-arm).
Feb 1 2023, 2:00 AM · Restricted Project, Restricted Project
david-arm closed D142478: [AArch64][SME2] Add LLVM IR intrinsics for multi-multi dots.
Feb 1 2023, 1:59 AM · Restricted Project, Restricted Project
david-arm accepted D142887: [LoopVectorize][TTI] NFCI: Clarify enum for the tail folding style..

LGTM! Thanks for making the changes. :)

Feb 1 2023, 1:57 AM · Restricted Project, Restricted Project

Jan 31 2023

david-arm added inline comments to D142656: [SVE][codegen] Add pattern for SVE multiply-add accumulate.
Jan 31 2023, 8:32 AM · Restricted Project, Restricted Project
david-arm committed rGd538bc7f0303: [AArch64][SME2] Add LLVM IR intrinsics for multi-single dots (authored by david-arm).
Jan 31 2023, 7:40 AM · Restricted Project, Restricted Project
david-arm closed D142474: [AArch64][SME2] Add LLVM IR intrinsics for multi-single dots.
Jan 31 2023, 7:40 AM · Restricted Project, Restricted Project
david-arm added a comment to D132458: [LoopVectorize] Synthesize mask operands for vector variants as needed.

Thanks a lot for addressing all the comments @huntergr! I just have a few more minor comments then I think it's good to go. :)

Jan 31 2023, 6:47 AM · Restricted Project, Restricted Project
david-arm added a comment to D142875: [LV] Predicated epilog vectorization.

Hi @dmgreen, thanks for this patch - it adds some very useful functionality to the vectoriser and allows us to reduce the code size for epilogues too! I just had a few questions ...

Jan 31 2023, 3:35 AM · Restricted Project, Restricted Project
david-arm added a reviewer for D142887: [LoopVectorize][TTI] NFCI: Clarify enum for the tail folding style.: paulwalker-arm.

This patch looks good and it saves us having two orthogonal states! I just had a few minor comments ...

Jan 31 2023, 2:51 AM · Restricted Project, Restricted Project