cameron.mcinally (Cameron McInally)
User

Projects

User does not belong to any projects.

User Details

User Since
Jan 6 2015, 6:21 AM (429 w, 2 d)

Recent Activity

Feb 22 2023

cameron.mcinally committed rGaf4c4f4e2143: [DAGCombine] Fix an ICE in combineMinNumMaxNum(...) (authored by cameron.mcinally).
[DAGCombine] Fix an ICE in combineMinNumMaxNum(...)
Feb 22 2023, 11:01 AM · Restricted Project, Restricted Project
cameron.mcinally closed D144571: [DAGCombine] Fix an ICE in combineMinNumMaxNum(...).
Feb 22 2023, 11:01 AM · Restricted Project, Restricted Project
cameron.mcinally added inline comments to D144571: [DAGCombine] Fix an ICE in combineMinNumMaxNum(...).
Feb 22 2023, 10:35 AM · Restricted Project, Restricted Project
cameron.mcinally added inline comments to D144571: [DAGCombine] Fix an ICE in combineMinNumMaxNum(...).
Feb 22 2023, 10:25 AM · Restricted Project, Restricted Project
cameron.mcinally updated the diff for D144571: [DAGCombine] Fix an ICE in combineMinNumMaxNum(...).

Updated patch for Matt's reviews...

Feb 22 2023, 10:24 AM · Restricted Project, Restricted Project
cameron.mcinally updated the diff for D144571: [DAGCombine] Fix an ICE in combineMinNumMaxNum(...).

Update the diff to include context.

Feb 22 2023, 9:39 AM · Restricted Project, Restricted Project
cameron.mcinally added a comment to rG65420c8041f4: DAG: Use getNegatedExpression in combineMinNumMaxNum.

Created D144571 with a patch to fix the ICE.

Feb 22 2023, 9:37 AM · Restricted Project, Restricted Project
cameron.mcinally requested review of D144571: [DAGCombine] Fix an ICE in combineMinNumMaxNum(...).
Feb 22 2023, 9:35 AM · Restricted Project, Restricted Project

Feb 21 2023

cameron.mcinally added a comment to rG65420c8041f4: DAG: Use getNegatedExpression in combineMinNumMaxNum.

Also note that a fix will be needed for the release/16.x branch.

Feb 21 2023, 1:43 PM · Restricted Project, Restricted Project
cameron.mcinally added a comment to rG65420c8041f4: DAG: Use getNegatedExpression in combineMinNumMaxNum.

Hi Matt,

Feb 21 2023, 1:42 PM · Restricted Project, Restricted Project

Jan 9 2023

cameron.mcinally accepted D140318: [SVE] Restrict SVE fixed length extload/truncstore combine to float and double types..

LGTM

Jan 9 2023, 7:10 AM · Restricted Project, Restricted Project

Aug 24 2022

cameron.mcinally committed rG38d58c1b376d: [GlobalOpt] Bail out of GlobalOpt SROA if a Scalable Vector is seen (authored by cameron.mcinally).
[GlobalOpt] Bail out of GlobalOpt SROA if a Scalable Vector is seen
Aug 24 2022, 1:22 PM · Restricted Project, Restricted Project
cameron.mcinally closed D132417: [GlobalOpt] Bail out on SROA of a Global if a scalable vector type is seen.
Aug 24 2022, 1:21 PM · Restricted Project, Restricted Project
cameron.mcinally updated the diff for D132417: [GlobalOpt] Bail out on SROA of a Global if a scalable vector type is seen.

Apologies, @efriedma. I've updated the patch again to move the check into collectSRATypes(). Would you mind doing one more review?

Aug 24 2022, 8:50 AM · Restricted Project, Restricted Project

Aug 23 2022

cameron.mcinally updated the diff for D132417: [GlobalOpt] Bail out on SROA of a Global if a scalable vector type is seen.

Not sure why you're sticking the test in llvm/test/CodeGen/AArch64 instead of llvm/test/Transforms/GlobalOpt/.

Aug 23 2022, 2:55 PM · Restricted Project, Restricted Project

Aug 22 2022

cameron.mcinally requested review of D132417: [GlobalOpt] Bail out on SROA of a Global if a scalable vector type is seen.
Aug 22 2022, 3:13 PM · Restricted Project, Restricted Project

Apr 7 2022

cameron.mcinally added inline comments to D115924: [ConstantFolding] Unify handling of load from uniform value.
Apr 7 2022, 7:49 AM · Restricted Project, Restricted Project, Restricted Project

Apr 6 2022

Herald added a project to D115924: [ConstantFolding] Unify handling of load from uniform value: Restricted Project.
Apr 6 2022, 10:50 AM · Restricted Project, Restricted Project, Restricted Project

Apr 5 2022

cameron.mcinally accepted D120328: [DAGCombine] insert_subvector undef, (splat X), N2 -> splat X.

Ah, great. Thanks for working on this.

Apr 5 2022, 9:05 AM · Restricted Project, Restricted Project

Mar 15 2022

cameron.mcinally requested changes to D120328: [DAGCombine] insert_subvector undef, (splat X), N2 -> splat X.

Just spoke with Paul about this issue. We decided that defining the undef elements may be too aggressive. Updated patch to come...

Mar 15 2022, 9:09 AM · Restricted Project, Restricted Project

Mar 1 2022

cameron.mcinally committed rG70629d570bb6: [SVE] Update patterns to commute FMLS multiplication operands (authored by cameron.mcinally).
[SVE] Update patterns to commute FMLS multiplication operands
Mar 1 2022, 12:53 PM · Restricted Project
cameron.mcinally closed D120570: [SVE] Add pattern to commute FMLS operands.
Mar 1 2022, 12:53 PM · Restricted Project, Restricted Project
cameron.mcinally retitled D120570: [SVE] Add pattern to commute FMLS operands from [SVE] Add pattern to commute FMSB operands to [SVE] Add pattern to commute FMLS operands.
Mar 1 2022, 8:46 AM · Restricted Project, Restricted Project
cameron.mcinally updated the diff for D120570: [SVE] Add pattern to commute FMLS operands.

Addressed Paul's review...

Mar 1 2022, 8:21 AM · Restricted Project, Restricted Project
cameron.mcinally added inline comments to D120570: [SVE] Add pattern to commute FMLS operands.
Mar 1 2022, 7:18 AM · Restricted Project, Restricted Project

Feb 28 2022

cameron.mcinally updated the diff for D120570: [SVE] Add pattern to commute FMLS operands.

Updated Diff to implement commuting with PatFrags.

Feb 28 2022, 12:55 PM · Restricted Project, Restricted Project

Feb 25 2022

cameron.mcinally accepted D120328: [DAGCombine] insert_subvector undef, (splat X), N2 -> splat X.

LGTM. Thanks, Paul.

Feb 25 2022, 2:14 PM · Restricted Project, Restricted Project
cameron.mcinally requested review of D120570: [SVE] Add pattern to commute FMLS operands.
Feb 25 2022, 8:35 AM · Restricted Project, Restricted Project

Feb 23 2022

cameron.mcinally abandoned D120152: [AArch64][SVE] Match VLS all-1's masks to PTRUE.

Good point. Replacing the lowered truncates with ptrue sounds like a win in the general case. Abandoning this Diff.

Feb 23 2022, 6:45 AM · Restricted Project

Feb 22 2022

cameron.mcinally added inline comments to D120152: [AArch64][SVE] Match VLS all-1's masks to PTRUE.
Feb 22 2022, 12:04 PM · Restricted Project
cameron.mcinally updated the diff for D120152: [AArch64][SVE] Match VLS all-1's masks to PTRUE.

Updated patch based on @david-arm's review.

Feb 22 2022, 11:58 AM · Restricted Project

Feb 18 2022

cameron.mcinally updated the diff for D120152: [AArch64][SVE] Match VLS all-1's masks to PTRUE.

Fix formatting for the Lint bots.

Feb 18 2022, 1:53 PM · Restricted Project
cameron.mcinally added a comment to D120152: [AArch64][SVE] Match VLS all-1's masks to PTRUE.

Why didn't or cannot InstCombine catch this?

Feb 18 2022, 1:28 PM · Restricted Project
cameron.mcinally updated the diff for D120152: [AArch64][SVE] Match VLS all-1's masks to PTRUE.

Updated Diff.

Feb 18 2022, 12:04 PM · Restricted Project
cameron.mcinally requested review of D120152: [AArch64][SVE] Match VLS all-1's masks to PTRUE.
Feb 18 2022, 11:42 AM · Restricted Project

Feb 9 2022

cameron.mcinally abandoned D119285: [SVE] Bail out of constructDup(...) optimization for fixed width vectors > 128 bits.

Ah, sorry for the noise. Abandoning this Diff...

Feb 9 2022, 8:44 AM · Restricted Project
cameron.mcinally added a comment to D119252: [AArch64][SVE] Fix selection failure during lowering of shuffle_vector.

Hi @paulwalker-arm, this bug was actually found in user-written code in Gromacs, although there was only one instance of it, I think. So it is something users may see, just not very often!

Feb 9 2022, 8:42 AM · Restricted Project

Feb 8 2022

cameron.mcinally requested review of D119285: [SVE] Bail out of constructDup(...) optimization for fixed width vectors > 128 bits.
Feb 8 2022, 2:10 PM · Restricted Project

Jan 25 2022

cameron.mcinally abandoned D118047: [SVE] Fix VLS selection error from performPostLD1Combine(...).

I believe this is a duplicate of D117674, which had not been reviewed yet so I've just pushed it along.

Jan 25 2022, 7:07 AM · Restricted Project

Jan 24 2022

cameron.mcinally updated the diff for D118047: [SVE] Fix VLS selection error from performPostLD1Combine(...).

Further reduced test case, but still not great.

Jan 24 2022, 2:45 PM · Restricted Project
cameron.mcinally requested review of D118047: [SVE] Fix VLS selection error from performPostLD1Combine(...).
Jan 24 2022, 8:06 AM · Restricted Project
cameron.mcinally added a comment to D117795: [AArch64] Add some missing strict FP vector lowering.

Is it possible to break the 4 subtasks into separate reviews?

Jan 24 2022, 7:31 AM · Restricted Project, Restricted Project

Nov 1 2021

cameron.mcinally added a comment to D112557: [SVE] Fix VLS FMA generation at CodeGenOpt::Aggressive.

Perhaps worth adding matching half/fp16 tests to sve-fixed-length-fp-fma.ll but otherwise looks good.

Nov 1 2021, 11:11 AM · Restricted Project
cameron.mcinally committed rG702fd3d323aa: [SVE] Fix VLS FMA matching for CodeGenOpt::Aggressive. (authored by cameron.mcinally).
[SVE] Fix VLS FMA matching for CodeGenOpt::Aggressive.
Nov 1 2021, 10:44 AM
cameron.mcinally closed D112557: [SVE] Fix VLS FMA generation at CodeGenOpt::Aggressive.
Nov 1 2021, 10:44 AM · Restricted Project
cameron.mcinally updated the summary of D112557: [SVE] Fix VLS FMA generation at CodeGenOpt::Aggressive.
Nov 1 2021, 8:47 AM · Restricted Project

Oct 30 2021

cameron.mcinally updated the diff for D112557: [SVE] Fix VLS FMA generation at CodeGenOpt::Aggressive.

Updated Diff for @paulwalker-arm's reviews...

Oct 30 2021, 1:10 PM · Restricted Project

Oct 26 2021

cameron.mcinally updated the diff for D112557: [SVE] Fix VLS FMA generation at CodeGenOpt::Aggressive.

Fix clang-format warning and add the missing '+' to "+sve".

Oct 26 2021, 11:36 AM · Restricted Project
cameron.mcinally added inline comments to D112557: [SVE] Fix VLS FMA generation at CodeGenOpt::Aggressive.
Oct 26 2021, 10:14 AM · Restricted Project
cameron.mcinally requested review of D112557: [SVE] Fix VLS FMA generation at CodeGenOpt::Aggressive.
Oct 26 2021, 9:41 AM · Restricted Project

Jan 29 2021

cameron.mcinally updated the diff for D94444: [RFC][Scalable] Add scalable shuffle intrinsic to extract evens from a pair of vectors.
Jan 29 2021, 1:49 PM · Restricted Project, Restricted Project
cameron.mcinally updated the diff for D94444: [RFC][Scalable] Add scalable shuffle intrinsic to extract evens from a pair of vectors.

[NOT READY FOR REVIEW]

Jan 29 2021, 1:48 PM · Restricted Project, Restricted Project

Jan 25 2021

cameron.mcinally added a comment to D94444: [RFC][Scalable] Add scalable shuffle intrinsic to extract evens from a pair of vectors.

In D94444#2497697, @paulwalker-arm wrote:
<A x Elt> llvm.experimental.vector.extract.elements(<B x Elt> %invec, i32 index, i32 stride)

Jan 25 2021, 2:46 PM · Restricted Project, Restricted Project
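
The quoted signature takes an index and a stride. The feed does not spell out the semantics, so the sketch below assumes one plausible reading: result element i is taken from position index + i * stride of the input. The function name extract_elements and the explicit count parameter (standing in for the result width A) are hypothetical and are not part of any LLVM API.

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def extract_elements(invec: Sequence[T], index: int, stride: int, count: int) -> List[T]:
    """Model of a strided extract (assumed semantics, not the LLVM definition).

    Returns `count` elements, where result[i] = invec[index + i * stride].
    """
    if stride <= 0:
        raise ValueError("stride must be positive")
    last = index + (count - 1) * stride
    if index < 0 or (count > 0 and last >= len(invec)):
        raise IndexError("strided access runs past the input vector")
    return [invec[index + i * stride] for i in range(count)]


# Evens and odds of an 8-element input, as a stride-2 extract.
evens = extract_elements(list(range(8)), index=0, stride=2, count=4)
odds = extract_elements(list(range(8)), index=1, stride=2, count=4)
```

Under this reading, an "extract evens" operation is just the index 0, stride 2 case, which may be why a single generic form was being discussed.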
cameron.mcinally added a comment to D94444: [RFC][Scalable] Add scalable shuffle intrinsic to extract evens from a pair of vectors.

Ok, I see where you are coming from now. LoopVectorize is keeping the shuffle result full by widening the load+shuffle to double wide. LV's double wide choice seems like a weird one, but I suppose if that sequence is codegen'd correctly, then it will work out.

Jan 25 2021, 8:15 AM · Restricted Project, Restricted Project

Jan 22 2021

cameron.mcinally added a comment to D94444: [RFC][Scalable] Add scalable shuffle intrinsic to extract evens from a pair of vectors.

In D94444#2497697, @paulwalker-arm wrote:
<A x Elt> llvm.experimental.vector.extract.elements(<B x Elt> %invec, i32 index, i32 stride)

Jan 22 2021, 8:18 AM · Restricted Project, Restricted Project

Jan 19 2021

cameron.mcinally updated subscribers of D94444: [RFC][Scalable] Add scalable shuffle intrinsic to extract evens from a pair of vectors.

Having said that, I wonder if we should revisit the idea of allowing shuffle vectors to accept step vector masks?

Jan 19 2021, 1:21 PM · Restricted Project, Restricted Project

Jan 15 2021

cameron.mcinally added a comment to D94708: [IR] Introduce llvm.experimental.vector.splice intrinsic.

In D94444, @paulwalker-arm proposed a more generic extract vector intrinsic that accepts an index and stride. Now I'm wondering if we should just have a generic scalable shuffle vector intrinsic to handle all these operations under one intrinsic.

Jan 15 2021, 8:27 AM · Restricted Project

Jan 14 2021

cameron.mcinally added inline comments to D94708: [IR] Introduce llvm.experimental.vector.splice intrinsic.
Jan 14 2021, 12:37 PM · Restricted Project
cameron.mcinally added a comment to D94444: [RFC][Scalable] Add scalable shuffle intrinsic to extract evens from a pair of vectors.

A bit of a flyby review as I'm still on holidays but to my mind many of the restrictions being proposed for the new intrinsic seem purely down to the design decision of splitting the input vector across two operands. I understand this is how the underlying instructions work for SVE but that does not seem like a good enough reason to compromise the IR.

So my first questions are whether the IR and ISD interfaces need to match and from an IR point of view what is the expected usage?

Jan 14 2021, 8:10 AM · Restricted Project, Restricted Project

Jan 13 2021

cameron.mcinally added inline comments to D94444: [RFC][Scalable] Add scalable shuffle intrinsic to extract evens from a pair of vectors.
Jan 13 2021, 10:25 AM · Restricted Project, Restricted Project
cameron.mcinally updated the diff for D94444: [RFC][Scalable] Add scalable shuffle intrinsic to extract evens from a pair of vectors.

Add known minimum number of elements restrictions...

Jan 13 2021, 10:24 AM · Restricted Project, Restricted Project

Jan 12 2021

cameron.mcinally updated the diff for D94444: [RFC][Scalable] Add scalable shuffle intrinsic to extract evens from a pair of vectors.

Updated to @david-arm's suggested naming scheme...

Jan 12 2021, 1:49 PM · Restricted Project, Restricted Project
cameron.mcinally accepted D94504: [SVE] Add ISel pattern for addvl.

I'm assuming scheduling the new addvls closer to their uses is a register pressure win?

Jan 12 2021, 1:14 PM · Restricted Project
cameron.mcinally updated the diff for D94444: [RFC][Scalable] Add scalable shuffle intrinsic to extract evens from a pair of vectors.

Address some of @sdesmalen's comments, but deferring name changes...

Jan 12 2021, 9:02 AM · Restricted Project, Restricted Project
cameron.mcinally added a comment to D94444: [RFC][Scalable] Add scalable shuffle intrinsic to extract evens from a pair of vectors.

Thanks for creating this patch!

I chose to extract the even elements from a pair of vectors (full vector result), rather than a single vector (1/2 width vector result). This is in line with existing fixed shuffle vectors. And can be extended to accept an undef argument if needed. The motivation behind this decision was that we'd want the result vector to be a full vector for performance reasons. It would also map well to SVE's LD2 and UZP1.

Are you also planning to add intrinsics for interleaving?

Jan 12 2021, 7:30 AM · Restricted Project, Restricted Project
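
The comment above describes extracting the even elements from a pair of vectors so that the result is a full-width vector. A minimal Python model of that behaviour, assuming UZP1-style semantics (concatenate the two inputs, keep the even-numbered elements), could look like this. The function name extract_evens_pair is hypothetical, and plain lists stand in for vectors.

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def extract_evens_pair(a: Sequence[T], b: Sequence[T]) -> List[T]:
    """Keep the even-indexed elements of the concatenation a ++ b.

    With two inputs of n elements each, the result also has n elements,
    i.e. it is as wide as either input (the "full vector result").
    """
    if len(a) != len(b):
        raise ValueError("both inputs must have the same number of elements")
    return (list(a) + list(b))[0::2]


pair_result = extract_evens_pair([0, 1, 2, 3], [4, 5, 6, 7])
```

Extracting from a single vector would instead yield a half-width result, which is the alternative the comment argues against on performance grounds.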

Jan 11 2021

cameron.mcinally requested review of D94444: [RFC][Scalable] Add scalable shuffle intrinsic to extract evens from a pair of vectors.
Jan 11 2021, 12:51 PM · Restricted Project, Restricted Project

Jan 7 2021

cameron.mcinally added a comment to D94193: [SVE] Unpacked scalable floating point ZIP/UZP/TRN.

Please can you add entries for nxv2f16 as well? That way all the legal fp types are covered.

Jan 7 2021, 7:58 AM · Restricted Project
cameron.mcinally committed rGf4013359b3da: [SVE] Add unpacked scalable floating point ZIP/UZP/TRN patterns (authored by cameron.mcinally).
[SVE] Add unpacked scalable floating point ZIP/UZP/TRN patterns
Jan 7 2021, 7:57 AM
cameron.mcinally closed D94193: [SVE] Unpacked scalable floating point ZIP/UZP/TRN.
Jan 7 2021, 7:57 AM · Restricted Project

Jan 6 2021

cameron.mcinally requested review of D94193: [SVE] Unpacked scalable floating point ZIP/UZP/TRN.
Jan 6 2021, 1:30 PM · Restricted Project

Jan 4 2021

cameron.mcinally committed rG92be640bd7d4: [FPEnv][AMDGPU] Disable FSUB(-0,X)->FNEG(X) DAGCombine when subnormals are… (authored by cameron.mcinally).
[FPEnv][AMDGPU] Disable FSUB(-0,X)->FNEG(X) DAGCombine when subnormals are…
Jan 4 2021, 12:44 PM
cameron.mcinally closed D93243: [FPEnv][AMDGPU] Disable FSUB(-0,X)->FNEG(X) DAGCombine when subnormals are flushed.
Jan 4 2021, 12:44 PM · Restricted Project
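
As a rough illustration of why FSUB(-0,X) and FNEG(X) can differ when subnormals are flushed, the sketch below simulates flush-to-zero in Python. It assumes that the subtraction flushes its subnormal inputs and result to a same-signed zero while the negation only flips the sign bit. The helpers flush, fsub_ftz and fneg are hypothetical models, not LLVM or AMDGPU functions.

```python
import math
import sys

MIN_NORMAL = sys.float_info.min  # smallest positive normal double


def flush(x: float) -> float:
    """Assumed flush-to-zero model: subnormals become a same-signed zero."""
    if x != 0.0 and abs(x) < MIN_NORMAL:
        return math.copysign(0.0, x)
    return x


def fsub_ftz(a: float, b: float) -> float:
    """Subtraction that flushes subnormal inputs and a subnormal result."""
    return flush(flush(a) - flush(b))


def fneg(x: float) -> float:
    """Negation modelled as a pure sign flip, with no flushing."""
    return -x


subnormal = 5e-324                        # smallest positive subnormal double
via_fsub = fsub_ftz(-0.0, subnormal)      # input flushed: -0.0 - 0.0 gives -0.0
via_fneg = fneg(subnormal)                # still a (negative) subnormal
```

For normal inputs the two forms agree in this model; they diverge only once a subnormal operand is involved.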
cameron.mcinally accepted D93607: [SVE] Lower vector CTLZ, CTPOP and CTTZ operations..

LGTM

Jan 4 2021, 7:57 AM · Restricted Project
cameron.mcinally added a comment to D93243: [FPEnv][AMDGPU] Disable FSUB(-0,X)->FNEG(X) DAGCombine when subnormals are flushed.

Ping.

Jan 4 2021, 7:49 AM · Restricted Project

Dec 26 2020

cameron.mcinally added inline comments to D93607: [SVE] Lower vector CTLZ, CTPOP and CTTZ operations..
Dec 26 2020, 9:46 AM · Restricted Project

Dec 17 2020

cameron.mcinally updated the diff for D93243: [FPEnv][AMDGPU] Disable FSUB(-0,X)->FNEG(X) DAGCombine when subnormals are flushed.

Add FIXME comment.

Dec 17 2020, 8:51 AM · Restricted Project

Dec 15 2020

cameron.mcinally added inline comments to D93243: [FPEnv][AMDGPU] Disable FSUB(-0,X)->FNEG(X) DAGCombine when subnormals are flushed.
Dec 15 2020, 7:41 AM · Restricted Project

Dec 14 2020

cameron.mcinally retitled D93243: [FPEnv][AMDGPU] Disable FSUB(-0,X)->FNEG(X) DAGCombine when subnormals are flushed from [FPEnv][AMDGPU] Disable FSUB(-0,X)->FNEG(X) DAGCombine when subnormals are preserved to [FPEnv][AMDGPU] Disable FSUB(-0,X)->FNEG(X) DAGCombine when subnormals are flushed.
Dec 14 2020, 2:02 PM · Restricted Project
cameron.mcinally requested review of D93243: [FPEnv][AMDGPU] Disable FSUB(-0,X)->FNEG(X) DAGCombine when subnormals are flushed.
Dec 14 2020, 2:01 PM · Restricted Project

Dec 11 2020

cameron.mcinally accepted D93050: [SVE][CodeGen] Lower scalable floating-point vector reductions.

LGTM

Dec 11 2020, 7:17 AM · Restricted Project

Dec 10 2020

cameron.mcinally added a comment to D93050: [SVE][CodeGen] Lower scalable floating-point vector reductions.

LGTM with one nit below...

Dec 10 2020, 1:29 PM · Restricted Project

Dec 4 2020

cameron.mcinally accepted D91362: [SelectionDAG] Add llvm.vector.{extract,insert} intrinsics.

I think @ctetreau's "first class citizen" argument on the RFC has merit though. But this patch is a good first step if we're not ready to extend ShuffleVector yet. I personally would like to see ShuffleVector extended eventually, since it would be easier to optimize.

Dec 4 2020, 9:54 AM · Restricted Project

Dec 1 2020

cameron.mcinally added a comment to D91362: [SelectionDAG] Add llvm.vector.{extract,insert} intrinsics.

Do we need to protect against mismatched element types? Or does legalization handle those exts/truncs?

Dec 1 2020, 8:24 AM · Restricted Project

Nov 12 2020

cameron.mcinally added inline comments to D91362: [SelectionDAG] Add llvm.vector.{extract,insert} intrinsics.
Nov 12 2020, 10:17 AM · Restricted Project

Nov 10 2020

cameron.mcinally added inline comments to D91077: [LoopVectorizer][SVE] Vectorize a simple loop with with a scalable VF..
Nov 10 2020, 7:35 AM · Restricted Project

Nov 4 2020

cameron.mcinally committed rGc126eb7529be: [SelectionDAG] Add legalizations for VECREDUCE_SEQ_FMUL (authored by cameron.mcinally).
[SelectionDAG] Add legalizations for VECREDUCE_SEQ_FMUL
Nov 4 2020, 12:21 PM
cameron.mcinally closed D90644: [Legalizer][ARM][AArch64] Add legalizations for VECREDUCE_SEQ_FMUL.
Nov 4 2020, 12:20 PM · Restricted Project

Nov 3 2020

cameron.mcinally added a comment to D90644: [Legalizer][ARM][AArch64] Add legalizations for VECREDUCE_SEQ_FMUL.
  • In llvm/test/CodeGen/ARM/vecreduce-fmul-legalization-strict.ll and llvm/test/CodeGen/AArch64/vecreduce-fmul-legalization-strict.ll, use 1.0 instead of 0.0 as the start value. That was probably a copy&paste mistake from fadds.

That caught my eye too, but the 0.0 seemed okay since we can't peep this without NSZ (-0*0) and NNAN (0*NaN). Changing it to 1.0 isn't a big deal though...

Nov 3 2020, 2:11 PM · Restricted Project
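
The two cases named in the comment above, NSZ (-0*0) and NNAN (0*NaN), can be checked with ordinary IEEE 754 doubles. This is only a small demonstration that multiplying by 0.0 does not always produce +0.0, which is why such a product cannot be folded away without those fast-math flags. The helper sign_bit is introduced here for illustration.

```python
import math


def sign_bit(x: float) -> bool:
    """True when x carries a negative sign, including -0.0."""
    return math.copysign(1.0, x) < 0.0


# NSZ case: a negative operand times 0.0 gives -0.0, not +0.0.
neg_zero_product = -1.0 * 0.0

# NNAN case: 0.0 times NaN stays NaN rather than collapsing to zero.
nan_product = 0.0 * math.nan

# By contrast, 1.0 is the multiplicative identity for ordinary values.
identity_product = 1.0 * 3.5
```

With 1.0 as the start value, the reduction result depends only on the vector elements, which is the usual expectation for a multiply reduction.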
cameron.mcinally added a comment to D90644: [Legalizer][ARM][AArch64] Add legalizations for VECREDUCE_SEQ_FMUL.
  • In llvm/test/CodeGen/ARM/vecreduce-fmul-legalization-strict.ll and llvm/test/CodeGen/AArch64/vecreduce-fmul-legalization-strict.ll, use 1.0 instead of 0.0 as the start value. That was probably a copy&paste mistake from fadds.
Nov 3 2020, 1:45 PM · Restricted Project
cameron.mcinally updated the diff for D90644: [Legalizer][ARM][AArch64] Add legalizations for VECREDUCE_SEQ_FMUL.

Reformat to appease pre-merge checks...

Nov 3 2020, 7:40 AM · Restricted Project

Nov 2 2020

cameron.mcinally requested review of D90644: [Legalizer][ARM][AArch64] Add legalizations for VECREDUCE_SEQ_FMUL.
Nov 2 2020, 1:45 PM · Restricted Project

Oct 30 2020

cameron.mcinally committed rGdda1e74b58bd: [Legalize] Add legalizations for VECREDUCE_SEQ_FADD (authored by cameron.mcinally).
[Legalize] Add legalizations for VECREDUCE_SEQ_FADD
Oct 30 2020, 2:03 PM
cameron.mcinally closed D90247: [AArch64] Add legalizations for VECREDUCE_SEQ_FADD .
Oct 30 2020, 2:03 PM · Restricted Project
cameron.mcinally added inline comments to D90247: [AArch64] Add legalizations for VECREDUCE_SEQ_FADD .
Oct 30 2020, 2:02 PM · Restricted Project
cameron.mcinally added inline comments to D90247: [AArch64] Add legalizations for VECREDUCE_SEQ_FADD .
Oct 30 2020, 12:48 PM · Restricted Project
cameron.mcinally updated the diff for D90247: [AArch64] Add legalizations for VECREDUCE_SEQ_FADD .

Update patch based on @nikic's comments...

Oct 30 2020, 12:48 PM · Restricted Project

Oct 28 2020

cameron.mcinally updated the diff for D90247: [AArch64] Add legalizations for VECREDUCE_SEQ_FADD .

Updated patch with, I think, all the needed legalizations.

Oct 28 2020, 11:59 AM · Restricted Project

Oct 27 2020

cameron.mcinally added a comment to D90247: [AArch64] Add legalizations for VECREDUCE_SEQ_FADD .

Comment from ARM/ARMISelLowering.cpp:

Oct 27 2020, 2:41 PM · Restricted Project
cameron.mcinally added a comment to D90247: [AArch64] Add legalizations for VECREDUCE_SEQ_FADD .

Ah, I see it in ARM/. That will work...

Oct 27 2020, 11:57 AM · Restricted Project
cameron.mcinally updated the diff for D90247: [AArch64] Add legalizations for VECREDUCE_SEQ_FADD .

Update 'neutral' element to -0.0.

Oct 27 2020, 11:45 AM · Restricted Project
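
The switch to -0.0 matches IEEE 754 addition under round-to-nearest, where -0.0 is the additive identity and +0.0 is not: adding +0.0 to -0.0 loses the sign. The sketch below shows this with a sequential add reduction. The helper names seq_fadd and is_neg_zero are hypothetical; the code models the arithmetic only, not the legalizer.

```python
import math
from typing import Iterable


def is_neg_zero(x: float) -> bool:
    """True only for negative zero."""
    return x == 0.0 and math.copysign(1.0, x) < 0.0


def seq_fadd(start: float, elems: Iterable[float]) -> float:
    """Strictly ordered floating-point add reduction."""
    acc = start
    for e in elems:
        acc = acc + e
    return acc


with_pos_zero = seq_fadd(0.0, [-0.0])    # +0.0: the sign of the element is lost
with_neg_zero = seq_fadd(-0.0, [-0.0])   # -0.0: the element passes through unchanged
ordinary_sum = seq_fadd(-0.0, [1.5, 2.0])
```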