RolandF (Roland Froese)
User

Projects

User does not belong to any projects.

User Details

User Since
May 16 2016, 11:34 AM (384 w, 2 d)

Recent Activity

Aug 14 2023

RolandF committed rG4d425f86632f: [PowerPC] vector cost model add cost to extract i1 (authored by RolandF).
Aug 14 2023, 2:04 PM · Restricted Project, Restricted Project
RolandF closed D155876: [PowerPC] vector cost model add cost to extract i1.
Aug 14 2023, 2:04 PM · Restricted Project, Restricted Project

Jul 26 2023

RolandF added inline comments to D155876: [PowerPC] vector cost model add cost to extract i1.
Jul 26 2023, 11:41 AM · Restricted Project, Restricted Project

Jul 24 2023

RolandF added inline comments to D155876: [PowerPC] vector cost model add cost to extract i1.
Jul 24 2023, 8:17 AM · Restricted Project, Restricted Project

Jul 21 2023

RolandF updated the diff for D155876: [PowerPC] vector cost model add cost to extract i1.

Oops C++ fail and missed some tests.

Jul 21 2023, 1:32 PM · Restricted Project, Restricted Project

Jul 20 2023

RolandF added reviewers for D155876: [PowerPC] vector cost model add cost to extract i1: shchenz, stefanp.
Jul 20 2023, 11:56 AM · Restricted Project, Restricted Project
RolandF requested review of D155876: [PowerPC] vector cost model add cost to extract i1.
Jul 20 2023, 11:53 AM · Restricted Project, Restricted Project

Jan 5 2023

RolandF added inline comments to D138899: [DAGCombiner] handle more store value forwarding.
Jan 5 2023, 10:05 AM · Restricted Project, Restricted Project

Dec 14 2022

RolandF added inline comments to D138899: [DAGCombiner] handle more store value forwarding.
Dec 14 2022, 8:26 AM · Restricted Project, Restricted Project

Dec 12 2022

RolandF added inline comments to D138899: [DAGCombiner] handle more store value forwarding.
Dec 12 2022, 10:07 AM · Restricted Project, Restricted Project

Dec 7 2022

RolandF added a comment to D138899: [DAGCombiner] handle more store value forwarding.

There is an existing place in codegen to handle forwarding a store value to the corresponding load - ForwardStoreValueToDirectLoad in lib/CodeGen/SelectionDAG/DAGCombiner.cpp. Handling this in codegen would not require putting the code in multiple places. The code there is relatively simple - it just looks to see if the thing on the chain immediately before the load is the setting store. For the first case in byval-lhs.ll there is a CALLSEQ_START on the chain in between the load and the store. Maybe it is possible to look past that in the chain to see the store. Since the load is for a call there may be a register copy required to replace the load. Where there is a sequence of multiple stores followed by multiple loads it would require looking back in the chain past loads and past stores to fixed stack locations that do not overlap. I don't know if that is allowed, but in theory it could work.
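The chain walk proposed in this comment can be sketched outside of LLVM as a toy model. This is only an illustration under stated assumptions: ChainNode, Kind, and findForwardedValue are hypothetical names, not LLVM's SDNode or DAGCombiner API, and the model assumes every memory access is to a fixed stack slot with a known offset and size. Whether the real DAGCombiner may look past these nodes is exactly the open question raised above.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <optional>
#include <vector>

// Toy stand-ins for nodes on a chain; these are NOT LLVM's SDNode classes.
enum class Kind { Store, Load, CallSeqStart, Other };

struct ChainNode {
  Kind kind;
  int64_t offset = 0; // fixed stack offset in bytes (memory nodes only)
  int64_t size = 0;   // access size in bytes (memory nodes only)
  int valueId = -1;   // id of the stored value (Store only)
};

// True if the two byte ranges [offset, offset + size) intersect.
static bool overlaps(const ChainNode &a, const ChainNode &b) {
  return a.offset < b.offset + b.size && b.offset < a.offset + a.size;
}

// Walks backwards from the load at loadIdx, looking past CALLSEQ_START,
// other loads, and stores to non-overlapping fixed stack slots. Returns the
// id of the value to forward, or nullopt if forwarding is not provably safe.
std::optional<int> findForwardedValue(const std::vector<ChainNode> &chain,
                                      std::size_t loadIdx) {
  assert(loadIdx < chain.size() && chain[loadIdx].kind == Kind::Load);
  const ChainNode &load = chain[loadIdx];
  for (std::size_t i = loadIdx; i-- > 0;) {
    const ChainNode &node = chain[i];
    switch (node.kind) {
    case Kind::CallSeqStart:
    case Kind::Load:
      continue; // does not write memory in this toy model
    case Kind::Store:
      if (node.offset == load.offset && node.size == load.size)
        return node.valueId; // exact match: this is the setting store
      if (overlaps(node, load))
        return std::nullopt; // partial overlap: give up
      continue;              // disjoint slot: keep looking
    case Kind::Other:
      return std::nullopt; // unknown side effects: give up
    }
  }
  return std::nullopt;
}
```

With a chain of two stores, a CALLSEQ_START, and two loads of the same slots, each load resolves to the value of its matching store; a store that only partially overlaps the load stops the search.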

Dec 7 2022, 12:09 PM · Restricted Project, Restricted Project

Nov 14 2022

RolandF accepted D137423: [PowerPC] make expensive mflr be away from its user in the function prologue.

LGTM

Nov 14 2022, 10:05 AM · Restricted Project, Restricted Project

Nov 9 2022

RolandF accepted D137612: [PowerPC] add a new subtarget feature FastMFLR.

LGTM

Nov 9 2022, 7:13 AM · Restricted Project, Restricted Project

Nov 8 2022

RolandF added inline comments to D137612: [PowerPC] add a new subtarget feature FastMFLR.
Nov 8 2022, 12:48 PM · Restricted Project, Restricted Project

Nov 7 2022

RolandF added inline comments to D137423: [PowerPC] make expensive mflr be away from its user in the function prologue.
Nov 7 2022, 7:59 AM · Restricted Project, Restricted Project

Oct 3 2022

RolandF accepted D134783: [PowerPC] Fix a number of inefficiencies and issues with atomic code gen.

LGTM

Oct 3 2022, 1:33 PM · Restricted Project, Restricted Project

Sep 14 2022

RolandF committed rG207228c1d650: [DAGCombiner] More load-store forwarding for big-endian (authored by RolandF).
Sep 14 2022, 12:37 PM · Restricted Project, Restricted Project
RolandF closed D130115: [DAGCombiner] More load-store forwarding for big-endian.
Sep 14 2022, 12:37 PM · Restricted Project, Restricted Project

Aug 9 2022

RolandF updated the diff for D130115: [DAGCombiner] More load-store forwarding for big-endian.

Update diff to show diffs with new tests.

Aug 9 2022, 2:49 PM · Restricted Project, Restricted Project

Aug 8 2022

RolandF committed rGd6bd3d373e89: [DAGCombiner] Add some BE store forwarding tests; NFC (authored by RolandF).
Aug 8 2022, 1:33 PM · Restricted Project, Restricted Project

Aug 5 2022

RolandF added a comment to D130115: [DAGCombiner] More load-store forwarding for big-endian.

@t.p.northover , any concerns?

Aug 5 2022, 7:29 AM · Restricted Project, Restricted Project

Jul 25 2022

RolandF added a reviewer for D130115: [DAGCombiner] More load-store forwarding for big-endian: nemanjai.
Jul 25 2022, 12:56 PM · Restricted Project, Restricted Project

Jul 21 2022

RolandF updated the diff for D130115: [DAGCombiner] More load-store forwarding for big-endian.

Forgot to include the new tests.

Jul 21 2022, 12:42 PM · Restricted Project, Restricted Project

Jul 20 2022

RolandF added a reviewer for D130115: [DAGCombiner] More load-store forwarding for big-endian: sfertile.
Jul 20 2022, 7:30 AM · Restricted Project, Restricted Project

Jul 19 2022

RolandF requested review of D130115: [DAGCombiner] More load-store forwarding for big-endian.
Jul 19 2022, 12:52 PM · Restricted Project, Restricted Project

Dec 1 2021

RolandF accepted D109417: Cost model for VPMemory operations on PowerPC..

LGTM

Dec 1 2021, 12:02 PM · Restricted Project

Nov 30 2021

RolandF added inline comments to D109417: Cost model for VPMemory operations on PowerPC..
Nov 30 2021, 1:14 PM · Restricted Project

Nov 29 2021

RolandF added inline comments to D109417: Cost model for VPMemory operations on PowerPC..
Nov 29 2021, 2:51 PM · Restricted Project

Nov 1 2021

RolandF accepted D109416: getVPMemoryOpCost interface.

LGTM

Nov 1 2021, 8:07 AM · Restricted Project

Oct 28 2021

RolandF added inline comments to D109417: Cost model for VPMemory operations on PowerPC..
Oct 28 2021, 12:12 PM · Restricted Project
RolandF added inline comments to D109416: getVPMemoryOpCost interface.
Oct 28 2021, 11:58 AM · Restricted Project

Oct 20 2021

RolandF requested changes to D109417: Cost model for VPMemory operations on PowerPC..
Oct 20 2021, 12:02 PM · Restricted Project
RolandF added inline comments to D109379: Implementation of PPC lowering for vp_load/vp_store with no mask.
Oct 20 2021, 8:09 AM · Restricted Project

Oct 18 2021

RolandF added inline comments to D109416: getVPMemoryOpCost interface.
Oct 18 2021, 12:33 PM · Restricted Project
RolandF added a comment to D109416: getVPMemoryOpCost interface.

Is this still needed?

Yes, this patch establishes the target interface that is implemented by D109417.

Oct 18 2021, 11:23 AM · Restricted Project

Oct 15 2021

RolandF added inline comments to D109416: getVPMemoryOpCost interface.
Oct 15 2021, 12:15 PM · Restricted Project
RolandF added a comment to D109416: getVPMemoryOpCost interface.

Is this still needed?

Oct 15 2021, 12:09 PM · Restricted Project

Oct 12 2021

RolandF committed rG28e648b29e84: [PowerPC] Simplify PPC codegen test pre-inc-disable.ll (authored by RolandF).
Oct 12 2021, 1:13 PM
RolandF closed D111333: [PowerPC] Simplify PPC codegen test pre-inc-disable.ll.
Oct 12 2021, 1:13 PM · Restricted Project

Oct 7 2021

RolandF requested review of D111333: [PowerPC] Simplify PPC codegen test pre-inc-disable.ll.
Oct 7 2021, 12:03 PM · Restricted Project

Jan 22 2021

RolandF committed rG99a0aa07e9f4: [Analysis] Support AIX vec_malloc routines (authored by scui).
Jan 22 2021, 1:05 PM
RolandF closed D94710: [Analysis] Support AIX vec_malloc routines.
Jan 22 2021, 1:04 PM · Restricted Project
RolandF accepted D94710: [Analysis] Support AIX vec_malloc routines.

LGTM

Jan 22 2021, 7:53 AM · Restricted Project

Jan 21 2021

RolandF added inline comments to D94710: [Analysis] Support AIX vec_malloc routines.
Jan 21 2021, 2:54 PM · Restricted Project
RolandF added a comment to D94710: [Analysis] Support AIX vec_malloc routines.

Test case?

Jan 21 2021, 12:22 PM · Restricted Project

Aug 24 2020

RolandF committed rGb6d7ed469f2f: [PowerPC] Extend custom lower of vector truncate to handle wider input (authored by RolandF).
Aug 24 2020, 12:36 PM
RolandF closed D68035: [PowerPC] Extend custom lower of vector truncate to handle wider input.
Aug 24 2020, 12:36 PM · Restricted Project

Aug 11 2020

RolandF added a comment to D68035: [PowerPC] Extend custom lower of vector truncate to handle wider input.

Ping?

Aug 11 2020, 7:08 AM · Restricted Project

Jul 22 2020

RolandF added a comment to D68035: [PowerPC] Extend custom lower of vector truncate to handle wider input.

Resolve comment.

Jul 22 2020, 3:04 PM · Restricted Project
RolandF added a comment to D68035: [PowerPC] Extend custom lower of vector truncate to handle wider input.

Resolve some comments.

Jul 22 2020, 2:58 PM · Restricted Project
RolandF updated the diff for D68035: [PowerPC] Extend custom lower of vector truncate to handle wider input.

Improved testing and tightened checks.

Jul 22 2020, 2:55 PM · Restricted Project

Jun 16 2020

RolandF accepted D77448: [PowerPC] Canonicalize shuffles to match more single-instruction masks on LE.

LGTM

Jun 16 2020, 7:42 AM · Restricted Project

May 19 2020

RolandF added inline comments to D77448: [PowerPC] Canonicalize shuffles to match more single-instruction masks on LE.
May 19 2020, 10:22 AM · Restricted Project

May 15 2020

RolandF added inline comments to D77448: [PowerPC] Canonicalize shuffles to match more single-instruction masks on LE.
May 15 2020, 1:03 PM · Restricted Project

May 14 2020

RolandF added inline comments to D77448: [PowerPC] Canonicalize shuffles to match more single-instruction masks on LE.
May 14 2020, 3:14 PM · Restricted Project

Oct 15 2019

RolandF updated the diff for D68035: [PowerPC] Extend custom lower of vector truncate to handle wider input.

Update test to use CHECK-COUNT.

Oct 15 2019, 9:42 AM · Restricted Project

Sep 27 2019

RolandF updated the diff for D68035: [PowerPC] Extend custom lower of vector truncate to handle wider input.

Moved the tests to a separate file and added hand-written checks which include the permute control data.

Sep 27 2019, 8:10 AM · Restricted Project

Sep 25 2019

RolandF created D68035: [PowerPC] Extend custom lower of vector truncate to handle wider input.
Sep 25 2019, 9:44 AM · Restricted Project

Aug 26 2019

RolandF committed rG18db4e9ae151: Recommit [PowerPC] Update P9 vector costs for insert/extract (authored by RolandF).
Aug 26 2019, 12:28 PM

Aug 23 2019

RolandF committed rGb4051e57b104: [PowerPC] Expand v1i128 smin (authored by RolandF).
Aug 23 2019, 12:06 PM

Jul 18 2019

RolandF created D64960: [PowerPC] Expand v1i128 smin.
Jul 18 2019, 4:26 PM · Restricted Project

Jun 27 2019

RolandF committed rG9f7f5858fe46: Recommit [PowerPC] Update P9 vector costs for insert/extract element (authored by RolandF).
Jun 27 2019, 9:21 AM

Jun 24 2019

RolandF committed rGea08248b2bc9: [CodeGen] Add missing vector type legalization for ctlz_zero_undef (authored by RolandF).
Jun 24 2019, 12:29 PM

Jun 18 2019

RolandF updated the diff for D63463: [CodeGen] Add missing vector type legalization for ctlz_zero_undef.

Generated test checks with script.

Jun 18 2019, 3:55 PM · Restricted Project
RolandF updated the diff for D63463: [CodeGen] Add missing vector type legalization for ctlz_zero_undef.

Add X86 test.

Jun 18 2019, 10:47 AM · Restricted Project

Jun 17 2019

RolandF created D63463: [CodeGen] Add missing vector type legalization for ctlz_zero_undef.
Jun 17 2019, 4:06 PM · Restricted Project

May 13 2019

RolandF added a comment to D59514: [PGO] Profile guided code size optimization..

The following file and command should reproduce the failure we are seeing.

May 13 2019, 3:16 PM · Restricted Project

May 6 2019

RolandF added a comment to D59514: [PGO] Profile guided code size optimization..

This revision causes a traceback when compiling SPEC2017 523.xalancbmk_r with -O3 -m64 -mcpu=power9 -flto and PGO, on the -fprofile-use compile step for XMLDateTime.cpp. Can you please take a look? Let me know if you don't have access to the SPEC2017 source and need a reproducer. The traceback was as follows:

May 6 2019, 9:34 AM · Restricted Project

Apr 29 2019

RolandF committed rG728e13970088: [PowerPC] Try harder to avoid load/move-to VSR for partial vector loads (authored by RolandF).
Apr 29 2019, 2:12 PM

Apr 26 2019

RolandF committed rG4b17772b9ee7: [PowerPC] Update P9 vector costs for insert/extract element (authored by RolandF).
Apr 26 2019, 9:15 AM

Apr 18 2019

RolandF committed rGa5dd08cac299: [PowerPC] Add some PPC vec cost tests to prep for D60160 NFC (authored by RolandF).
Apr 18 2019, 11:11 AM

Apr 2 2019

RolandF added reviewers for D60160: [PowerPC] Update P9 vector costs for insert/extract element: jsji, nemanjai.
Apr 2 2019, 3:48 PM · Restricted Project
RolandF created D60160: [PowerPC] Update P9 vector costs for insert/extract element.
Apr 2 2019, 3:45 PM · Restricted Project

Apr 1 2019

RolandF created D60102: [PowerPC] Try harder to avoid load/move-to VSR for partial vector loads.
Apr 1 2019, 3:41 PM · Restricted Project

Feb 11 2019

RolandF committed rG732fe22454da: [PowerPC] Avoid scalarization of vector truncate (authored by RolandF).
Feb 11 2019, 9:29 AM

Feb 8 2019

RolandF updated the diff for D56507: [PowerPC] Avoid scalarization of vector truncate.

Update diff to show test changes and respond to comments.

Feb 8 2019, 3:30 PM · Restricted Project

Feb 7 2019

Herald added a project to D56507: [PowerPC] Avoid scalarization of vector truncate: Restricted Project.
Feb 7 2019, 3:42 PM · Restricted Project

Feb 6 2019

RolandF added inline comments to D56507: [PowerPC] Avoid scalarization of vector truncate.
Feb 6 2019, 2:47 PM · Restricted Project
RolandF committed rG42f58498c563: [PowerPC] Add vector truncate test to prep for D56507 NFC (authored by RolandF).
Feb 6 2019, 1:36 PM

Jan 25 2019

RolandF added a comment to D55461: [PowerPC] Update Vector Costs for P9.

@nemanjai Yes, please, I do need it committed for me. Last time I promise!

Jan 25 2019, 2:45 PM
RolandF updated the diff for D55461: [PowerPC] Update Vector Costs for P9.

Deleted commented out code and added suggested comments.

Jan 25 2019, 11:51 AM

Jan 17 2019

RolandF added inline comments to D55461: [PowerPC] Update Vector Costs for P9.
Jan 17 2019, 10:24 AM
RolandF updated the diff for D55461: [PowerPC] Update Vector Costs for P9.

Added a new TTI method, vectorCostAdjustment, to consolidate the checks and the cost modification and make them uniform across all instruction types. Also added a direct cost model test.

Jan 17 2019, 10:02 AM

Jan 11 2019

RolandF added a comment to D55461: [PowerPC] Update Vector Costs for P9.

For memory ops it should be the same as arithmetic. The LSUs are a separate resource from the slices, but a vector load or store still consumes multiple LSUs (2x if aligned, 3x if not). I don't follow why there should be a problem with shuffle - I assume a shuffle will require one or more vector ALU ops.
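The LSU scaling described in this comment can be illustrated with a small toy function. It is a sketch only: vectorMemOpCost is a hypothetical name and is not part of LLVM's TargetTransformInfo interface or of the patch under review. The only facts taken from the comment are the multipliers (2x for aligned, 3x for unaligned vector loads and stores).

```cpp
#include <cassert>

// Hypothetical helper; not part of LLVM's TargetTransformInfo interface.
// Scales a per-operation base cost by the number of load/store units (LSUs)
// a vector memory access occupies: 2 if aligned, 3 if not.
int vectorMemOpCost(int baseCost, bool isAligned) {
  assert(baseCost >= 0 && "cost must be non-negative");
  const int lsusConsumed = isAligned ? 2 : 3;
  return baseCost * lsusConsumed;
}
```

Under this model an unaligned vector access is charged one and a half times the cost of an aligned one, which reflects resource consumption rather than latency.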

Jan 11 2019, 2:44 PM

Jan 9 2019

RolandF added a comment to D55461: [PowerPC] Update Vector Costs for P9.

I don't think adding a new TTI function is necessary, as I think the way I am modeling the costs here is what is expected. The following comment (from TargetTransformInfo.h: getArithmeticInstrCost()) may shed some light:

Jan 9 2019, 3:50 PM
RolandF updated the diff for D56507: [PowerPC] Avoid scalarization of vector truncate.

Fix comment.

Jan 9 2019, 3:28 PM · Restricted Project
RolandF created D56507: [PowerPC] Avoid scalarization of vector truncate.
Jan 9 2019, 11:17 AM · Restricted Project

Dec 18 2018

RolandF added a comment to D55461: [PowerPC] Update Vector Costs for P9.

I am confused by this patch.

In LoopVectorize.cpp, getInstructionCost returns the execution time cost of an instruction for a given vector.

So I think the cost model here should be related to latency.

Why do we need to take execution units into consideration for an execution time cost?
Doesn't considering execution units look more like a throughput cost model?

Dec 18 2018, 9:43 AM

Dec 7 2018

RolandF created D55461: [PowerPC] Update Vector Costs for P9.
Dec 7 2018, 4:02 PM

Nov 21 2018

RolandF added inline comments to D54663: [PowerPC] Complete the custom legalization of vector int to fp conversion.
Nov 21 2018, 3:19 PM

Nov 20 2018

RolandF added a comment to D54663: [PowerPC] Complete the custom legalization of vector int to fp conversion.

Nice! Glad to see this stuff get completed.

Nov 20 2018, 10:49 AM

Oct 23 2018

RolandF updated the diff for D53346: [PowerPC] Keep vector int to fp conversions in vector domain.

Address review comments - change variable names, support/test big-endian.

Oct 23 2018, 9:29 AM

Oct 16 2018

RolandF created D53346: [PowerPC] Keep vector int to fp conversions in vector domain.
Oct 16 2018, 4:09 PM

Aug 15 2018

RolandF added inline comments to D49879: [PowerPC] Generate Power9 extswsli extend sign and shift immediate instruction.
Aug 15 2018, 12:00 PM
RolandF updated the diff for D49879: [PowerPC] Generate Power9 extswsli extend sign and shift immediate instruction.

Changed == NULL to == nullptr.

Aug 15 2018, 11:57 AM

Aug 14 2018

RolandF updated the diff for D49879: [PowerPC] Generate Power9 extswsli extend sign and shift immediate instruction.

Updated to access the constant shift amount via a dynamic cast and to invert the if statement from block form to an early exit.

Aug 14 2018, 3:37 PM

Aug 8 2018

RolandF updated the diff for D49879: [PowerPC] Generate Power9 extswsli extend sign and shift immediate instruction.

Updated to incorporate review comments.

Aug 8 2018, 3:32 PM

Aug 2 2018

RolandF updated subscribers of D49879: [PowerPC] Generate Power9 extswsli extend sign and shift immediate instruction.
Aug 2 2018, 9:47 AM

Jul 26 2018

RolandF created D49879: [PowerPC] Generate Power9 extswsli extend sign and shift immediate instruction.
Jul 26 2018, 3:22 PM

Aug 24 2016

RolandF updated the diff for D23467: Generate -1/0/1 memcmp/strcmp result for z13.

Unlike the php test failure, which is dependent on the library memcmp behaviour and fails for both clang and gcc, the postgres test failure only happens with clang. The uuid regression test fails for clang, and the failure goes away if src/backend/utils/adt/uuid.c is compiled with gcc. The issue is the result for the uuid_internal_cmp function, which is just a 16 byte memcmp. The address of the function is stored in a table of builtins and only called by address, and the complexity of the application and test environment make it difficult to trace back to where this function is called, which may be many places. It might be desirable to just be compatible with gcc. This diff updates the approach to use the gcc-type IPM/SLL/SRA sequence. The sequence is first translated into a SELECT_CMP operation. This makes it easier to perform the memcmp compare to zero optimization (SRA kills the CC). It also should make it easier to add support for LOCHI, since the compare to zero code can be shared, and to get the promotion to 64-bit case with shared code.
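The compatibility concern can be illustrated in portable C++, independent of the z13 instruction sequence. This sketch does not model IPM/SLL/SRA or SELECT_CMP; memcmpSign and memEqual are hypothetical helper names. It only shows the two behaviours the comment distinguishes: a result collapsed to exactly -1, 0, or 1, and a compare-to-zero use where that normalization is unnecessary.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>

// Hypothetical wrapper: collapses whatever std::memcmp returns to exactly
// -1, 0, or 1, the result callers may rely on when built with gcc.
int memcmpSign(const void *lhs, const void *rhs, std::size_t len) {
  assert(lhs && rhs && "buffers must be non-null");
  const int raw = std::memcmp(lhs, rhs, len);
  return (raw > 0) - (raw < 0);
}

// When the caller only tests for equality, the sign normalization is not
// needed, which is what makes a compare-to-zero optimization possible.
bool memEqual(const void *lhs, const void *rhs, std::size_t len) {
  assert(lhs && rhs && "buffers must be non-null");
  return std::memcmp(lhs, rhs, len) == 0;
}
```

A 16-byte comparison such as the one in uuid_internal_cmp would go through memcmpSign in this model, while a caller that only checks for equality could use memEqual.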

Aug 24 2016, 12:16 PM