Page MenuHomePhabricator

jchlanda (Jakub Chlanda)
User

Projects

User does not belong to any projects.

User Details

User Since
Jun 23 2021, 6:39 AM (92 w, 3 d)

Recent Activity

Mon, Mar 27

jchlanda committed rGae3c981aa4b8: [NVPTX] Enforce half type support is present for builtins (authored by jchlanda).
[NVPTX] Enforce half type support is present for builtins
Mon, Mar 27, 11:48 PM · Restricted Project, Restricted Project, Restricted Project
jchlanda closed D146715: [NVPTX] Enforce half type support is present for builtins.
Mon, Mar 27, 11:48 PM · Restricted Project, Restricted Project, Restricted Project
jchlanda added inline comments to D146715: [NVPTX] Enforce half type support is present for builtins.
Mon, Mar 27, 2:37 AM · Restricted Project, Restricted Project, Restricted Project

Fri, Mar 24

jchlanda added inline comments to D146715: [NVPTX] Enforce half type support is present for builtins.
Fri, Mar 24, 4:35 AM · Restricted Project, Restricted Project, Restricted Project
jchlanda updated the diff for D146715: [NVPTX] Enforce half type support is present for builtins.

Remove duplicated switch over builtins.

Fri, Mar 24, 4:33 AM · Restricted Project, Restricted Project, Restricted Project

Thu, Mar 23

jchlanda requested review of D146715: [NVPTX] Enforce half type support is present for builtins.
Thu, Mar 23, 5:33 AM · Restricted Project, Restricted Project, Restricted Project

Wed, Mar 15

jchlanda committed rG7258317bade0: [NVPTX] Expose LDU builtins (authored by jchlanda).
[NVPTX] Expose LDU builtins
Wed, Mar 15, 1:44 AM · Restricted Project, Restricted Project, Restricted Project
jchlanda closed D145238: [NVPTX] Expose LDU builtins.
Wed, Mar 15, 1:44 AM · Restricted Project, Restricted Project, Restricted Project
jchlanda updated the diff for D145238: [NVPTX] Expose LDU builtins.

Use CHECK-LABEL.

Wed, Mar 15, 1:37 AM · Restricted Project, Restricted Project, Restricted Project

Tue, Mar 14

jchlanda added a comment to D145238: [NVPTX] Expose LDU builtins.

@tra is there anything else I should do for this patch? Thank you.

Tue, Mar 14, 4:04 AM · Restricted Project, Restricted Project, Restricted Project

Thu, Mar 9

jchlanda added inline comments to D145238: [NVPTX] Expose LDU builtins.
Thu, Mar 9, 12:03 AM · Restricted Project, Restricted Project, Restricted Project
jchlanda updated the diff for D145238: [NVPTX] Expose LDU builtins.

append -> +

Thu, Mar 9, 12:02 AM · Restricted Project, Restricted Project, Restricted Project

Tue, Mar 7

jchlanda added inline comments to D145238: [NVPTX] Expose LDU builtins.
Tue, Mar 7, 11:11 PM · Restricted Project, Restricted Project, Restricted Project
jchlanda updated the diff for D145238: [NVPTX] Expose LDU builtins.

Simplify the check for half tys support.

Tue, Mar 7, 11:10 PM · Restricted Project, Restricted Project, Restricted Project
jchlanda added inline comments to D145238: [NVPTX] Expose LDU builtins.
Tue, Mar 7, 1:41 AM · Restricted Project, Restricted Project, Restricted Project
jchlanda updated the diff for D145238: [NVPTX] Expose LDU builtins.

Address PR comments

Tue, Mar 7, 1:41 AM · Restricted Project, Restricted Project, Restricted Project

Fri, Mar 3

jchlanda requested review of D145238: [NVPTX] Expose LDU builtins.
Fri, Mar 3, 6:13 AM · Restricted Project, Restricted Project, Restricted Project
jchlanda committed rG71b06585857a: [NVPTX] Add f16 and v2f16 ldg builtins (authored by jchlanda).
[NVPTX] Add f16 and v2f16 ldg builtins
Fri, Mar 3, 3:49 AM · Restricted Project, Restricted Project, Restricted Project
jchlanda closed D144961: [NVPTX] Add f16 and v2f16 ldg builtins.
Fri, Mar 3, 3:49 AM · Restricted Project, Restricted Project, Restricted Project

Mar 1 2023

jchlanda added inline comments to D144961: [NVPTX] Add f16 and v2f16 ldg builtins.
Mar 1 2023, 12:09 AM · Restricted Project, Restricted Project, Restricted Project
jchlanda updated the diff for D144961: [NVPTX] Add f16 and v2f16 ldg builtins.

Add more verbose test names.

Mar 1 2023, 12:07 AM · Restricted Project, Restricted Project, Restricted Project

Feb 28 2023

jchlanda requested review of D144961: [NVPTX] Add f16 and v2f16 ldg builtins.
Feb 28 2023, 6:00 AM · Restricted Project, Restricted Project, Restricted Project

Oct 24 2022

jchlanda accepted D136311: [CUDA,NVPTX] Implement __bf16 support for NVPTX..

Looks good.

Oct 24 2022, 11:01 PM · Restricted Project, Restricted Project, Restricted Project

Oct 13 2022

jchlanda added a comment to D135428: [NVPTX] Support neg{.ftz} for f16 and f16x2.

In all honesty I don't know what the motivation for this was, it came to my attention as a DPC++ bug (https://github.com/intel/llvm/issues/6958). I do think that your point about flushing behavior is important and should be preserved.

FWIW, using negdirectly does not require a bitcast from Float16x2Regs to Int32Regs or Float16Regs to Int16Regs, as seen in the xor case.

Such bitcasts are essentially no-ops once ptxas is done with them. PTX ends up being a bit more verbose, but it usually has no impact on the SASS. FP and integers are kept in the same registers on the actual hardware. I've commented on the original bug.

Anyways, I think this change is fine. I just wanted to make sure I'm not missing something.

Oct 13 2022, 3:37 AM · Restricted Project, Restricted Project

Oct 11 2022

jchlanda added a comment to D135428: [NVPTX] Support neg{.ftz} for f16 and f16x2.

Just curious -- what prompts this change?

Does it buy us anything performance-wise? AFAICT llvm may be generating better code for gpus w/o fp16 support -- it does xor on 32-bit value w/o splitting it into 16-bit halfs. https://godbolt.org/z/Wjx7ceT75
Or is it needed to flush fp16 denormals consistently?

Oct 11 2022, 12:13 AM · Restricted Project, Restricted Project

Oct 7 2022

jchlanda requested review of D135428: [NVPTX] Support neg{.ftz} for f16 and f16x2.
Oct 7 2022, 12:50 AM · Restricted Project, Restricted Project

Aug 23 2022

jchlanda accepted D132516: [NVPTX] SHL.64 $r, 31 cannot be converted to a mulwide.s32.
Aug 23 2022, 11:11 PM · Restricted Project, Restricted Project

May 17 2022

jchlanda added inline comments to D124382: [Clang] Recognize target address space in superset calculation.
May 17 2022, 1:29 AM · Restricted Project, Restricted Project
jchlanda updated the diff for D124382: [Clang] Recognize target address space in superset calculation.

Use helper functions when handling address space values.

May 17 2022, 1:24 AM · Restricted Project, Restricted Project

May 12 2022

jchlanda accepted D125423: [Intrinsics] Fix `nvvm_prmt` intrinsic attributes.
May 12 2022, 12:39 AM · Restricted Project, Restricted Project

May 6 2022

jchlanda added inline comments to D124382: [Clang] Recognize target address space in superset calculation.
May 6 2022, 5:05 AM · Restricted Project, Restricted Project
jchlanda added a comment to D124382: [Clang] Recognize target address space in superset calculation.
May 6 2022, 4:57 AM · Restricted Project, Restricted Project
jchlanda added a comment to D124382: [Clang] Recognize target address space in superset calculation.

@Anastasia @tra apologies for a late reply, I'm catching up with the thread after holidays.

May 6 2022, 4:51 AM · Restricted Project, Restricted Project

Apr 28 2022

jchlanda added a comment to D124382: [Clang] Recognize target address space in superset calculation.
Apr 28 2022, 4:58 AM · Restricted Project, Restricted Project

Apr 25 2022

jchlanda added a comment to D124382: [Clang] Recognize target address space in superset calculation.

@tra @Naghasan @t4c1 you might find it interesting, a follow up from the discussion here: https://reviews.llvm.org/D112718

Apr 25 2022, 5:49 AM · Restricted Project, Restricted Project
jchlanda added reviewers for D124382: [Clang] Recognize target address space in superset calculation: tra, t4c1, Naghasan.
Apr 25 2022, 5:48 AM · Restricted Project, Restricted Project
jchlanda requested review of D124382: [Clang] Recognize target address space in superset calculation.
Apr 25 2022, 5:47 AM · Restricted Project, Restricted Project
jchlanda added a comment to D124171: [NVPTX] Support float <-> 2 x half bitcasts.

@tra would you be so kind and merge it in for me please?

Apr 25 2022, 4:36 AM · Restricted Project, Restricted Project

Apr 21 2022

jchlanda requested review of D124171: [NVPTX] Support float <-> 2 x half bitcasts.
Apr 21 2022, 6:54 AM · Restricted Project, Restricted Project

Mar 4 2022

jchlanda requested review of D120991: [NVPTX] Correctly set regs for neg, abs intrinsics.
Mar 4 2022, 5:35 AM · Restricted Project, Restricted Project

Mar 2 2022

Herald added a project to D117887: [NVPTX] Expose float tys min, max, abs, neg as builtins: Restricted Project.

@tra thank you for landing the patches, it seems that the clang part (builtin declarations and tests) have been dropped, only llvm dir changes made it through. Is there any way I could fix it (same goes for the other two patches in this stack)?

Somehow arc export | git apply didn't pick clang changes when I was transferring the patch from the phabricator. I'll re-fetch the patches and will land the missing pieces shortly.

I'm not sure how the you've submitted the patch to phabricator. In general, it works best when the patch is supplied as a gic commit diff, with the author metadata, etc.
Or via arc diff. See for the details. https://llvm.org/docs/Phabricator.html#phabricator-reviews

Mar 2 2022, 12:20 AM · Restricted Project, Restricted Project, Restricted Project

Mar 1 2022

jchlanda added a comment to D117887: [NVPTX] Expose float tys min, max, abs, neg as builtins.

@tra thank you for landing the patches, it seems that the clang part (builtin declarations and tests) have been dropped, only llvm dir changes made it through. Is there any way I could fix it (same goes for the other two patches in this stack)?

Mar 1 2022, 6:39 AM · Restricted Project, Restricted Project, Restricted Project

Feb 17 2022

jchlanda added a comment to D118977: [NVPTX] Add more FMA intriniscs/builtins.

@tra I've fixed the test failure (math-intrins.ll) the rest seems to be unrelated timeouts, would you be able to merge those patches in, as I don't have the commit access please? The same goes for https://reviews.llvm.org/D117887 and https://reviews.llvm.org/D119157 from Nicolas. Thanks.

Feb 17 2022, 4:31 AM · Restricted Project, Restricted Project

Feb 11 2022

jchlanda updated the diff for D118977: [NVPTX] Add more FMA intriniscs/builtins.

PTX/sm version tidy up.

Feb 11 2022, 9:07 AM · Restricted Project, Restricted Project

Feb 9 2022

jchlanda added inline comments to D117887: [NVPTX] Expose float tys min, max, abs, neg as builtins.
Feb 9 2022, 11:37 AM · Restricted Project, Restricted Project, Restricted Project
jchlanda added inline comments to D117887: [NVPTX] Expose float tys min, max, abs, neg as builtins.
Feb 9 2022, 11:20 AM · Restricted Project, Restricted Project, Restricted Project
jchlanda added inline comments to D118977: [NVPTX] Add more FMA intriniscs/builtins.
Feb 9 2022, 1:14 AM · Restricted Project, Restricted Project
jchlanda updated the diff for D118977: [NVPTX] Add more FMA intriniscs/builtins.

Tidy up FMA_TUPLE class.

Feb 9 2022, 1:12 AM · Restricted Project, Restricted Project

Feb 8 2022

jchlanda added a comment to D118977: [NVPTX] Add more FMA intriniscs/builtins.

Target ISA Notes
Requires sm_53 or higher.

I think we do need this constraint applied to the new builtins, too. Right now nothing stops using them on a GPU where they do not exist and that will likely crash the compiler when we fail to find a matching intrinsic.

Feb 8 2022, 8:40 AM · Restricted Project, Restricted Project
jchlanda updated the diff for D118977: [NVPTX] Add more FMA intriniscs/builtins.

Add sm/ptx version guard to f16{x2} builtins.

Feb 8 2022, 8:38 AM · Restricted Project, Restricted Project

Feb 6 2022

jchlanda added a comment to D118977: [NVPTX] Add more FMA intriniscs/builtins.

They all require PTX 7.0, SM_80.

According to https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#half-precision-floating-point-instructions-fma only fma.relu and bf16* variants require ptx70/sm80:

PTX ISA Notes
Introduced in PTX ISA version 4.2.

fma.relu.{f16, f16x2} and fma{.relu}.{bf16, bf16x2} introduced in PTX ISA version 7.0.

Target ISA Notes
Requires sm_53 or higher.

fma.relu.{f16, f16x2} and fma{.relu}.{bf16, bf16x2} require sm_80 or higher.
Feb 6 2022, 10:25 PM · Restricted Project, Restricted Project
jchlanda updated the diff for D118977: [NVPTX] Add more FMA intriniscs/builtins.

Set correct SM and PTX version.

Feb 6 2022, 10:22 PM · Restricted Project, Restricted Project

Feb 4 2022

jchlanda requested review of D118977: [NVPTX] Add more FMA intriniscs/builtins.
Feb 4 2022, 2:00 AM · Restricted Project, Restricted Project

Feb 2 2022

jchlanda added inline comments to D117887: [NVPTX] Expose float tys min, max, abs, neg as builtins.
Feb 2 2022, 12:34 AM · Restricted Project, Restricted Project, Restricted Project
jchlanda added a comment to D117887: [NVPTX] Expose float tys min, max, abs, neg as builtins.
Feb 2 2022, 12:30 AM · Restricted Project, Restricted Project, Restricted Project
jchlanda updated the diff for D117887: [NVPTX] Expose float tys min, max, abs, neg as builtins.

Added xorsign.abs variant and test.

Feb 2 2022, 12:29 AM · Restricted Project, Restricted Project, Restricted Project

Feb 1 2022

jchlanda added a comment to D117887: [NVPTX] Expose float tys min, max, abs, neg as builtins.
Feb 1 2022, 7:24 AM · Restricted Project, Restricted Project, Restricted Project

Jan 21 2022

jchlanda requested review of D117887: [NVPTX] Expose float tys min, max, abs, neg as builtins.
Jan 21 2022, 5:52 AM · Restricted Project, Restricted Project, Restricted Project

Jan 20 2022

jchlanda abandoned D117787: [NVPTX] abs, neg, min, max intrinsics for half types.

I need to update this to the upstream, as it was based on intel's fork. Abandoning for now.

Jan 20 2022, 7:49 AM · Restricted Project, Restricted Project
jchlanda requested review of D117787: [NVPTX] abs, neg, min, max intrinsics for half types.
Jan 20 2022, 7:25 AM · Restricted Project, Restricted Project

Jul 30 2021

jchlanda abandoned D104848: [cmake] Handled utils/unittests before projects.

add_dependencies actually should work fine, regardless of the ordering. The problem is you're checking for the existence of the target.

Stop doing that.

Jul 30 2021, 6:57 AM · Restricted Project

Jul 29 2021

jchlanda added a comment to D104848: [cmake] Handled utils/unittests before projects.

This is actually not safe. You're creating a new cached value based on the value of another cached value. Both can be set and updated independently of each other. You should (at least) add a hard error if SYCL_INCLUDE_TESTS=On and LLVM_INCLUDE_TESTS=Off.

Jul 29 2021, 9:31 AM · Restricted Project
jchlanda added a comment to D104848: [cmake] Handled utils/unittests before projects.

SYCL already predicates building unittests on the value of LLVM_INCLUDE_TESTS (https://github.com/intel/llvm/blob/sycl/sycl/CMakeLists.txt#L212). The problem is that at the point of time when check-sycl is built gtests are not known to CMake (even though SYCL code did check for the existence of the source), so it is impossible to add them to the list of dependencies. Yes they will be built (if the LLVM_INCLUDE_TESTS is set on), but the dependency will not be set up correctly and depending on the order in which the targets are built check-sycl might fail to link. It boils down to inter-target dependencies in CMake.

Jul 29 2021, 1:53 AM · Restricted Project

Jul 28 2021

jchlanda added a comment to D104848: [cmake] Handled utils/unittests before projects.

What are you trying to do in a project that is motivating this?

I don’t have any real objection to this change, but the motivation concerns me. Generally we avoid things that require order-dependence, and setting up target dependencies does not.

Jul 28 2021, 8:09 AM · Restricted Project

Jul 20 2021

jchlanda added a comment to D104848: [cmake] Handled utils/unittests before projects.

ping

Jul 20 2021, 7:57 AM · Restricted Project
jchlanda updated subscribers of D104848: [cmake] Handled utils/unittests before projects.
Jul 20 2021, 7:57 AM · Restricted Project

Jun 24 2021

jchlanda requested review of D104848: [cmake] Handled utils/unittests before projects.
Jun 24 2021, 5:03 AM · Restricted Project