User Details
User Details
- User Since
- Sep 26 2019, 12:57 AM (209 w, 2 d)
Yesterday
Yesterday
jmmartinez added a comment to D144007: [DwarfDebug] Move emission of globals from beginModule() to endModule() (5/7).
Hi @dzhidzhoev !
What's missing to land the rest of the patches? I missed the last discussions and I saw you just rebased this one.
Wed, Sep 20
Wed, Sep 20
jmmartinez committed rG24f437f31e0b: [NFC][AMDGPU] Remove redundant hasSideEffects=1 (authored by jmmartinez).
[NFC][AMDGPU] Remove redundant hasSideEffects=1
jmmartinez committed rG356494c36e71: [NFC][AMDGPU] Perform a single lookup in map in SIInsertWaitcnts… (authored by jmmartinez).
[NFC][AMDGPU] Perform a single lookup in map in SIInsertWaitcnts…
Mon, Sep 11
Mon, Sep 11
jmmartinez added a comment to D159206: [Clang] Propagate target-features if compatible when using mlink-builtin-bitcode.
@yaxunl I've addressed your remarks on a GitHub PR: https://github.com/llvm/llvm-project/pull/65938
Fri, Sep 8
Fri, Sep 8
jmmartinez committed rGd60c47476dde: [Clang] Propagate target-features if compatible when using mlink-builtin-bitcode (authored by jmmartinez).
[Clang] Propagate target-features if compatible when using mlink-builtin-bitcode
Wed, Sep 6
Wed, Sep 6
jmmartinez added inline comments to D159206: [Clang] Propagate target-features if compatible when using mlink-builtin-bitcode.
jmmartinez updated the diff for D159206: [Clang] Propagate target-features if compatible when using mlink-builtin-bitcode.
- Capitalize comment
jmmartinez updated the diff for D159206: [Clang] Propagate target-features if compatible when using mlink-builtin-bitcode.
- Review
Mon, Sep 4
Mon, Sep 4
jmmartinez added inline comments to D159206: [Clang] Propagate target-features if compatible when using mlink-builtin-bitcode.
jmmartinez updated the diff for D159206: [Clang] Propagate target-features if compatible when using mlink-builtin-bitcode.
- Review
Aug 31 2023
Aug 31 2023
jmmartinez updated the diff for D159206: [Clang] Propagate target-features if compatible when using mlink-builtin-bitcode.
- Rebased
jmmartinez added a comment to D159256: [NFC][Clang] Remove redundant function definitions.
Thanks for the review!
jmmartinez committed rG19550e79b50f: [NFC][Clang] Remove redundant function definitions (authored by jmmartinez).
[NFC][Clang] Remove redundant function definitions
jmmartinez added a comment to D159256: [NFC][Clang] Remove redundant function definitions.
jmmartinez updated the diff for D159206: [Clang] Propagate target-features if compatible when using mlink-builtin-bitcode.
- Rollback, drop incompatible functions in a separate commit
jmmartinez added a comment to D159256: [NFC][Clang] Remove redundant function definitions.
@jhuber6 I was wondering if there is a reason you kept 3 versions of mergeDefaultFunctionDefinitionAttributes in https://reviews.llvm.org/D152391 ?
jmmartinez requested review of D159256: [NFC][Clang] Remove redundant function definitions.
Aug 30 2023
Aug 30 2023
jmmartinez added inline comments to D159206: [Clang] Propagate target-features if compatible when using mlink-builtin-bitcode.
jmmartinez updated the diff for D159206: [Clang] Propagate target-features if compatible when using mlink-builtin-bitcode.
- Drop incoming builtins with incompatible target-features
jmmartinez added inline comments to D159206: [Clang] Propagate target-features if compatible when using mlink-builtin-bitcode.
jmmartinez committed rG9b352540184e: [NFC][Clang] Remove unused function `CodeGenModule… (authored by jmmartinez).
[NFC][Clang] Remove unused function `CodeGenModule…
Aug 29 2023
Aug 29 2023
jmmartinez committed rGb63c6e585d86: [NFC][Clang] Add missing & to function argument (authored by jmmartinez).
[NFC][Clang] Add missing & to function argument
Aug 28 2023
Aug 28 2023
jmmartinez requested review of D158991: [NFC][Clang] Add missing & to function argument.
Aug 25 2023
Aug 25 2023
jmmartinez updated the diff for D156679: [WIP][AMDGPU][SIInsertWaitcnts] Do not add s_waitcnt when the counters are known to be 0 already.
- Added a few comments to describe what a soft waitcnt is
- Take into account reviews
- Rebased
Aug 24 2023
Aug 24 2023
jmmartinez added a comment to D158603: [AMDGPU][TargetMachine] Handle case when +extended-image-insts is set, and the user forces +wave64.
Aug 23 2023
Aug 23 2023
Aug 18 2023
Aug 18 2023
jmmartinez updated the diff for D156679: [WIP][AMDGPU][SIInsertWaitcnts] Do not add s_waitcnt when the counters are known to be 0 already.
- Updated the pseudo instruction names to S_WAITCNT_soft and S_WAITCNT_VSCNT_soft
Aug 17 2023
Aug 17 2023
jmmartinez updated the diff for D156679: [WIP][AMDGPU][SIInsertWaitcnts] Do not add s_waitcnt when the counters are known to be 0 already.
- Do not relax "hard" s_waitcnt instructions
- Rebased
- Only 1 test failing: I have to rethink this remaining test consisting mostly of kernels doing only a memory fence. These become just an s_endpgm since there are no memory accesses to fence.
Aug 16 2023
Aug 16 2023
jmmartinez updated the diff for D156679: [WIP][AMDGPU][SIInsertWaitcnts] Do not add s_waitcnt when the counters are known to be 0 already.
- Added S_SOFT_WAITCNT and S_SOFT_WAITCNT_VSCNT
- There is some refactoring in between that I should move to another PR
Aug 14 2023
Aug 14 2023
Aug 7 2023
Aug 7 2023
jmmartinez updated the diff for D156679: [WIP][AMDGPU][SIInsertWaitcnts] Do not add s_waitcnt when the counters are known to be 0 already.
- Fixed initialization of WaitcntBrackets structure for non-kernel functions (since the counters were initialized to 0, the inserted s_waitcnt 0 at function-entry was removed)
- Updated tests to help spot potential issues with the patch:
- One issue is that some memory-fence tests with a kernel doing only a memory fence become irrelevant since the wait instructions are removed. Adding some memory accesses to those would help.
- Added CodeGen/AMDGPU/preserve-user-waitcnt.ll to test how user-inserted waits are handled. Currently, with __builtin_amdgcn_s_waitcnt it is possible that the waitcnt instruction is going to be removed
Aug 4 2023
Aug 4 2023
jmmartinez added inline comments to D156707: AMDGPU: Handle multiple uses when matching sincos.
Aug 2 2023
Aug 2 2023
jmmartinez added inline comments to D156706: AMDGPU: Simplify and improve sincos matching.
Aug 1 2023
Aug 1 2023
jmmartinez added inline comments to D156706: AMDGPU: Simplify and improve sincos matching.
Jul 31 2023
Jul 31 2023
jmmartinez updated the diff for D156679: [WIP][AMDGPU][SIInsertWaitcnts] Do not add s_waitcnt when the counters are known to be 0 already.
- Rebase over main
jmmartinez added a comment to D156671: [AMDGPU][SIInsertWaitcnts] Initialize the WaitcntBrackets for non-kernel functions.
jmmartinez added a comment to D156671: [AMDGPU][SIInsertWaitcnts] Initialize the WaitcntBrackets for non-kernel functions.
jmmartinez added inline comments to D156678: AMDGPU: Refactor libcall simplify to help with future refined fast math flag usage.
jmmartinez added inline comments to D156680: AMDGPU: Don't store current instruction in AMDGPULibCalls member.
jmmartinez added inline comments to D156671: [AMDGPU][SIInsertWaitcnts] Initialize the WaitcntBrackets for non-kernel functions.
Jul 19 2023
Jul 19 2023
jmmartinez committed rG4e43ba259980: [NFC][AMDGPULowerModuleLDSPass] Use shorter APIs in markUsedByKernel (authored by jmmartinez).
[NFC][AMDGPULowerModuleLDSPass] Use shorter APIs in markUsedByKernel
jmmartinez committed rGfcbafc066cc5: [NFC][AMDGPULowerModuleLDSPass] Cleanup of getTableLookupKernelIndex (authored by jmmartinez).
[NFC][AMDGPULowerModuleLDSPass] Cleanup of getTableLookupKernelIndex
Jul 18 2023
Jul 18 2023
jmmartinez updated the diff for D155589: [NFC][AMDGPULowerModuleLDSPass] Use shorter APIs in markUsedByKernel.
- Remove "" from CreateCall
jmmartinez updated the diff for D155588: [NFC][AMDGPULowerModuleLDSPass] Cleanup of getTableLookupKernelIndex.
- Use structured binding declaration
jmmartinez requested review of D155589: [NFC][AMDGPULowerModuleLDSPass] Use shorter APIs in markUsedByKernel.
jmmartinez requested review of D155588: [NFC][AMDGPULowerModuleLDSPass] Cleanup of getTableLookupKernelIndex.
Jul 13 2023
Jul 13 2023
jmmartinez accepted D155132: [amdgpu][lds] Raise an explicit unimplemented error on absolute address LDS variables.
Looks good to me. If you think it's relevant to privde a better diagnostic with the souce location I'd welcome that change.
Jul 12 2023
Jul 12 2023
jmmartinez committed rG367b1f28dbcf: [NFC][AMDGPULowerModuleLDSPass] Fix buildbot santizier failed to compile (authored by jmmartinez).
[NFC][AMDGPULowerModuleLDSPass] Fix buildbot santizier failed to compile
jmmartinez committed rG3a75551e857b: Reland "[NFC][AMDGPULowerModuleLDSPass] Factorize repetead sort code" (authored by jmmartinez).
Reland "[NFC][AMDGPULowerModuleLDSPass] Factorize repetead sort code"
Jul 11 2023
Jul 11 2023
jmmartinez added a comment to D154977: Reland "[NFC][AMDGPULowerModuleLDSPass] Factorize repetead sort code"
Fixed compilation error and reudndant copy warning.
Lesson Learned: This time I'll wait for the LLVM buildbot to confirm all is good...
jmmartinez committed rGebdd610ad41b: Revert "[NFC][AMDGPULowerModuleLDSPass] Factorize repetead sort code" (authored by jmmartinez).
Revert "[NFC][AMDGPULowerModuleLDSPass] Factorize repetead sort code"
jmmartinez committed rG125b90749a98: [NFC][AMDGPULowerModuleLDSPass] Factorize repetead sort code (authored by jmmartinez).
[NFC][AMDGPULowerModuleLDSPass] Factorize repetead sort code
jmmartinez added a reviewer for D154970: [NFC][AMDGPULowerModuleLDSPass] Factorize repetead sort code: JonChesterfield.
jmmartinez requested review of D154970: [NFC][AMDGPULowerModuleLDSPass] Factorize repetead sort code.
jmmartinez updated the diff for D154946: [AMDGPULowerModuleLDSPass] Kernels do not always have entries in KernelToReplacement map.
- rebased
jmmartinez committed rG70bb5d2b9d81: [NFC][AMDGPULowerModuleLDSPass] Add const to some variables/parameters (authored by jmmartinez).
[NFC][AMDGPULowerModuleLDSPass] Add const to some variables/parameters
jmmartinez updated the summary of D154959: [NFC][AMDGPULowerModuleLDSPass] Add const to some variables/parameters.
jmmartinez added a comment to D154946: [AMDGPULowerModuleLDSPass] Kernels do not always have entries in KernelToReplacement map.
jmmartinez added a comment to D154946: [AMDGPULowerModuleLDSPass] Kernels do not always have entries in KernelToReplacement map.
jmmartinez committed rGabf081975eda: [NFC][AMDGPULowerModuleLDSPass] Remove dead variable (authored by jmmartinez).
[NFC][AMDGPULowerModuleLDSPass] Remove dead variable
Jun 29 2023
Jun 29 2023
jmmartinez committed rGdd1df099ae37: [InlineCost][TargetTransformInfo][AMDGPU] Consider cost of alloca instructions… (authored by jmmartinez).
[InlineCost][TargetTransformInfo][AMDGPU] Consider cost of alloca instructions…
jmmartinez committed rGcc8a346e3fa3: [InlineCost][TargetTransformInfo][AMDGPU] Consider cost of alloca instructions… (authored by jmmartinez).
[InlineCost][TargetTransformInfo][AMDGPU] Consider cost of alloca instructions…