Download Raw Diff

Details

Reviewers

arsenm
t-tye
RamNalamothu

Commits

rG691ff4682f8c: [AMDGPU] Skip CFIInstructions in SIInsertWaitcnts

Summary

CFI emitted during PEI at the beginning of the prologue needs to apply
to any inserted waitcnts on function entry.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

scott.linder created this revision.Mar 26 2020, 12:21 PM

Herald added a project: Restricted Project. · View Herald TranscriptMar 26 2020, 12:21 PM

Herald added subscribers: llvm-commits, kerbowa, hiraditya and 9 others. · View Herald Transcript

scott.linder added a parent revision: D76880: [AMDGPU] Emit entry function CFI.Mar 26 2020, 12:30 PM

scott.linder added a child revision: D76882: [AMDGPU] Implement CFI for non-kernel functions.

git clang-format

Harbormaster failed remote builds in B50595: Diff 252944!Mar 26 2020, 1:03 PM

Needs test

Add a test

scott.linder added reviewers: arsenm, t-tye, RamNalamothu.Mar 26 2020, 1:21 PM

arsenm added inline comments.Mar 26 2020, 1:21 PM

llvm/lib/Target/AMDGPU/SIInsertWaitcnts.cpp
1644	Why just CFI? Why not isMetaInstruction?
llvm/test/CodeGen/AMDGPU/waitcnt-skip-cfi.mir
3–9 ↗	(On Diff #252965)	Shouldn't need IR section

Harbormaster completed remote builds in B50604: Diff 252953.Mar 26 2020, 2:09 PM

Harbormaster completed remote builds in B50610: Diff 252965.

scott.linder marked an inline comment as done.Mar 26 2020, 2:28 PM

scott.linder added inline comments.

llvm/lib/Target/AMDGPU/SIInsertWaitcnts.cpp
1644	Seems reasonable to me. Is there a reason why isMetaInstruction doesn't include PHIs?

arsenm added inline comments.Mar 26 2020, 2:44 PM

llvm/lib/Target/AMDGPU/SIInsertWaitcnts.cpp
1644	It's a real instruction that will emit a copy

Skip all meta instructions, not just CFI

Harbormaster failed remote builds in B50623: Diff 252986!Mar 26 2020, 3:47 PM

arsenm accepted this revision.Mar 26 2020, 4:08 PM

This revision is now accepted and ready to land.Mar 26 2020, 4:08 PM

I may have pushed this up to soon, I missed some tests, and after thinking it
through again I don't know if it is actually correct to put the DEBUG_VALUE
before the waitcnt if the waitcnt could be ensuring some writes are available?

dbg.value isn't supposed to change codegen, so always ignoring them seems like the correct choice?

Harbormaster completed remote builds in B50643: Diff 253019.Mar 26 2020, 5:25 PM

Right, but the placement of the meta instructions here can't affect change codegen, can it? Also, if we do need to skip the DEBUG_VALUEs it may also be that we would need to either update or drop them to avoid producing incorrect debug info. I'm not actually certain this is an issue though, but the situation I have in mind is if we pass an argument by-ref or on the stack, and in the callee we claim the value is in memory when the appropriate counts haven't been waited on yet.

In D76881#1946435, @scott.linder wrote:

Right, but the placement of the meta instructions here can't affect change codegen, can it? Also, if we do need to skip the DEBUG_VALUEs it may also be that we would need to either update or drop them to avoid producing incorrect debug info. I'm not actually certain this is an issue though, but the situation I have in mind is if we pass an argument by-ref or on the stack, and in the callee we claim the value is in memory when the appropriate counts haven't been waited on yet.

Doesn't the placement of CFI matter relative to the real instructions?

In D76881#1946539, @arsenm wrote:

In D76881#1946435, @scott.linder wrote:

Right, but the placement of the meta instructions here can't affect change codegen, can it? Also, if we do need to skip the DEBUG_VALUEs it may also be that we would need to either update or drop them to avoid producing incorrect debug info. I'm not actually certain this is an issue though, but the situation I have in mind is if we pass an argument by-ref or on the stack, and in the callee we claim the value is in memory when the appropriate counts haven't been waited on yet.

Doesn't the placement of CFI matter relative to the real instructions?

Yes, and same for DEBUG_VALUE, which is where my uncertainty comes up. For CFI we absolutely need the directives at the beginning of the block to precede any real instructions, because they apply "on entry". For example, we reference the ABI return address registers here, and we can't have these be defined incorrectly until after the initial s_waitcnt. Conversely for DEBUG_VALUEs it may be that we are constructing them assuming they can reference memory locations for arguments that are only valid after the initial s_waitcnt. So I think we need to handle the two cases distinctly.

scott.linder added a project: debug-info.Mar 27 2020, 11:17 AM

In D76881#1946554, @scott.linder wrote:

In D76881#1946539, @arsenm wrote:

In D76881#1946435, @scott.linder wrote:

Right, but the placement of the meta instructions here can't affect change codegen, can it? Also, if we do need to skip the DEBUG_VALUEs it may also be that we would need to either update or drop them to avoid producing incorrect debug info. I'm not actually certain this is an issue though, but the situation I have in mind is if we pass an argument by-ref or on the stack, and in the callee we claim the value is in memory when the appropriate counts haven't been waited on yet.

Doesn't the placement of CFI matter relative to the real instructions?

Yes, and same for DEBUG_VALUE, which is where my uncertainty comes up. For CFI we absolutely need the directives at the beginning of the block to precede any real instructions, because they apply "on entry". For example, we reference the ABI return address registers here, and we can't have these be defined incorrectly until after the initial s_waitcnt. Conversely for DEBUG_VALUEs it may be that we are constructing them assuming they can reference memory locations for arguments that are only valid after the initial s_waitcnt. So I think we need to handle the two cases distinctly.

In D76881#1946594, @arsenm wrote:

In D76881#1946554, @scott.linder wrote:

In D76881#1946539, @arsenm wrote:

In D76881#1946435, @scott.linder wrote:

Right, but the placement of the meta instructions here can't affect change codegen, can it? Also, if we do need to skip the DEBUG_VALUEs it may also be that we would need to either update or drop them to avoid producing incorrect debug info. I'm not actually certain this is an issue though, but the situation I have in mind is if we pass an argument by-ref or on the stack, and in the callee we claim the value is in memory when the appropriate counts haven't been waited on yet.

Doesn't the placement of CFI matter relative to the real instructions?

Yes, and same for DEBUG_VALUE, which is where my uncertainty comes up. For CFI we absolutely need the directives at the beginning of the block to precede any real instructions, because they apply "on entry". For example, we reference the ABI return address registers here, and we can't have these be defined incorrectly until after the initial s_waitcnt. Conversely for DEBUG_VALUEs it may be that we are constructing them assuming they can reference memory locations for arguments that are only valid after the initial s_waitcnt. So I think we need to handle the two cases distinctly.

ok

From a debug information point of view, the memory is updated when the instruction is issued, regardless of the waitcnt having completed. The Debugger has to stop the wave before it can see that waves state. The act of stopping a wave ensures that all memory actions it has performed are made coherent with the debugger. That includes ensuring all outstanding operations are complete, and caches are flushed and written back.

In D76881#1946979, @t-tye wrote:

In D76881#1946594, @arsenm wrote:

In D76881#1946554, @scott.linder wrote:

In D76881#1946539, @arsenm wrote:

In D76881#1946435, @scott.linder wrote:

Right, but the placement of the meta instructions here can't affect change codegen, can it? Also, if we do need to skip the DEBUG_VALUEs it may also be that we would need to either update or drop them to avoid producing incorrect debug info. I'm not actually certain this is an issue though, but the situation I have in mind is if we pass an argument by-ref or on the stack, and in the callee we claim the value is in memory when the appropriate counts haven't been waited on yet.

Doesn't the placement of CFI matter relative to the real instructions?

Yes, and same for DEBUG_VALUE, which is where my uncertainty comes up. For CFI we absolutely need the directives at the beginning of the block to precede any real instructions, because they apply "on entry". For example, we reference the ABI return address registers here, and we can't have these be defined incorrectly until after the initial s_waitcnt. Conversely for DEBUG_VALUEs it may be that we are constructing them assuming they can reference memory locations for arguments that are only valid after the initial s_waitcnt. So I think we need to handle the two cases distinctly.

ok

From a debug information point of view, the memory is updated when the instruction is issued, regardless of the waitcnt having completed. The Debugger has to stop the wave before it can see that waves state. The act of stopping a wave ensures that all memory actions it has performed are made coherent with the debugger. That includes ensuring all outstanding operations are complete, and caches are flushed and written back.

Ok, that makes sense, in that case I do think that the patch is correct as-is, skipping all meta instructions.

Rebase

Harbormaster completed remote builds in B59945: Diff 270079.Jun 11 2020, 4:50 AM

Closed by commit rG691ff4682f8c: [AMDGPU] Skip CFIInstructions in SIInsertWaitcnts (authored by scott.linder). · Explain WhyJun 17 2020, 9:41 AM

This revision was automatically updated to reflect the committed changes.

scott.linder removed a parent revision: D76880: [AMDGPU] Emit entry function CFI.Jun 17 2020, 9:44 AM

This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU] Skip CFIInstructions in SIInsertWaitcnts
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 252944

llvm/lib/Target/AMDGPU/SIInsertWaitcnts.cpp

This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU] Skip CFIInstructions in SIInsertWaitcntsClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 252944

llvm/lib/Target/AMDGPU/SIInsertWaitcnts.cpp

[AMDGPU] Skip CFIInstructions in SIInsertWaitcnts
ClosedPublic