This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU] Fix extra waitcnt being added with BUFFER_INVL2
ClosedPublic

Authored by kerbowa on May 11 2021, 9:46 AM.

Details

Summary

The waitcnt pass incremented the count of pending vmem events for some buffer
invalidate instructions that the pass does not otherwise handle, which caused an
extra waitcnt to be inserted.
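
To illustrate the effect described in the summary, here is a toy model, not the code in SIInsertWaitcnts.cpp. The `Op` enum, the `Barrier` pseudo-instruction and `countInsertedWaits` are all hypothetical names invented for this sketch. It assumes a pass that keeps a single counter of pending vmem events and emits one wait whenever that counter is non-zero at a point that needs outstanding memory operations to finish.

```cpp
#include <vector>

// Hypothetical toy opcodes; these are not LLVM's opcode values.
enum class Op { BufferLoad, BufferInvalidate, Barrier };

// Returns how many waits a simplified pass would insert for `program`.
// countInvalidates == true models the behaviour described in the summary:
// invalidates bump the pending-event counter even though nothing else in
// the pass tracks them.
int countInsertedWaits(const std::vector<Op>& program, bool countInvalidates) {
    int pending = 0;  // vmem events the pass believes are outstanding
    int waits = 0;    // waits the pass decides to insert
    for (Op op : program) {
        switch (op) {
        case Op::BufferLoad:
            ++pending;
            break;
        case Op::BufferInvalidate:
            if (countInvalidates)
                ++pending;
            break;
        case Op::Barrier:
            // A wait is needed only if something is believed to be pending.
            if (pending > 0) {
                ++waits;
                pending = 0;
            }
            break;
        }
    }
    return waits;
}
```

In this model, a program consisting of an invalidate followed by a barrier gets one wait when invalidates are counted and none when they are not, while a program with a real load gets one wait either way.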

Event Timeline

kerbowa created this revision. May 11 2021, 9:46 AM
kerbowa requested review of this revision. May 11 2021, 9:46 AM
Herald added a project: Restricted Project. May 11 2021, 9:46 AM
rampitec accepted this revision. May 11 2021, 11:27 AM

LGTM with a nit: typo in comment.

llvm/lib/Target/AMDGPU/SIInsertWaitcnts.cpp
950

Typo: affect.

This revision is now accepted and ready to land. May 11 2021, 11:27 AM
t-tye added inline comments. May 11 2021, 12:50 PM
llvm/lib/Target/AMDGPU/SIInsertWaitcnts.cpp
1323

Is this the right query? Or is the query really asking if the instruction requires the vmem counts to be tracked? Should the writeback also be handled here? Are invalidate/writeback considered to load/store memory?

This revision was automatically updated to reflect the committed changes.