Details

Reviewers

nicolasvasilache
herhut
christopherbate

Commits

rGfc37f717770a: [mlir][NVGPU]: Fix op description of nvgpu.device_async_wait.

Summary

According to the NVIDIA documentation on cp.async.wait_group
(https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#data-movement-and-conversion-instructions-cp-async-wait-group-cp-async-wait-all),
the numGroups attribute in nvgpu.device_async_wait is an upper bound
(not a lower bound) on the number of async groups that may still be
pending when the executing thread is unblocked.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

yaoyuannnn created this revision. (Jun 29 2023, 12:11 AM)

Herald added a project: Restricted Project. (Jun 29 2023, 12:11 AM)

Herald added subscribers: bviyer, Moerafaat, zero9178 and 22 others.

yaoyuannnn updated this revision to Diff 535654. (Jun 29 2023, 12:21 AM)

This comment was removed by yaoyuannnn.

yaoyuannnn published this revision for review. (Jun 29 2023, 12:25 AM)

Herald added a reviewer: nicolasvasilache. (Jun 29 2023, 12:25 AM)

Herald added a reviewer: herhut.

Herald added a project: Restricted Project.

Herald added subscribers: stephenneuendorffer, nicolasvasilache.

Harbormaster completed remote builds in B241990: Diff 535654. (Jun 29 2023, 1:04 AM)

yaoyuannnn added a reviewer: christopherbate. (Jun 29 2023, 9:12 AM)

In case anyone needs an example to verify: if you push 16 groups and set numGroups = 12 on the wait, the thread unblocks once 12 or fewer groups are still in flight (that is, at least 4 groups have completed).
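The arithmetic in this example can be checked with a small model. This is only a sketch of the counting rule described in the summary, not of the op's lowering or of real hardware behavior; the function names `can_unblock` and `groups_that_must_complete` are hypothetical and exist only for illustration.

```python
def can_unblock(pending_groups: int, num_groups: int) -> bool:
    """Return True when the waiting thread may proceed.

    Models numGroups as an upper bound: the wait is satisfied once
    at most `num_groups` async groups are still pending.
    """
    return pending_groups <= num_groups


def groups_that_must_complete(pushed_groups: int, num_groups: int) -> int:
    """Minimum number of pushed groups that must finish before the wait returns."""
    return max(0, pushed_groups - num_groups)


if __name__ == "__main__":
    # 16 groups pushed, numGroups = 12: at least 4 groups must complete.
    print(groups_that_must_complete(16, 12))
    # 13 groups still pending is too many; 12 or fewer is enough.
    print(can_unblock(13, 12), can_unblock(12, 12))
```

Under this reading, numGroups = 0 means the thread waits until every pushed group has completed.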

It might be a bit clearer if a better example is provided.

This revision is now accepted and ready to land. (Jun 29 2023, 9:43 AM)

Updated with an example of using numGroups.

Fixed a typo.

Harbormaster completed remote builds in B242140: Diff 535866. (Jun 29 2023, 11:44 AM)

Hi Christopher, can you help push the patch?

Closed by commit rGfc37f717770a: [mlir][NVGPU]: Fix op description of nvgpu.device_async_wait. (authored by yaoyuannnn). (Jun 30 2023, 3:48 PM)

This revision was automatically updated to reflect the committed changes.

yaoyuannnn added a commit: rGfc37f717770a: [mlir][NVGPU]: Fix op description of nvgpu.device_async_wait.

This is an archive of the discontinued LLVM Phabricator instance.

[mlir][NVGPU]: Fix op description of nvgpu.device_async_wait.
Closed · Public

Diff 536469

mlir/include/mlir/Dialect/NVGPU/IR/NVGPU.td
