This is an archive of the discontinued LLVM Phabricator instance.

[NVPTX] Lower extraction of upper half of i32/i64 as partial move.
ClosedPublic

Authored by tra on Feb 6 2023, 5:18 PM.

Details

Summary

This produces better SASS than right-shift + truncate and is fairly common for
CUDA code that operates on __half2 values represented as opaque integer.

Diff Detail

Event Timeline

tra created this revision.Feb 6 2023, 5:18 PM
Herald added a project: Restricted Project. · View Herald TranscriptFeb 6 2023, 5:18 PM
tra published this revision for review.Feb 7 2023, 11:41 AM
tra added a reviewer: jlebar.
jlebar accepted this revision.Feb 7 2023, 11:44 AM
This revision is now accepted and ready to land.Feb 7 2023, 11:44 AM
This revision was landed with ongoing or failed builds.Feb 7 2023, 2:21 PM
This revision was automatically updated to reflect the committed changes.