This is an archive of the discontinued LLVM Phabricator instance.

[CUDA/NVPTX] Improve handling of memcpy for -Os compilations.
ClosedPublic

Authored by tra on Aug 17 2023, 3:38 PM.

Details

Summary

We had some instances when LLVM would not inline fixed-count memcpy and ended up
attempting to lower it a a libcall, which would not work on NVPTX as there's no
standard library to call.

The patch relaxes the threshold used for -Os compilation so we're always allowed
to inline memory copy functions.

Diff Detail

Event Timeline

tra created this revision.Aug 17 2023, 3:38 PM
Herald added a project: Restricted Project. · View Herald TranscriptAug 17 2023, 3:38 PM
tra published this revision for review.Aug 17 2023, 3:39 PM
tra added a reviewer: alexfh.
Herald added projects: Restricted Project, Restricted Project. · View Herald TranscriptAug 17 2023, 3:39 PM
alexfh accepted this revision.Aug 18 2023, 8:41 AM

LGTM

This revision is now accepted and ready to land.Aug 18 2023, 8:41 AM
This revision was landed with ongoing or failed builds.Aug 18 2023, 11:28 AM
This revision was automatically updated to reflect the committed changes.