This is an archive of the discontinued LLVM Phabricator instance.

[ScalarizeMaskedMemIntrin] Add support for scalarizing expandload and compressstore intrinsics.
ClosedPublic

Authored by craig.topper on Mar 9 2019, 2:27 PM.

Details

Summary

This adds support for scalarizing these intrinsics as well the X86TargetTransformInfo support to avoid scalarizing them in the cases X86 can handle.

I've omitted handling special cases for constant masks for this first pass. Though CodeGenPrepare can constant fold the branch conditions and remove some of the control flow anyway.

Fixes PR40994 and is covers most of PR3666. Might want to implement constant masks to close that.

Diff Detail

Repository
rL LLVM

Event Timeline

craig.topper created this revision.Mar 9 2019, 2:27 PM

llc tests for PR40994 and PR39666?

lib/CodeGen/ScalarizeMaskedMemIntrin.cpp
750 ↗(On Diff #189994)

Pre-commit the above NFC?

Rebase after precommit of NFC change. Add llc tests for existing PRs.

Herald added a project: Restricted Project. · View Herald TranscriptMar 20 2019, 11:12 PM
RKSimon accepted this revision.Mar 21 2019, 9:31 AM

LGTM - cheers

This revision is now accepted and ready to land.Mar 21 2019, 9:31 AM
This revision was automatically updated to reflect the committed changes.