This is an archive of the discontinued LLVM Phabricator instance.

AMDGPU: Split x8 and x16 vector loads instead of scalarize
ClosedPublic

Authored by arsenm on Nov 2 2015, 4:43 PM.

Details

Reviewers
tstellarAMD
Summary

The one regression in the builtin tests is in the read2 test which now
(again) has many extra copies, but this should be solved once the pass
is replaced with a DAG combine.

Diff Detail

Event Timeline

arsenm updated this revision to Diff 39007.Nov 2 2015, 4:43 PM
arsenm retitled this revision from to AMDGPU: Split x8 and x16 vector loads instead of scalarize.
arsenm updated this object.
arsenm added a reviewer: tstellarAMD.
arsenm added a subscriber: llvm-commits.
tstellarAMD added inline comments.Nov 4 2015, 6:06 PM
lib/Target/AMDGPU/AMDGPUISelLowering.cpp
405–406

Was this member variable added in a different patch?

arsenm added inline comments.Nov 4 2015, 6:42 PM
lib/Target/AMDGPU/AMDGPUISelLowering.cpp
405–406

D14267 adds it

tstellarAMD accepted this revision.Nov 19 2015, 2:57 PM
tstellarAMD edited edge metadata.

LGTM.

This revision is now accepted and ready to land.Nov 19 2015, 2:57 PM
arsenm closed this revision.Nov 24 2015, 4:22 AM

r253974