This is an archive of the discontinued LLVM Phabricator instance.

AMDGPU: Optimize VOP2 operand legalization
ClosedPublic

Authored by arsenm on Sep 22 2015, 7:58 PM.

Details

Reviewers
tstellarAMD
Summary

Don't use commuteInstruction, and don't commute if
doing so will not improve legality.

With this patch and others leading up to it, this
reduces the compile time of SIFixSGPRCopies on
some of the LuxMark 2 kernels from ~8ms to ~5ms
on my system.

Diff Detail

Event Timeline

arsenm updated this revision to Diff 35458.Sep 22 2015, 7:58 PM
arsenm retitled this revision from to AMDGPU: Optimize VOP2 operand legalization.
arsenm updated this object.
arsenm added a reviewer: tstellarAMD.
arsenm added a subscriber: llvm-commits.
arsenm updated this revision to Diff 36686.Oct 6 2015, 5:19 PM

Better handling of VOP2b implicit VCC uses

lib/Target/AMDGPU/SIInstrInfo.cpp
1702

Which patch adds this function?

arsenm updated this revision to Diff 38056.Oct 21 2015, 3:18 PM

Rebase and squash other "Simplify VOP2 operand legalization" patch

tstellarAMD accepted this revision.Dec 1 2015, 7:54 AM
tstellarAMD edited edge metadata.

LGTM.

This revision is now accepted and ready to land.Dec 1 2015, 7:54 AM
arsenm closed this revision.Dec 1 2015, 12:00 PM

r254452

lib/Target/AMDGPU/SIInstrInfo.cpp
1702