This is an archive of the discontinued LLVM Phabricator instance.

[ARM] Fix missed vpadd combine opportunity.
Needs ReviewPublic

Authored by yroux on Nov 16 2017, 9:30 AM.

Details

Reviewers
efriedma
Summary

This fixes Bug 32999.
Adds pattern recognition for ADD(EXTR_SUBV(VUZP.0), EXTR_SUBV(VUZP.1)).

Passed regression tests (x86 host) and nightly test-suite (x86 and ARM hosts)
without regressions.

Diff Detail

Event Timeline

yroux created this revision.Nov 16 2017, 9:30 AM
efriedma added inline comments.
lib/Target/ARM/ARMISelLowering.cpp
9423

Please rename this, or refactor the code, so the difference between AddCombineToVPADD and and this function is clear.

9432

Do you need to check the index of the EXTRACT_SUBVECTORs?

test/CodeGen/ARM/vpadd.ll
406

I'd like to see more tests, to make sure we have some test coverage for cases where the combine isn't legal.