This is extremly slow on AMDGPU, which has a lot of physical register
and a lot of register classes. Visit registers with common regunits
instead. NFC, except for compile time improvement.
Details
Details
- Reviewers
qcolombet
Diff Detail
Diff Detail
Event Timeline
lib/CodeGen/RegUsageInfoCollector.cpp | ||
---|---|---|
211 | We miss the setting of HasAtLeastOneSubreg in the loop. |
Comment Actions
Remove loop entirely. Ultimately determineCalleeSaves adds everything from MCRegAliasIterator anyway
Comment Actions
Remove loop entirely. Ultimately determineCalleeSaves adds everything from MCRegAliasIterator anyway
Hmm, I missed that. Why did we add this code in the first place?
Comment Actions
It was added by D46315
Looks like it was papering over some SystemZ issues... or maybe it was the TRI::getCalleeSavedRegs vs. MRI::getCalleeSavedRegs.
Anyhow, LGTM.
We miss the setting of HasAtLeastOneSubreg in the loop.