Allocating larger register classes first should give better allocation results (and more importantly for myself, make the lit tests more stable with respect to scheduler changes).
This is a suggestion to work on, I did not bother to fix the lit tests.