This patch enables partial unrolling and runtime unrolling for the AArch64 target. Applied together with the runtime unrolling prologue changes I just sent, it improves SPEC2000 by 0.6% and SPEC2006 by 0.8%. For code size, the images of both benchmark suites grew by the same 20%. The experiment was run on a Cortex-A57.
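
For reviewers less familiar with the transformation, here is a rough source-level sketch of what runtime unrolling with a prologue does to a loop whose trip count is only known at run time. It is an illustration only: the function names (sum_rolled, sum_unrolled4) and the unroll factor of 4 are assumptions made for the example, not taken from the patch, and the actual pass works on LLVM IR rather than on C++ source.

```cpp
#include <cstddef>
#include <cstdint>

// Original loop: the trip count n is not known at compile time.
std::int64_t sum_rolled(const std::int32_t *a, std::size_t n) {
    std::int64_t s = 0;
    for (std::size_t i = 0; i < n; ++i)
        s += a[i];
    return s;
}

// Hand-written equivalent of runtime unrolling by 4 (factor is an assumption).
std::int64_t sum_unrolled4(const std::int32_t *a, std::size_t n) {
    std::int64_t s = 0;
    std::size_t i = 0;

    // Prologue: run the n % 4 leftover iterations first, so the
    // remaining trip count is an exact multiple of the unroll factor.
    const std::size_t rem = n % 4;
    for (; i < rem; ++i)
        s += a[i];

    // Unrolled body: four copies of the original body per iteration,
    // with a single compare-and-branch for all four.
    for (; i < n; i += 4) {
        s += a[i];
        s += a[i + 1];
        s += a[i + 2];
        s += a[i + 3];
    }
    return s;
}
```

Both functions return the same result for any n. The unrolled version executes fewer branches per element, at the cost of the extra prologue loop and the duplicated body, which is consistent with the code size growth reported above.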
Thanks,
Kevin