Most aarch64 function epilogues look the same: A series of ldp
instructions followed by a ret. In fact about all epilogues fall in 1 of
16 patterns. These 16 epilogues are put into compiler-rt to be shared
and the epilogue code is replaced with a jump to these epilogues.
In a testsuite compiled with -Os sharing the epilogues gives a 1.7% percent
code size reduction.
This patch adds the -aarch64-shared-epilogues switch I will perform more
benchmarking to decide whether this is a candidate for -Os or -Oz.
Related to rdar://23082514
When does RET occur here? I can't remember a way to bypass RET_ReallyLR.