This introduces a new family of combiner helper routines that reuse the target-specific cost model from SelectionDAG and generate inline implementations of the memcpy family of intrinsics.
The combines are enabled only at optimization levels above -O0, and yield substantial performance improvements.
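
As a rough illustration of what "generating an inline implementation" means for a constant-size copy, here is a minimal standalone sketch. It is not the code in this patch and uses no LLVM APIs: planInlineCopy, copyWithPlan, maxWidth and maxOps are hypothetical names, and the "cost model" is reduced to two assumed parameters (the widest access in bytes and a cap on the number of accesses). It also ignores alignment and overlapping accesses, which a real target cost model would take into account.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <vector>

// Hypothetical planner (not an LLVM API): split a constant-size copy into
// chunks, widest first. maxWidth is assumed to be a power of two.
// Returns an empty plan when the copy would need more than maxOps accesses,
// which here means "do not inline, keep the library call".
std::vector<std::size_t> planInlineCopy(std::size_t size, std::size_t maxWidth,
                                        std::size_t maxOps) {
  assert(maxWidth > 0 && (maxWidth & (maxWidth - 1)) == 0 &&
         "power-of-two width expected");
  std::vector<std::size_t> widths;
  for (std::size_t w = maxWidth; w > 0 && size > 0; w /= 2) {
    while (size >= w) {
      widths.push_back(w);
      size -= w;
    }
  }
  if (widths.size() > maxOps)
    widths.clear();
  return widths;
}

// Execute a plan. Each fixed-width std::memcpy stands in for one
// load/store pair that an inline expansion would emit.
void copyWithPlan(unsigned char *dst, const unsigned char *src,
                  const std::vector<std::size_t> &widths) {
  std::size_t offset = 0;
  for (std::size_t w : widths) {
    std::memcpy(dst + offset, src + offset, w);
    offset += w;
  }
}
```

With these assumptions, a 15-byte copy with 8-byte accesses becomes four accesses of 8, 4, 2 and 1 bytes, while a 64-byte copy capped at four accesses produces an empty plan and would stay a call.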