And use this TTI for Cyclone. As explained in the original RFC
(http://thread.gmane.org/gmane.comp.compilers.llvm.devel/92758), the HW
prefetcher handles strides of up to 2KB.
I am also adding tests for this and the previous change (D17943):
- Cyclone prefetching accesses with a large stride
- Cyclone not prefetching accesses with a small stride
- Generic AArch64 subtarget not prefetching either
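
For illustration only, the decision these tests exercise can be sketched as
below. This is a minimal standalone model, not the code in this patch: the
PrefetchConfig struct, the shouldPrefetch helper and the two config constants
are hypothetical names, and treating a stride of exactly 2KB as "large" is an
assumption.

```cpp
#include <cstdint>

// Hypothetical per-subtarget settings; a stand-in for the real TTI hook.
struct PrefetchConfig {
  bool Enabled;       // does this subtarget want software prefetches at all?
  uint64_t MinStride; // smallest stride in bytes worth prefetching
};

// Assumed values: 2048 follows the 2KB figure quoted from the RFC.
constexpr PrefetchConfig Cyclone{true, 2048};
// The generic subtarget does no software prefetching in this sketch.
constexpr PrefetchConfig Generic{false, 0};

// Returns true when a software prefetch would be emitted for an access
// with the given constant stride (in bytes, possibly negative).
constexpr bool shouldPrefetch(const PrefetchConfig &C, int64_t Stride) {
  // Compute the magnitude in unsigned arithmetic to avoid overflow on INT64_MIN.
  const uint64_t Magnitude = Stride < 0 ? uint64_t(0) - static_cast<uint64_t>(Stride)
                                        : static_cast<uint64_t>(Stride);
  // Strides below the threshold are left to the HW prefetcher.
  return C.Enabled && Magnitude >= C.MinStride;
}
```

Under these assumptions, shouldPrefetch(Cyclone, 4096) is true, while
shouldPrefetch(Cyclone, 64) and shouldPrefetch(Generic, 4096) are false,
mirroring the three test cases listed above.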