Fix the cost model for FADD vector reduction. Cost was being over-estimated from BaseT::getArithmeticReductionCost function. Add special case to AArch64TTIImpl::getArithmeticReductionCost. Reflects a lowering where vector through element-wise vector adds until it is less than or equal to the size of a vector register. Then it is reduced with a pairwise add.
Correction also enables a more optimal lowering of dot product as shown through tests. Originally, the cost model was erroneously preventing this special lowering.
clang-format not found in user’s local PATH; not linting file.