If the X86 subtarget supports FMA, allow it to generate FMA nodes aggressively, even if that leaves us with duplicated mul(x,y) and fma(x,y,z) cases.
This exposes a likely flaw in the existing enableAggressiveFMAFusion folds: should we really fold fadd(fmul(x,y), fmul(x,y)) -> fma(x,y,fmul(x,y))?
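
As a rough source-level analogue of the fold in question (not the DAG combiner code itself), the two forms can be sketched in C++. The function names `unfused` and `fused` are hypothetical and exist only for illustration. The point is that the fmul is still needed as the addend, so the fold swaps an fadd for an fma without removing the multiply.

```cpp
#include <cmath>

// Before the fold: fadd(fmul(x,y), fmul(x,y)).
// The two identical multiplies are shared, so this is one fmul and one fadd.
double unfused(double x, double y) {
    double m = x * y;          // fmul(x,y)
    return m + m;              // fadd(m, m)
}

// After the fold: fma(x, y, fmul(x,y)).
// The fmul survives as the addend, so this is one fmul and one fma:
// the "duplicated mul(x,y) and fma(x,y,z)" case.
double fused(double x, double y) {
    double m = x * y;          // fmul(x,y), still required
    return std::fma(x, y, m);  // fma(x, y, m)
}
```

For inputs whose product is exactly representable, both forms return the same value, for example 24.0 for x = 3.0 and y = 4.0. In general the fused form rounds x*y + round(x*y) once, so its result may differ from 2 * round(x*y) in the last bit.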
This comment needs to be updated.