This patch includes proper schedule some numbers for SSE 4.1 and AVX instructions on btver2 CPU.
Details
Details
Diff Detail
Diff Detail
Event Timeline
Comment Actions
I realise this was for SSE41 instructions, but given that its just dot product ops, it might be better to rename it and add the VDPPSY cases as well?
Comment Actions
LGTM - Please rebase to fix the SSE41-schedule diffs (test_extractps and test_pextrd) before commit
AMD docs says this should be [3,3] - go with that