This patch adds a simplfiied version of count for the simplify intrinsics pass, allowing the function to be inlined.
This was done specifically to help improve performance for exchange2, and provides a ~12% performance increase.
Differential D142209
[flang] Add Count to simplified intrinsics SBallantyne on Jan 20 2023, 6:35 AM. Authored by
Details This patch adds a simplfiied version of count for the simplify intrinsics pass, allowing the function to be inlined. This was done specifically to help improve performance for exchange2, and provides a ~12% performance increase.
Diff Detail
Event TimelineComment Actions Looks OK in general, just some minor tidy-up suggestions.
Comment Actions Fix zero64/one64 values being swapped around
Comment Actions I think this is good to go in now, subject to @vzakhari not finding anything else wrong. Comment Actions Thank you for the changes. Just one minor thing.
|