This patch adds a simplfiied version of count for the simplify intrinsics pass, allowing the function to be inlined.
This was done specifically to help improve performance for exchange2, and provides a ~12% performance increase.
Paths
| Differential D142209
[flang] Add Count to simplified intrinsics ClosedPublic Authored by SBallantyne on Jan 20 2023, 6:35 AM.
Details Summary This patch adds a simplfiied version of count for the simplify intrinsics pass, allowing the function to be inlined. This was done specifically to help improve performance for exchange2, and provides a ~12% performance increase.
Diff Detail
Event TimelineHerald added projects: Restricted Project, Restricted Project. · View Herald TranscriptJan 20 2023, 6:35 AM Comment Actions Looks OK in general, just some minor tidy-up suggestions.
SBallantyne added inline comments.
SBallantyne marked an inline comment as done. Comment ActionsChange to test for 0 in simplified count rather than add logicals
Comment Actions Fix zero64/one64 values being swapped around
Comment Actions I think this is good to go in now, subject to @vzakhari not finding anything else wrong. This revision is now accepted and ready to land.Jan 26 2023, 11:53 AM Comment Actions Thank you for the changes. Just one minor thing.
This revision was landed with ongoing or failed builds.Jan 27 2023, 8:30 AM Closed by commit rG7d2e198729df: [flang] Add Count to simplified intrinsics (authored by SBallantyne, committed by MatsPetersson). · Explain Why This revision was automatically updated to reflect the committed changes.
Revision Contents
Diff 492769 flang/lib/Optimizer/Transforms/SimplifyIntrinsics.cpp
flang/test/Transforms/simplifyintrinsics.fir
|
Need to document the new argument.