With complex recursive structure of async dispatch function LLVM can't always propagate constants to the parallel_compute_fn and it often prevents optimizations like loop unrolling and vectorization. We help LLVM by pushing known constants into the parallel_compute_fn explicitly.
Details
Details
Summary
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Herald added subscribers: sdasgup3, wenzhicui, wrengr and 20 others. · View Herald TranscriptDec 7 2021, 9:44 AM
This revision is now accepted and ready to land.Dec 9 2021, 6:42 AM
This revision was landed with ongoing or failed builds.Dec 9 2021, 6:48 AM
Closed by commit rG9f151b784be0: [mlir] AsyncParallelFor: sink constants into the parallel compute function (authored by ezhulenev). · Explain Why
This revision was automatically updated to reflect the committed changes.