This is an archive of the discontinued LLVM Phabricator instance.

Differential D100647

[MLIR] Factor pass timing out into a dedicated timing manager
ClosedPublic

Authored by fabianschuiki on Apr 16 2021, 6:57 AM.

Details

Reviewers

rriddle
lattner

Commits

rG33f908c42881: [MLIR] Factor pass timing out into a dedicated timing manager

Summary

This factors out the pass timing code into a separate TimingManager that can be plugged into the PassManager from the outside. Users are able to provide their own implementation of this manager, and use it to time additional code paths outside of the pass manager. Also allows for multiple PassManagers to run and contribute to a single timing report.

More specifically, moves most of the existing infrastructure in Pass/PassTiming.cpp into a new Support/Timing.cpp file and adds a public interface in Support/Timing.h. The PassTiming instrumentation becomes a wrapper around the new timing infrastructure which adapts the instrumentation callbacks to the new timers.

This is a concrete suggestion towards https://llvm.discourse.group/t/add-entries-to-pass-timing-from-outside-the-passmanager/3115.

Event Timeline

fabianschuiki created this revision. Apr 16 2021, 6:57 AM

Herald added a reviewer: rriddle. Apr 16 2021, 6:57 AM

Herald added subscribers: dcaballe, cota, teijeong and 16 others.

fabianschuiki requested review of this revision. Apr 16 2021, 6:57 AM

Herald added subscribers: stephenneuendorffer, nicolasvasilache. Apr 16 2021, 6:57 AM

fabianschuiki updated this revision to Diff 338103. Apr 16 2021, 7:21 AM

Harbormaster completed remote builds in B99164: Diff 338095. Apr 16 2021, 7:40 AM

Harbormaster completed remote builds in B99168: Diff 338103. Apr 16 2021, 7:59 AM

I support making the timing infrastructure usable and pluggable outside of the pass manager! Thanks for working on this, I looked into it at some point but didn't need it badly enough to finish it.

However I'm not sure about this approach at the moment: it seems that you're trying to keep the PM in charge of all the timing, and using it as a "timing manager" for things that aren't related to pass management.
I would think that it'd be better modeled by extracting the timing manager as a separate entity usable outside of the PM, and having the PM report to the "timing manager" (through injection or similar).

In D100647#2695523, @mehdi_amini wrote:

I support making the timing infrastructure usable and pluggable outside of the pass manager! Thanks for working on this, I looked into it at some point but didn't need it badly enough to finish it.

However I'm not sure about this approach at the moment: it seems that you're trying to keep the PM in charge of all the timing, and using it as a "timing manager" for things that aren't related to pass management.
I would think that it'd be better modeled by extracting the timing manager as a separate entity usable outside of the PM, and having the PM report to the "timing manager" (through injection or similar).

+1 to what Mehdi says here. We want to have general infra related to timing (that ideally users can inject their own implementations of), with the pass manager simply hooking into that.

I agree with Mehdi and River. It seems like the passmgr should be a client of the more general shared thing.

I'd also recommend changing the top level flyer:
... Pass execution timing report ... to just ... Execution time report ...

Thanks for the great feedback! I also had a more general timing manager in mind that we could inject into the PM, but opted for this simpler change to get a discussion rolling. Let me factor out the timing interface into some abstract base class and the current implementation of time keeping into a concrete subclass. That would allow us to default to an MLIR-provided TimingManager class as a convenience (and to not break existing code maybe?), but still allow the user to provide a custom implementation if desired.

Moved most of the timing infrastructure in Pass/PassTiming.cpp into a new Support/Timing.cpp file
Added public interfaces for the timing stuff in Support/Timing.h
Made the pass manager use the timers provided by the new Timing.h
Added a new TimingManager that holds the root timer and prints timing reports (similar to the role the PassTiming instrumentation had previously)
Added TimingManagerOptions and corresponding registerTimingManagerCLOptions and applyTimingManagerCLOptions functions

One problem I ran into when merging async executions of the same pipeline was identifying whether two executed passes were actually "the same" from the user's perspective (i.e. just cloned for the sake of threading). Just using the pass pointer as unique identifier would cause each thread to show up as a separate set of passes. An obvious solution would have been to use pass.getTypeID().getAsOpaquePointer(), but if the user intentionally added the same pass multiple times they would all be collapsed into one. I ended up adding a passPrototype field to Pass itself to track the "pass identity" through cloning inside the PM. (The original pass timing had the same problem and opted for merging the passes in a pipeline by position.)
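The identity problem described above can be illustrated with a small self-contained sketch. `FakePass`, `cloneForThread`, and the `rowsBy...` helpers are hypothetical stand-ins invented for this illustration; they are not the MLIR `Pass` class or anything from the patch. The sketch only shows why keying by object address yields one row per thread, keying by type collapses intentionally repeated passes, and keying by a "prototype" pointer recovers the user's view of the pipeline.

```cpp
#include <cstddef>
#include <set>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for a pass; not the MLIR Pass class.
struct FakePass {
  explicit FakePass(std::string typeName, const FakePass *prototype = nullptr)
      : typeName(std::move(typeName)), prototype(prototype) {}

  // Identity that survives cloning: the pass this one was copied from,
  // or the pass itself if it is an original.
  const FakePass *identity() const { return prototype ? prototype : this; }

  std::string typeName;      // plays the role of the TypeID
  const FakePass *prototype; // set only on clones made for threading
};

// Clone a pass for another thread, remembering which original it came from.
inline FakePass cloneForThread(const FakePass &original) {
  return FakePass(original.typeName, original.identity());
}

// Number of report rows when each executed object gets its own row.
inline std::size_t rowsByPointer(const std::vector<const FakePass *> &executed) {
  return std::set<const FakePass *>(executed.begin(), executed.end()).size();
}

// Number of report rows when passes are merged by type.
inline std::size_t rowsByType(const std::vector<const FakePass *> &executed) {
  std::set<std::string> keys;
  for (const FakePass *pass : executed)
    keys.insert(pass->typeName);
  return keys.size();
}

// Number of report rows when passes are merged by prototype.
inline std::size_t rowsByPrototype(const std::vector<const FakePass *> &executed) {
  std::set<const FakePass *> keys;
  for (const FakePass *pass : executed)
    keys.insert(pass->identity());
  return keys.size();
}
```

With a pipeline of `cse, canonicalize, cse` cloned for two threads, the three keys produce six, two, and three rows respectively; only the last matches what the user wrote.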

I'm not sure how far we want to preserve compatibility with respect to CL options. The patch as it is now preserves the old --pass-timing and --pass-timing-display options of the PM, as well as the PM's enableTiming(config) function. If no external timing manager is provided, the PM will just create its own temporary one internally, report there, and print results as before. The new --timing and --timing-display can still be used and if present take precedence over --pass-timing. I'm also happy to throw out the old --pass-timing options, or rename the new ones to --pass-timing instead.

Finally I also added reporting for the "remaining" time not covered by dedicated timers, as suggested by @lattner. It is currently only shown for the top of the hierarchy. It's easy to add for all nested timers that have children, but that adds *a lot* of entries to the pipeline.

The output from the CIRCT project's firtool with the new timing manager looks as follows:

===-------------------------------------------------------------------------===
                        ... Execution timing report ...
===-------------------------------------------------------------------------===
  Total Execution Time: 0.0067 seconds

  ----User Time----  ----Wall Time----  ----Name----
    0.0010 ( 13.4%)    0.0010 ( 14.9%)  Parser
    0.0012 ( 15.7%)    0.0012 ( 17.5%)  'firrtl.circuit' Pipeline
    0.0004 (  6.0%)    0.0004 (  6.6%)    LowerFIRRTLTypes
    0.0006 (  7.7%)    0.0006 (  8.5%)    'firrtl.module' Pipeline
    0.0002 (  2.1%)    0.0002 (  2.3%)      CSE
    0.0000 (  0.1%)    0.0000 (  0.1%)        (A) DominanceInfo
    0.0004 (  5.3%)    0.0004 (  6.0%)      Canonicalizer
    0.0014 ( 19.3%)    0.0014 ( 21.5%)  LowerFIRRTLToRTL
    0.0002 (  2.2%)    0.0002 (  2.5%)  RTLMemSimImpl
    0.0011 ( 14.1%)    0.0004 (  5.5%)  'rtl.module' Pipeline
    0.0003 (  4.5%)    0.0002 (  2.3%)    RTLCleanup
    0.0002 (  3.2%)    0.0001 (  1.4%)    CSE
    0.0000 (  0.2%)    0.0000 (  0.1%)      (A) DominanceInfo
    0.0003 (  4.3%)    0.0001 (  1.6%)    Canonicalizer
    0.0005 (  6.5%)    0.0005 (  7.3%)  RTLLegalizeNames
    0.0002 (  2.2%)    0.0002 (  2.4%)    (A) circt::sv::LegalNamesAnalysis
    0.0008 ( 10.6%)    0.0008 ( 11.8%)  Output
    0.0008 ( 11.2%)    0.0008 ( 12.5%)  Rest
    0.0075 (100.0%)    0.0067 (100.0%)  Total

Of these, Parser and Output are timers run before and after the PM, and Rest indicates the time that is not covered by the top-level timers.
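As a rough illustration of how such a "Rest" entry can be derived, the sketch below subtracts the time covered by child timers from the parent's time. `TimerRecord` and `restTime` are hypothetical names, the numbers used with it are made up rather than taken from the report above, and clamping at zero is an assumption of this sketch, not something the patch is known to do.

```cpp
#include <vector>

// Hypothetical record of one child timer; not a type from the patch.
struct TimerRecord {
  const char *name;
  double wallSeconds;
};

// Time spent under the parent that none of its direct children account for.
inline double restTime(double parentWallSeconds,
                       const std::vector<TimerRecord> &children) {
  double covered = 0.0;
  for (const TimerRecord &child : children)
    covered += child.wallSeconds;
  double rest = parentWallSeconds - covered;
  // Assumption: never report a negative remainder.
  return rest > 0.0 ? rest : 0.0;
}
```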

Herald added subscribers: jfb, mgorny. Apr 21 2021, 2:24 AM

Harbormaster completed remote builds in B99920: Diff 339144. Apr 21 2021, 3:00 AM

rriddle added inline comments. Apr 21 2021, 11:04 AM

mlir/include/mlir/Pass/Pass.h
150	You can't rely on this meaning that the pass is a threading sibling with the original pass.
mlir/include/mlir/Support/Timing.h
114	nit: Just spell out auto here.
122	This display mode looks specific to the current pass manager implementation; I'm not sure it is generally applicable to other potential timer backends. A couple of related general questions:
Do we need a general TimingConfig anymore? This was necessary before because of how the pass manager hid the internal timing instrumentation, but I'm not sure that it is really necessary anymore. How much of it would be shared across different potential timer backends?
Can you rename TimingManagerBase to just TimingManager? I don't think the default implementation should be called TimingManager; it should likely be called something else (not sure of the name, though). This would establish that TimingManager (as you currently have it) is just an implementation and not the interface.
Can you add comment blocks? I find these help to separate different concepts within a source file (e.g. https://github.com/llvm/llvm-project/blob/a7f33d720b74a92031958fce0e209794d296393d/mlir/include/mlir/Pass/Pass.h#L310).
Do you have an idea of what supporting another timing backend would look like? Not that you have to do the work, but when I've thought about this before, ideally I would want to be able to support something like (1) or (2) without needing a new API/interface. Ideally these would all just be implementations that a user can choose from and pass to the general infra to use for timer reporting.
(1): https://github.com/llvm/llvm-project/blob/main/llvm/include/llvm/Support/TimeProfiler.h
(2): https://github.com/wolfpld/tracy
mlir/lib/Support/Timing.cpp
135	The virtual isn't necessary when you have override.
140	nit: Drop the else after a return.
172	Drop the trivial braces here.

This is looking really great Fabian!

mlir/include/mlir/Pass/PassManager.h
317–324	Unless this is a widely used typedef that could occur in downstream code, I'd recommend just eliminating the `PassTimingConfig` name, updating clients to use `TimingConfig` directly.
mlir/include/mlir/Support/Timing.h
27	Please follow clang-tidy's advice here and add a virtual dtor. Also, please declare one method (e.g. the dtor) out of line in a .cpp file to give the vtable a home in a .o file.
162	I agree with River's comment that it would be great to eliminate this class.
mlir/lib/Support/Timing.cpp
108	As clang tidy suggests please declare this as a class. The difference matters for MSVC
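The request above for a virtual destructor with one out-of-line method follows a common C++ pattern, sketched here in a single file. `ManagerInterface` and `CountingManager` are hypothetical classes for illustration only; in a real project the out-of-line definition would sit in the `.cpp` file so that the vtable is emitted in exactly one object file.

```cpp
#include <memory>

// Hypothetical interface; illustrates the pattern only, not Timing.h.
class ManagerInterface {
public:
  // Virtual so that deleting through a base pointer runs the derived dtor.
  // Declared here, defined out of line below.
  virtual ~ManagerInterface();
  virtual int rootTimerId() = 0;
};

// In a real project this definition lives in the .cpp file. Being the one
// non-inline virtual member, it anchors the vtable to that object file.
ManagerInterface::~ManagerInterface() = default;

// Small implementation that counts how often it is destroyed.
class CountingManager : public ManagerInterface {
public:
  explicit CountingManager(int &destroyed) : destroyed(destroyed) {}
  ~CountingManager() override { ++destroyed; }
  int rootTimerId() override { return 0; }

private:
  int &destroyed;
};
```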

fabianschuiki marked 10 inline comments as done. May 3 2021, 8:55 AM

fabianschuiki added inline comments.

mlir/include/mlir/Pass/Pass.h
150	Renamed this to `threadingSibling` and moved the tracking of copies from `Pass::clone()` into the `OpPassManager` constructor. Since the `OpPassManager` is only duplicated in `OpToOpPassAdaptor::runOnOperationAsyncImpl`, that should reliably identify clones that were made for the sake of threading.

Thanks for the great feedback @rriddle and @lattner! Updated the diff with your suggestions and fixed the check-mlir run.

@rriddle's question about how easily we can spin up additional implementations for stuff like the LLVM TimeProfiler or tracy got me thinking. The previous design would require you to subclass both TimerBase and TimingManagerBase to hook into the infrastructure. Since you had to hand out TimerBase instances, you had to allocate them somewhere in memory, which was particularly bad if the external timing API you were wrapping was giving you IDs or pointers to its own data structures. This revision now simplifies the design: TimerBase has been replaced by a Timer that is not intended to be subclassed, but packages a pointer to a timing manager and an opaque ID/handle identifying the timer. TimingManagerBase is now called TimingManager and has overridable callback functions for timer start, stop, and nesting which Timer forwards to. That means there is one thing for implementations to subclass (TimingManager), and one thing for library code to interact with (Timer, and by extension TimingScope). Makes things much easier to use. Also, wrappers around external timing libs can just pass around the pointers they get from those libs as opaque handles in Timer.
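The revised design described above (a value-type timer that wraps a manager pointer plus an opaque handle, and a single manager class with overridable callbacks) might look roughly like the following. This is a minimal sketch under stated assumptions: every name here (`SketchTimer`, `SketchTimingManager`, `LoggingTimingManager`, and the callback names) is hypothetical and chosen for illustration; it is not the interface in `mlir/Support/Timing.h`.

```cpp
#include <cstdint>
#include <string>
#include <vector>

class SketchTimingManager;

// Value-type handle that library code passes around. Not meant to be
// subclassed; it only forwards to the manager it came from.
class SketchTimer {
public:
  SketchTimer() = default; // a disabled timer: every call is a no-op
  SketchTimer(SketchTimingManager *tm, void *handle) : tm(tm), handle(handle) {}

  explicit operator bool() const { return tm != nullptr; }
  void start();
  void stop();
  SketchTimer nest(const std::string &name);

private:
  SketchTimingManager *tm = nullptr; // null means timing is disabled
  void *handle = nullptr;            // opaque, interpreted by the manager
};

// The single class a backend subclasses.
class SketchTimingManager {
public:
  virtual ~SketchTimingManager() = default;
  SketchTimer rootTimer() { return SketchTimer(this, rootHandle()); }

protected:
  friend class SketchTimer;
  virtual void *rootHandle() = 0;
  virtual void startTimer(void *handle) = 0;
  virtual void stopTimer(void *handle) = 0;
  virtual void *nestTimer(void *parent, const std::string &name) = 0;
};

inline void SketchTimer::start() {
  if (tm)
    tm->startTimer(handle);
}
inline void SketchTimer::stop() {
  if (tm)
    tm->stopTimer(handle);
}
inline SketchTimer SketchTimer::nest(const std::string &name) {
  return tm ? SketchTimer(tm, tm->nestTimer(handle, name)) : SketchTimer();
}

// Toy backend: records events and uses "index + 1" as the opaque handle,
// the way a wrapper might pass through IDs from an external library.
class LoggingTimingManager : public SketchTimingManager {
public:
  std::vector<std::string> events;

protected:
  void *rootHandle() override { return makeHandle("root"); }
  void startTimer(void *handle) override {
    events.push_back("start " + nameOf(handle));
  }
  void stopTimer(void *handle) override {
    events.push_back("stop " + nameOf(handle));
  }
  void *nestTimer(void *, const std::string &name) override {
    return makeHandle(name);
  }

private:
  void *makeHandle(const std::string &name) {
    names.push_back(name);
    return reinterpret_cast<void *>(static_cast<std::uintptr_t>(names.size()));
  }
  const std::string &nameOf(void *handle) const {
    return names[reinterpret_cast<std::uintptr_t>(handle) - 1];
  }
  std::vector<std::string> names;
};
```

The point of the sketch is the division of labor: library code only ever touches `SketchTimer`, and a wrapper around an external profiler only has to subclass the manager and hand out whatever IDs or pointers that profiler already provides.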

Dropped all the TimingConfig business and made the TimingDisplayMode a characteristic of the default timing manager implementation.

DefaultTimingManager is maybe not the best of names; suggestions are very welcome! Maybe ConsoleTimingManager, BasicTimingManager, SimpleTimingManager?

The redundancy between the timing options in PassManagerOptions and the new ones in TimingManagerOptions felt awkward. I dropped the timing options from the PM altogether in favor of dealing with timing config completely outside of the PM. The pass manager can own a TimingManager, and there is now an applyDefaultTimingPassManagerCLOptions() function which generates a dedicated timing manager for the PM with some options in case the user does not want to deal with timing themselves. The intended usage though is through PassManager::enableTiming(TimingManager).

I renamed the --pass-timing and --pass-timing-display options to --timing and --timing-display, respectively, to reflect that the scope of these has broadened beyond pass timing.

I updated MlirOptMain and the pass-timing.mlir test to use the new timing infrastructure and options, and to act as an example on how to use them.

Harbormaster completed remote builds in B102302: Diff 342412. May 3 2021, 10:21 AM

This looks really nice Fabian!

This revision is now accepted and ready to land. May 3 2021, 9:50 PM

How do you envision this being used deep within the compiler? Should we have a static API that is accessible from anywhere? Should we have something on the context? What are your thoughts here?

Also, please fix the clang tidy issues.

mlir/include/mlir/Support/Timing.h
160	nit: Please make bool conversion operators explicit.
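The reason for the request above can be shown with two tiny types. `ImplicitScope` and `ExplicitScope` are hypothetical and exist only for this illustration. An implicit `operator bool` silently allows conversions to arithmetic types, while the explicit form still works in `if` conditions and with `!` but rejects those accidental conversions.

```cpp
#include <type_traits>

// Hypothetical scope-like types; only the conversion operator differs.
struct ImplicitScope {
  void *tm = nullptr;
  operator bool() const { return tm != nullptr; }
};

struct ExplicitScope {
  void *tm = nullptr;
  explicit operator bool() const { return tm != nullptr; }
};

// With an implicit operator, `int x = scope;` compiles by going through bool.
static_assert(std::is_convertible<ImplicitScope, int>::value,
              "implicit bool conversion also permits conversion to int");

// With an explicit operator, only contextual conversions (if, !, &&) work.
static_assert(!std::is_convertible<ExplicitScope, int>::value,
              "explicit bool conversion rejects accidental int conversion");
```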

(Apologies for the delay in review, will take a pass over tomorrow)

fabianschuiki marked an inline comment as done. May 3 2021, 11:22 PM

Fixed clang-tidy issues and made bool conversion explicit. Sorry about those, I'm having a somewhat hard time getting my local clang-tidy to reproduce the set of upstream checks.

Regarding usage within the compiler: Having to thread TimingScopes through everywhere sounds pretty unergonomic if you just want to time a few things within the compiler. Ideally we would have a per-thread stack of active timing scopes somewhere (Maybe in the context? As a thread-local static?). Pointers to the active TimingScopes would then be kept on that stack, and you could just ask for the currently-active one to nest into. That way you could get at whatever scope within the parent code you're implicitly executing in, without needing to explicitly push scope references down the call chain. Maybe context.getTimingScope(id, nameBuilder) and context.getTimer(id, nameBuilder)?
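One way to picture the per-thread stack of active scopes floated above is the sketch below, which uses the thread-local static variant. `ActiveScope` and `describeCurrentScope` are hypothetical names; this is only one of the options mentioned in the comment and is not what the patch implements.

```cpp
#include <string>
#include <utility>
#include <vector>

// Hypothetical RAII scope that registers itself on a per-thread stack.
class ActiveScope {
public:
  explicit ActiveScope(std::string name) : name(std::move(name)) {
    stack().push_back(this);
  }
  ~ActiveScope() { stack().pop_back(); }

  // Scopes are tracked by address, so they must not be copied or moved.
  ActiveScope(const ActiveScope &) = delete;
  ActiveScope &operator=(const ActiveScope &) = delete;

  const std::string &getName() const { return name; }

  // Innermost scope active on the calling thread, or nullptr if none.
  static ActiveScope *current() {
    return stack().empty() ? nullptr : stack().back();
  }

private:
  // One stack per thread; other threads never see these entries.
  static std::vector<ActiveScope *> &stack() {
    thread_local std::vector<ActiveScope *> scopes;
    return scopes;
  }

  std::string name;
};

// Code deep in a call chain can find its enclosing scope without having
// a scope reference passed down explicitly.
inline std::string describeCurrentScope() {
  ActiveScope *scope = ActiveScope::current();
  return scope ? scope->getName() : std::string("<none>");
}
```

Because scopes are created and destroyed in strict nesting order on each thread, a plain vector used as a stack is sufficient here.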

Harbormaster completed remote builds in B102467: Diff 342652. May 4 2021, 12:07 AM

In general, I have concerns with any design that would use a single global data structure for book-keeping. I would look into hooking the storage onto the MLIRContext as needed.

In D100647#2735576, @fabianschuiki wrote:

Fixed clang-tidy issues and made bool conversion explicit. Sorry about those, I'm having a somewhat hard time getting my local clang-tidy to reproduce the set of upstream checks.

Regarding usage within the compiler: Having to thread TimingScopes through everywhere sounds pretty unergonomic if you just want to time a few things within the compiler. Ideally we would have a per-thread stack of active timing scopes somewhere (Maybe in the context? As a thread-local static?). Pointers to the active TimingScopes would then be kept on that stack, and you could just ask for the currently-active one to nest into. That way you could get at whatever scope within the parent code you're implicitly executing in, without needing to explicitly push scope references down the call chain. Maybe context.getTimingScope(id, nameBuilder) and context.getTimer(id, nameBuilder)?

I would not add timing API to the context itself, but instead hook in a reference to a timing manager. Mehdi touches a bit on the general opinion that MLIR takes about these in terms of static vs non-static, and returning a manager object is in line with the other things hooked into the context. One important piece here, though, is that it should generally be zero-cost if there is no timing manager.

mlir/include/mlir/Pass/PassManager.h
371	Why can't this be handled by the timing instrumentation? I don't think we need these fields here.
mlir/include/mlir/Support/Timing.h
17	This is an indicator that the dependencies are off. Support/ shouldn't depend on anything in IR/.
333	nit: I don't think the llvm:: here is necessary if you include LLVM.h
mlir/lib/Pass/PassTiming.cpp
108	nit: Drop the trivial braces here.
mlir/lib/Support/MlirOptMain.cpp
51	I'm not sure about this type of integration. This seems to always enforce timing, which isn't something that I would expect. If we start integrating timing more deeply into the compiler, I would expect it to do nothing unless explicitly enabled.

This revision now requires changes to proceed. May 4 2021, 10:01 PM

I think having a reference to a TimingManager in the context would be great. That would allow us to encode timing enable/disable through the presence/absence of such a reference (right now it's an explicit field in the TM). For the sake of keeping code that uses the timing API streamlined, we should probably have the getTimingManager() return a NullTimingManager reference that always hands out disabled Timers. Since the latter are just a wrapper around a pointer, and their functions all return immediately if disabled, the runtime cost in code should be equivalent to having manual "is timing enabled" checks.
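The null-object idea in the comment above could be sketched as follows. `MiniContext`, `MiniManager`, `NullManager`, and `RecordingManager` are hypothetical names for this illustration; `MiniContext` is not `MLIRContext`, and the `record` method is a deliberately simplified placeholder for handing out timers.

```cpp
#include <string>
#include <vector>

// Hypothetical minimal manager interface.
class MiniManager {
public:
  virtual ~MiniManager() = default;
  // Returns true if the event was actually recorded.
  virtual bool record(const std::string &name) = 0;
};

// Null object: accepts every call and does nothing.
class NullManager : public MiniManager {
public:
  bool record(const std::string &) override { return false; }
};

// A manager that keeps what it is given.
class RecordingManager : public MiniManager {
public:
  std::vector<std::string> names;
  bool record(const std::string &name) override {
    names.push_back(name);
    return true;
  }
};

// Hypothetical context holding an optional, non-owned manager.
class MiniContext {
public:
  void setTimingManager(MiniManager *tm) { manager = tm; }

  // Always returns a usable reference, so callers need no null checks.
  MiniManager &getTimingManager() {
    if (manager)
      return *manager;
    static NullManager null;
    return null;
  }

private:
  MiniManager *manager = nullptr;
};
```

Callers write `context.getTimingManager().record(...)` unconditionally; whether anything is measured depends only on whether a manager was installed.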

Do you guys want to tackle the integration into MLIRContext already as part of this diff, or push that off into a separate one?

mlir/include/mlir/Pass/PassManager.h
371	Yeah good point, I'll move that into the timing instrumentation.
mlir/include/mlir/Support/Timing.h
17	Good point. Will check for some other/dedicated mechanism to map a string to an opaque pointer identifying the string content. Since we have a `TimingManager` now, that could just as well go in there.
mlir/lib/Support/MlirOptMain.cpp
51	The timing is disabled in the manager upon construction. The call to `applyDefaultTimingManagerCLOptions(tm)` on the next line enables timing if the options are present on the command line. If the timing remains disabled, `getRootScope()` below returns a disabled `TimingScope` that has a fast path to do nothing (just a pointer comparison). With some integration into `MLIRContext` and a stack of running timers, you could remove the `timing` variable below and instead have things like:
    DefaultTimingManager tm;
    applyDefaultTimingManagerCLOptions(tm);
    context->setTimingManager(tm); // or some ownership transfer
    auto parserTiming = context->getTimingManager()->addTimingScope("Parser");
    parserTiming.stop();
    auto outputTiming = context->getTimingManager()->addTimingScope("Output");

In D100647#2738046, @fabianschuiki wrote:

I think having a reference to a TimingManager in the context would be great. That would allow us to encode timing enable/disable through the presence/absence of such a reference (right now it's an explicit field in the TM). For the sake of keeping code that uses the timing API streamlined, we should probably have the getTimingManager() return a NullTimingManager reference that always hands out disabled Timers. Since the latter are just a wrapper around a pointer, and their functions all return immediately if disabled, the runtime cost in code should be equivalent to having manual "is timing enabled" checks.

Do you guys want to tackle the integration into MLIRContext already as part of this diff, or push that off into a separate one?

I'm okay with shifting that to a follow-up, so that it is easier to review the diff.

Can you also draft up some documentation for the timing infra, with examples of how users would interact with it and how it could be used to inject custom timers? I'm fine with that being a follow-up as well.

This looks really great! I'm looking forward to being able to use it.

mlir/include/mlir/Support/Timing.h
87–88	/// Implementations override this method to provide access to the top-level timer. This method isn't virtual, what do you mean by the comment above?
mlir/lib/Support/MlirOptMain.cpp
51	Ah, I completely missed that. Thanks for pointing it out! I like what you have here.

fabianschuiki marked 3 inline comments as done. May 5 2021, 12:27 AM

fabianschuiki added inline comments.

mlir/include/mlir/Support/Timing.h
87–88	Oops, my bad -- that's a leftover from when `rootTimer()` below wasn't a thing.

fabianschuiki marked 5 inline comments as done. May 5 2021, 1:48 AM

Moved the owned timing manager and scope into PassTiming, dropping those fields from PassManager.
Added a TimingIdentifier to remove the dependency on IR/Identifier.h. This new identifier is a version of Identifier/MLIRContext but stripped-down to cover only the functionality needed by the TM.
Other cleanup
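The idea of mapping a string to an opaque pointer that identifies its content is essentially string interning. A minimal sketch follows, with the loud caveat that `StringInterner` is a hypothetical class written for this illustration; it is not the `TimingIdentifier` added by the patch, and unlike a production implementation it is not thread-safe.

```cpp
#include <string>
#include <unordered_set>

// Hypothetical interner: equal string contents map to the same pointer.
class StringInterner {
public:
  // Returns an opaque ID that compares equal for equal contents.
  const void *intern(const std::string &text) {
    // insert() returns the existing element if the content is already
    // present, so the address is the same for every equal string.
    return &*pool.insert(text).first;
  }

  // Recovers the string behind an ID produced by intern().
  static const std::string &lookup(const void *id) {
    return *static_cast<const std::string *>(id);
  }

private:
  // Element addresses in std::unordered_set stay valid across rehashing,
  // so handing out pointers to elements is safe while the pool is alive.
  std::unordered_set<std::string> pool;
};
```

Comparing two timer names then reduces to a pointer comparison, which is what makes such IDs convenient as keys for merging timers.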

Harbormaster completed remote builds in B102684: Diff 342966. May 5 2021, 2:43 AM

Looks good to me with comments resolved.

(Also should clarify when I said docs, I was thinking something in mlir/docs/)

mlir/include/mlir/Support/Timing.h
79	demarcate
230	Can you change all of the uses of `std::function<> &&` to function_ref? std::function generally ends up allocating memory, which we don't really want to do. Given that we aren't storing the function, we only really need a reference anyways.
402	Is this something we could make non-virtual? I think it would be nice if we could avoid an indirect call in the case where timing is disabled.
mlir/lib/Support/Timing.cpp
33	This shouldn't be necessary, StringRef is exported in the mlir namespace (see LLVM.h)
542	Can you prefix these with `mlir-`? I'm always hesitant about adding very commonly named things to the global command line namespace.
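To make the `function_ref` suggestion above concrete, here is a minimal self-contained stand-in for a non-owning callable reference. `FunctionRef` below is written for this illustration and only mirrors the general idea behind `llvm::function_ref`; it is not the LLVM implementation. `timerName` is likewise a hypothetical consumer showing the "name builder" use case, where the name is only computed if timing is enabled.

```cpp
#include <string>
#include <type_traits>
#include <utility>

template <typename Fn> class FunctionRef;

// Non-owning reference to a callable: two pointers, no heap allocation.
// The referenced callable must outlive the FunctionRef.
template <typename Ret, typename... Args> class FunctionRef<Ret(Args...)> {
public:
  // Excludes FunctionRef itself so that copying uses the copy constructor.
  template <typename Callable,
            typename = typename std::enable_if<!std::is_same<
                typename std::decay<Callable>::type, FunctionRef>::value>::type>
  FunctionRef(Callable &&callable)
      : callback(&invoke<typename std::remove_reference<Callable>::type>),
        callee(const_cast<void *>(static_cast<const void *>(&callable))) {}

  Ret operator()(Args... args) const {
    return callback(callee, std::forward<Args>(args)...);
  }

private:
  // Trampoline that restores the concrete callable type and calls it.
  template <typename Callable> static Ret invoke(void *callee, Args... args) {
    return (*static_cast<Callable *>(callee))(std::forward<Args>(args)...);
  }

  Ret (*callback)(void *, Args...);
  void *callee;
};

// Hypothetical consumer: the name is only built when timing is enabled.
inline std::string timerName(bool enabled,
                             FunctionRef<std::string()> nameBuilder) {
  return enabled ? nameBuilder() : std::string();
}
```

Since the callee is only borrowed for the duration of the call, this fits parameters that are invoked and then forgotten; anything that needs to store the callable still needs an owning wrapper.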

This revision is now accepted and ready to land. May 6 2021, 11:34 AM

fabianschuiki marked 5 inline comments as done. May 6 2021, 1:19 PM

fabianschuiki added inline comments.

mlir/include/mlir/Support/Timing.h
402	The way you use the TM at the moment is roughly:
    DefaultTimingManager tm;
    TimingScope ts = tm.getRootScope();
    // all further timing calls go through `ts`
If timing is disabled, `ts` is marked as disabled through a nullptr in its `tm` field, which causes all subsequent interaction to short-circuit into no-ops. So in its current implementation, if timing is disabled there is just one virtual function call for the duration of the program; that should not be visible in practice. I do agree that it would be nicer if the setup-related interaction with the TM were non-virtual. Especially if we start to keep per-thread stacks of active timing scopes, that should all be directly accessible. We could promote `setEnabled` and `isEnabled` into the `TimingManager` base class such that it can just never ask for the `rootTimer` if disabled. Pretty sure that all TM impls can get behind the concept of enabling/disabling. Do you want to tackle this in this diff, or push that off to a follow-up diff alongside tracking a TM in the context?
mlir/lib/Support/Timing.cpp
542	Yeah that's a good idea. `--timing` did seem awfully brief.
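The follow-up idea from the reply on line 402 above (promoting the enabled flag into the base class so that a disabled manager never makes a virtual call) might be sketched like this. `FastPathManager` and `CountingBackend` are hypothetical names, and the method names only loosely echo those mentioned in the comment.

```cpp
// Hypothetical base class with a non-virtual enable flag.
class FastPathManager {
public:
  virtual ~FastPathManager() = default;

  void setEnabled(bool enable) { enabled = enable; }
  bool isEnabled() const { return enabled; }

  // Non-virtual entry point: when disabled, returns a null handle without
  // ever dispatching to the backend.
  void *getRootHandle() { return enabled ? rootTimer() : nullptr; }

protected:
  // Only reached when timing is enabled.
  virtual void *rootTimer() = 0;

private:
  bool enabled = false; // disabled upon construction
};

// Backend that counts how often the virtual hook is reached.
class CountingBackend : public FastPathManager {
public:
  int virtualCalls = 0;

protected:
  void *rootTimer() override {
    ++virtualCalls;
    return this;
  }
};
```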

Applied suggestions by @rriddle.

rriddle added inline comments. May 6 2021, 1:20 PM

mlir/include/mlir/Support/Timing.h
402	Followup is alright with me, thanks!

Harbormaster completed remote builds in B103056: Diff 343490. May 6 2021, 2:02 PM

Awesome, thanks! What are the next steps (sorry, first time Phabricator user)?

git pull --rebase
ninja check-mlir # sanity check
git push

if needed:
git pull --rebase
git push

fabianschuiki set the repository for this revision to rG LLVM Github Monorepo. May 12 2021, 9:17 AM

Closed by commit rG33f908c42881: [MLIR] Factor pass timing out into a dedicated timing manager (authored by fabianschuiki). May 12 2021, 9:19 AM

This revision was automatically updated to reflect the committed changes.

fabianschuiki added a commit: rG33f908c42881: [MLIR] Factor pass timing out into a dedicated timing manager.

Thanks @lattner! Wasn't sure whether the ready-to-land is the green light to push to the repo myself. Done!

Thanks again Fabian. It would also be good to make sure this gets mentioned in MLIR biweekly

In D100647#2754448, @lattner wrote:

Thanks again Fabian. It would also be good to make sure this gets mentioned in MLIR biweekly

Yes please! :)

(here is the link: https://llvm.discourse.group/t/work-in-progress-next-mlir-news-33rd-edition-5-1-5-14-2021/3417 )

In D100647#2755459, @mehdi_amini wrote:

In D100647#2754448, @lattner wrote:

Thanks again Fabian. It would also be good to make sure this gets mentioned in MLIR biweekly

Yes please! :)

(here is the link: https://llvm.discourse.group/t/work-in-progress-next-mlir-news-33rd-edition-5-1-5-14-2021/3417 )

Done! :-)

Revision Contents

Path                                    Size
mlir/docs/PassManagement.md             12 lines
mlir/include/mlir/Pass/Pass.h           19 lines
mlir/include/mlir/Pass/PassManager.h    58 lines
mlir/include/mlir/Support/Timing.h      424 lines
mlir/lib/Pass/Pass.cpp                  7 lines
mlir/lib/Pass/PassManagerOptions.cpp    35 lines
mlir/lib/Pass/PassTiming.cpp            532 lines
mlir/lib/Support/CMakeLists.txt         2 lines
mlir/lib/Support/MlirOptMain.cpp        10 lines
mlir/lib/Support/Timing.cpp             568 lines
mlir/test/Pass/pass-timing.mlir         29 lines
mlir/test/Pass/pipeline-parsing.mlir    4 lines

Diff 342966

mlir/docs/PassManagement.md

	Show First 20 Lines • Show All 969 Lines • ▼ Show 20 Lines

	#### Pass Timing			#### Pass Timing

	The PassTiming instrumentation provides timing information about the execution			The PassTiming instrumentation provides timing information about the execution
	of passes and computation of analyses. This provides a quick glimpse into what			of passes and computation of analyses. This provides a quick glimpse into what
	passes are taking the most time to execute, as well as how much of an effect a			passes are taking the most time to execute, as well as how much of an effect a
	pass has on the total execution time of the pipeline. Users can enable this			pass has on the total execution time of the pipeline. Users can enable this
	instrumentation directly on the PassManager via `enableTiming`. This			instrumentation directly on the PassManager via `enableTiming`. This
	instrumentation is also made available in mlir-opt via the `-pass-timing` flag.			instrumentation is also made available in mlir-opt via the `-timing` flag.
	The PassTiming instrumentation provides several different display modes for the			The PassTiming instrumentation provides several different display modes for the
	timing results, each of which is described below:			timing results, each of which is described below:

	##### List Display Mode			##### List Display Mode

	In this mode, the results are displayed in a list sorted by total time with each			In this mode, the results are displayed in a list sorted by total time with each
	pass/analysis instance aggregated into one unique result. This view is useful			pass/analysis instance aggregated into one unique result. This view is useful
	for getting an overview of what analyses/passes are taking the most time in a			for getting an overview of what analyses/passes are taking the most time in a
	pipeline. This display mode is available in mlir-opt via			pipeline. This display mode is available in mlir-opt via
	`-pass-timing-display=list`.			`-timing-display=list`.

	```shell			```shell
	$ mlir-opt foo.mlir -mlir-disable-threading -pass-pipeline='func(cse,canonicalize)' -convert-std-to-llvm -pass-timing -pass-timing-display=list			$ mlir-opt foo.mlir -mlir-disable-threading -pass-pipeline='func(cse,canonicalize)' -convert-std-to-llvm -timing -timing-display=list

	===-------------------------------------------------------------------------===			===-------------------------------------------------------------------------===
	... Pass execution timing report ...			... Pass execution timing report ...
	===-------------------------------------------------------------------------===			===-------------------------------------------------------------------------===
	Total Execution Time: 0.0203 seconds			Total Execution Time: 0.0203 seconds

	---Wall Time--- --- Name ---			---Wall Time--- --- Name ---
	0.0047 ( 55.9%) Canonicalizer			0.0047 ( 55.9%) Canonicalizer
	0.0019 ( 22.2%) VerifierPass			0.0019 ( 22.2%) VerifierPass
	0.0016 ( 18.5%) LLVMLoweringPass			0.0016 ( 18.5%) LLVMLoweringPass
	0.0003 ( 3.4%) CSE			0.0003 ( 3.4%) CSE
	0.0002 ( 1.9%) (A) DominanceInfo			0.0002 ( 1.9%) (A) DominanceInfo
	0.0084 (100.0%) Total			0.0084 (100.0%) Total
	```			```

	##### Pipeline Display Mode			##### Tree Display Mode

	In this mode, the results are displayed in a nested pipeline view that mirrors			In this mode, the results are displayed in a nested pipeline view that mirrors
	the internal pass pipeline that is being executed in the pass manager. This view			the internal pass pipeline that is being executed in the pass manager. This view
	is useful for understanding specifically which parts of the pipeline are taking			is useful for understanding specifically which parts of the pipeline are taking
	the most time, and can also be used to identify when analyses are being			the most time, and can also be used to identify when analyses are being
	invalidated and recomputed. This is the default display mode.			invalidated and recomputed. This is the default display mode.

	```shell			```shell
	$ mlir-opt foo.mlir -mlir-disable-threading -pass-pipeline='func(cse,canonicalize)' -convert-std-to-llvm -pass-timing			$ mlir-opt foo.mlir -mlir-disable-threading -pass-pipeline='func(cse,canonicalize)' -convert-std-to-llvm -timing

	===-------------------------------------------------------------------------===			===-------------------------------------------------------------------------===
	... Pass execution timing report ...			... Pass execution timing report ...
	===-------------------------------------------------------------------------===			===-------------------------------------------------------------------------===
	Total Execution Time: 0.0249 seconds			Total Execution Time: 0.0249 seconds

	---Wall Time--- --- Name ---			---Wall Time--- --- Name ---
	0.0058 ( 70.8%) 'func' Pipeline			0.0058 ( 70.8%) 'func' Pipeline
	Show All 14 Lines
	slightly changes. First, a new timing column is added, `User Time`, that			slightly changes. First, a new timing column is added, `User Time`, that
	displays the total time spent across all threads. Secondly, the `Wall Time`			displays the total time spent across all threads. Secondly, the `Wall Time`
	column displays the longest individual time spent amongst all of the threads.			column displays the longest individual time spent amongst all of the threads.
	This means that the `Wall Time` column will continue to give an indicator on the			This means that the `Wall Time` column will continue to give an indicator on the
	perceived time, or clock time, whereas the `User Time` will display the total			perceived time, or clock time, whereas the `User Time` will display the total
	cpu time.			cpu time.

	```shell			```shell
	$ mlir-opt foo.mlir -pass-pipeline='func(cse,canonicalize)' -convert-std-to-llvm -pass-timing			$ mlir-opt foo.mlir -pass-pipeline='func(cse,canonicalize)' -convert-std-to-llvm -timing

	===-------------------------------------------------------------------------===			===-------------------------------------------------------------------------===
	... Pass execution timing report ...			... Pass execution timing report ...
	===-------------------------------------------------------------------------===			===-------------------------------------------------------------------------===
	Total Execution Time: 0.0078 seconds			Total Execution Time: 0.0078 seconds

	---User Time--- ---Wall Time--- --- Name ---			---User Time--- ---Wall Time--- --- Name ---
	0.0177 ( 88.5%) 0.0057 ( 71.3%) 'func' Pipeline			0.0177 ( 88.5%) 0.0057 ( 71.3%) 'func' Pipeline
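The multi-threaded report described above combines per-thread measurements in two different ways. The following is a minimal sketch of that rule, not the MLIR implementation: `TimeRecord` and `mergeThreadTimes` are hypothetical names, and the input is assumed to be one duration in seconds per thread that ran the pass.

```cpp
#include <algorithm>
#include <cassert>
#include <numeric>
#include <vector>

// Hypothetical record for one row of a timing report (not an MLIR type).
struct TimeRecord {
  double wall; // longest time spent on any single thread, in seconds
  double user; // total time summed over all threads, in seconds
};

// Combine the per-thread durations of one pass following the rule stated in
// the documentation: `User Time` is the sum across threads, `Wall Time` is
// the longest individual thread time.
TimeRecord mergeThreadTimes(const std::vector<double> &perThreadSeconds) {
  assert(!perThreadSeconds.empty() && "need at least one thread sample");
  TimeRecord record;
  record.user = std::accumulate(perThreadSeconds.begin(),
                                perThreadSeconds.end(), 0.0);
  record.wall = *std::max_element(perThreadSeconds.begin(),
                                  perThreadSeconds.end());
  return record;
}
```

With this rule, `User Time` can exceed `Wall Time` once several threads work in parallel, which matches the relation between the two columns in the report above.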

mlir/include/mlir/Pass/Pass.h

public:		public:
/// Assign the statistic to the given value.		/// Assign the statistic to the given value.
Statistic &operator=(unsigned value);		Statistic &operator=(unsigned value);
};		};

/// Returns the main statistics for this pass instance.		/// Returns the main statistics for this pass instance.
ArrayRef<Statistic *> getStatistics() const { return statistics; }		ArrayRef<Statistic *> getStatistics() const { return statistics; }
MutableArrayRef<Statistic *> getStatistics() { return statistics; }		MutableArrayRef<Statistic *> getStatistics() { return statistics; }

		/// Returns the thread sibling of this pass.
		///
		/// If this pass was cloned by the pass manager for the sake of
		rriddle: You can't rely on this meaning that the pass is a threading sibling with the original pass.
		fabianschuiki: Renamed this to `threadingSibling` and moved the tracking of copies from `Pass::clone()` into the `OpPassManager` constructor. Since the `OpPassManager` is only duplicated in `OpToOpPassAdaptor::runOnOperationAsyncImpl`, that should reliably identify clones that were made for the sake of threading.
		/// multi-threading, this function returns the original pass it was cloned
		/// from. This is useful for diagnostic purposes to distinguish passes that
		/// were replicated for threading purposes from passes instantiated by the
		/// user. Used to collapse passes in timing statistics.
		const Pass *getThreadingSibling() const { return threadingSibling; }

		/// Returns the thread sibling of this pass, or the pass itself if it has no
		/// sibling. See `getThreadingSibling()` for details.
		const Pass *getThreadingSiblingOrThis() const {
		return threadingSibling ? threadingSibling : this;
		}

protected:		protected:
explicit Pass(TypeID passID, Optional<StringRef> opName = llvm::None)		explicit Pass(TypeID passID, Optional<StringRef> opName = llvm::None)
: passID(passID), opName(opName) {}		: passID(passID), opName(opName) {}
Pass(const Pass &other) : Pass(other.passID, other.opName) {}		Pass(const Pass &other) : Pass(other.passID, other.opName) {}

/// Returns the current pass state.		/// Returns the current pass state.
detail::PassExecutionState &getPassState() {		detail::PassExecutionState &getPassState() {
assert(passState && "pass state was never initialized");		assert(passState && "pass state was never initialized");
private:		private:
Optional<detail::PassExecutionState> passState;		Optional<detail::PassExecutionState> passState;

/// The set of statistics held by this pass.		/// The set of statistics held by this pass.
std::vector<Statistic *> statistics;		std::vector<Statistic *> statistics;

/// The pass options registered to this pass instance.		/// The pass options registered to this pass instance.
detail::PassOptions passOptions;		detail::PassOptions passOptions;

		/// A pointer to the pass this pass was cloned from, if the clone was made by
		/// the pass manager for the sake of multi-threading.
		const Pass *threadingSibling = nullptr;

/// Allow access to 'clone'.		/// Allow access to 'clone'.
friend class OpPassManager;		friend class OpPassManager;

/// Allow access to 'passState'.		/// Allow access to 'passState'.
friend detail::OpToOpPassAdaptor;		friend detail::OpToOpPassAdaptor;

/// Allow access to 'passOptions'.		/// Allow access to 'passOptions'.
friend class PassInfo;		friend class PassInfo;

mlir/include/mlir/Pass/PassManager.h

//===- PassManager.h - Pass Management Interface ----------------- C++ --===//		//===- PassManager.h - Pass Management Interface ----------------- C++ --===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef MLIR_PASS_PASSMANAGER_H		#ifndef MLIR_PASS_PASSMANAGER_H
#define MLIR_PASS_PASSMANAGER_H		#define MLIR_PASS_PASSMANAGER_H

#include "mlir/IR/Dialect.h"		#include "mlir/IR/Dialect.h"
#include "mlir/IR/OperationSupport.h"		#include "mlir/IR/OperationSupport.h"
#include "mlir/Support/LogicalResult.h"		#include "mlir/Support/LogicalResult.h"
		#include "mlir/Support/Timing.h"
#include "llvm/ADT/Optional.h"		#include "llvm/ADT/Optional.h"
#include "llvm/ADT/SmallVector.h"		#include "llvm/ADT/SmallVector.h"
#include "llvm/ADT/iterator.h"		#include "llvm/ADT/iterator.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"

#include <functional>		#include <functional>
#include <vector>		#include <vector>

void enableIRPrinting(		void enableIRPrinting(
[](Pass *, Operation *) { return true; },		[](Pass *, Operation *) { return true; },
bool printModuleScope = true, bool printAfterOnlyOnChange = true,		bool printModuleScope = true, bool printAfterOnlyOnChange = true,
raw_ostream &out = llvm::errs(),		raw_ostream &out = llvm::errs(),
OpPrintingFlags opPrintingFlags = OpPrintingFlags());		OpPrintingFlags opPrintingFlags = OpPrintingFlags());

//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
// Pass Timing		// Pass Timing

/// A configuration struct provided to the pass timing feature.		/// Add an instrumentation to time the execution of passes and the computation
class PassTimingConfig {		/// of analyses. Timing will be reported by nesting timers into the provided
public:		/// `timingScope`.
using PrintCallbackFn = function_ref<void(raw_ostream &)>;		///
		/// Note: Timing should be enabled after all other instrumentations to avoid
/// Initialize the configuration.		/// any potential "ghost" timing from other instrumentations being
/// * 'displayMode' switch between list or pipeline display (see the		/// unintentionally included in the timing results.
/// `PassDisplayMode` enum documentation).		void enableTiming(TimingScope &timingScope);
		lattner: Unless this is a widely used typedef that could occur in downstream code, I'd recommend just eliminating the `PassTimingConfig` name, updating clients to use `TimingConfig` directly.
explicit PassTimingConfig(
PassDisplayMode displayMode = PassDisplayMode::Pipeline)
: displayMode(displayMode) {}

virtual ~PassTimingConfig();

/// A hook that may be overridden by a derived config to control the
/// printing. The callback is supplied by the framework and the config is
/// responsible to call it back with a stream for the output.
virtual void printTiming(PrintCallbackFn printCallback);

/// Return the `PassDisplayMode` this config was created with.
PassDisplayMode getDisplayMode() { return displayMode; }

private:		/// Add an instrumentation to time the execution of passes and the computation
PassDisplayMode displayMode;		/// of analyses. The pass manager will take ownership of the timing manager
};		/// passed to the function and timing will be reported by nesting timers into
		/// the timing manager's root scope.
		///
		/// Note: Timing should be enabled after all other instrumentations to avoid
		/// any potential "ghost" timing from other instrumentations being
		/// unintentionally included in the timing results.
		void enableTiming(std::unique_ptr<TimingManager> tm);

/// Add an instrumentation to time the execution of passes and the computation		/// Add an instrumentation to time the execution of passes and the computation
/// of analyses.		/// of analyses. Creates a temporary TimingManager owned by this PassManager
		/// which will be used to report timing.
		///
/// Note: Timing should be enabled after all other instrumentations to avoid		/// Note: Timing should be enabled after all other instrumentations to avoid
/// any potential "ghost" timing from other instrumentations being		/// any potential "ghost" timing from other instrumentations being
/// unintentionally included in the timing results.		/// unintentionally included in the timing results.
void enableTiming(std::unique_ptr<PassTimingConfig> config = nullptr);		void enableTiming();

		//===--------------------------------------------------------------------===//
		// Pass Statistics

/// Prompts the pass manager to print the statistics collected for each of the		/// Prompts the pass manager to print the statistics collected for each of the
/// held passes after each call to 'run'.		/// held passes after each call to 'run'.
void		void
enableStatistics(PassDisplayMode displayMode = PassDisplayMode::Pipeline);		enableStatistics(PassDisplayMode displayMode = PassDisplayMode::Pipeline);

private:		private:
/// Dump the statistics of the passes within this pass manager.		/// Dump the statistics of the passes within this pass manager.
void dumpStatistics();		void dumpStatistics();

/// Run the pass manager with crash recover enabled.		/// Run the pass manager with crash recover enabled.
LogicalResult runWithCrashRecovery(Operation *op, AnalysisManager am);		LogicalResult runWithCrashRecovery(Operation *op, AnalysisManager am);
/// Run the given passes with crash recover enabled.		/// Run the given passes with crash recover enabled.
LogicalResult		LogicalResult
runWithCrashRecovery(MutableArrayRef<std::unique_ptr<Pass>> passes,		runWithCrashRecovery(MutableArrayRef<std::unique_ptr<Pass>> passes,
Operation *op, AnalysisManager am);		Operation *op, AnalysisManager am);

/// Context this PassManager was initialized with.		/// Context this PassManager was initialized with.
MLIRContext *context;		MLIRContext *context;

/// Flag that specifies if pass statistics should be dumped.		/// Flag that specifies if pass statistics should be dumped.
Optional<PassDisplayMode> passStatisticsMode;		Optional<PassDisplayMode> passStatisticsMode;

/// A manager for pass instrumentations.		/// A manager for pass instrumentations.
std::unique_ptr<PassInstrumentor> instrumentor;		std::unique_ptr<PassInstrumentor> instrumentor;
		rriddle: Why can't this be handled by the timing instrumentation? I don't think we need these fields here.
		fabianschuiki: Yeah good point, I'll move that into the timing instrumentation.

/// An optional factory to use when generating a crash reproducer if valid.		/// An optional factory to use when generating a crash reproducer if valid.
ReproducerStreamFactory crashReproducerStreamFactory;		ReproducerStreamFactory crashReproducerStreamFactory;

/// A hash key used to detect when reinitialization is necessary.		/// A hash key used to detect when reinitialization is necessary.
llvm::hash_code initializationKey;		llvm::hash_code initializationKey;

/// Flag that specifies if pass timing is enabled.		/// Flag that specifies if pass timing is enabled.
/// Register a set of useful command-line options that can be used to configure		/// Register a set of useful command-line options that can be used to configure
/// a pass manager. The values of these options can be applied via the		/// a pass manager. The values of these options can be applied via the
/// 'applyPassManagerCLOptions' method below.		/// 'applyPassManagerCLOptions' method below.
void registerPassManagerCLOptions();		void registerPassManagerCLOptions();

/// Apply any values provided to the pass manager options that were registered		/// Apply any values provided to the pass manager options that were registered
/// with 'registerPassManagerOptions'.		/// with 'registerPassManagerOptions'.
void applyPassManagerCLOptions(PassManager &pm);		void applyPassManagerCLOptions(PassManager &pm);

		/// Apply any values provided to the timing manager options that were registered
		/// with `registerDefaultTimingManagerOptions`. This is a handy helper function
		/// if you do not want to bother creating your own timing manager and passing it
		/// to the pass manager.
		void applyDefaultTimingPassManagerCLOptions(PassManager &pm);

} // end namespace mlir		} // end namespace mlir

#endif // MLIR_PASS_PASSMANAGER_H		#endif // MLIR_PASS_PASSMANAGER_H

mlir/include/mlir/Support/Timing.h

This file was added.

				//===- Timing.h - Execution time measurement facilities ---------- C++ --===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// Facilities to measure and provide statistics on execution time.
				//
				//===----------------------------------------------------------------------===//

				#ifndef MLIR_SUPPORT_TIMING_H
				#define MLIR_SUPPORT_TIMING_H

				#include "mlir/Support/LLVM.h"
				#include "llvm/ADT/STLExtras.h"
				rriddle: This is an indicator that the dependencies are off. Support/ shouldn't depend on anything in IR/.
				fabianschuiki: Good point. Will check for some other/dedicated mechanism to map a string to an opaque pointer identifying the string content. Since we have a `TimingManager` now, that could just as well go in there.
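The mechanism discussed in the comment above, mapping a string to an opaque pointer that identifies the string content, can be sketched with a small interner. This is only an illustration under stated assumptions, not the code in this diff: `StringInterner`, `intern`, and `str` are hypothetical names, and the sketch uses `std::unordered_set` where the diff uses an LLVM string map.

```cpp
#include <cassert>
#include <string>
#include <unordered_set>

// Hypothetical interner (not the MLIR implementation). It owns one copy of
// every distinct string and hands out the address of that copy as an id.
class StringInterner {
public:
  // Return an opaque id for `s`. Equal strings always yield the same id,
  // because `insert` returns the existing element when one is present.
  // Element addresses in `std::unordered_set` stay valid across rehashing,
  // so the id remains stable for the lifetime of the interner.
  const void *intern(const std::string &s) {
    return &*pool.insert(s).first;
  }

  // Recover the string behind an id previously returned by `intern`.
  static const std::string &str(const void *id) {
    assert(id && "null identifier");
    return *static_cast<const std::string *>(id);
  }

private:
  std::unordered_set<std::string> pool;
};
```

Keeping the pool inside a manager object, rather than in a global context, is what lets the identifier avoid a dependency from Support/ on IR/.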
				#include "llvm/ADT/StringMapEntry.h"
				#include "llvm/Support/raw_ostream.h"

				namespace mlir {

				class Timer;
				class TimingManager;
				class TimingScope;
				class DefaultTimingManager;
				namespace detail {
				lattner: Please follow clang-tidy's advice here and add a virtual dtor. Also, please declare one method (e.g. the dtor) out of line in a .cpp file to give the vtable a home in a .o file.
				class TimingManagerImpl;
				class DefaultTimingManagerImpl;
				} // namespace detail

				//===----------------------------------------------------------------------===//
				// TimingIdentifier
				//===----------------------------------------------------------------------===//

				/// This class represents a uniqued string owned by a `TimingManager`. Most
				/// importantly, instances of this class provide a stable opaque pointer that
				/// is guaranteed to be reproduced by later interning of the same string. The
				/// `TimingManager` uses this mechanism to provide timers with an opaque id
				/// even when the user of the API merely provided a string as identification
				/// (instead of a pass for example).
				///
				/// This is a POD type with pointer size, so it should be passed around by
				/// value. The underlying data is owned by the `TimingManager`.
				class TimingIdentifier {
				using EntryType = llvm::StringMapEntry<llvm::NoneType>;

				public:
				TimingIdentifier(const TimingIdentifier &) = default;
				TimingIdentifier &operator=(const TimingIdentifier &other) = default;

				/// Return an identifier for the specified string.
				static TimingIdentifier get(StringRef str, TimingManager &tm);

				/// Return a `StringRef` for the string.
				StringRef strref() const { return entry->first(); }

				/// Return an `std::string`.
				std::string str() const { return strref().str(); }

				/// Return the opaque pointer that corresponds to this identifier.
				const void *getAsOpaquePointer() const {
				return static_cast<const void *>(entry);
				}

				private:
				const EntryType *entry;
				explicit TimingIdentifier(const EntryType *entry) : entry(entry) {}
				};

				//===----------------------------------------------------------------------===//
				// TimingManager
				//===----------------------------------------------------------------------===//

				/// This class represents facilities to measure execution time.
				///
				/// Libraries and infrastructure code operate on opaque `Timer` handles returned
				/// by various functions of this manager. Timers are started and stopped to
				/// demarkate regions in the code where execution time is of interest, and they
				rriddle: demarcate
				/// can be nested to provide more detailed timing resolution. Calls to the timer
				/// start, stop, and nesting functions must be balanced. To facilitate this,
				/// users are encouraged to leverage the `TimingScope` RAII-style wrapper around
				/// `Timer`s.
				///
				/// Users can provide their own implementation of `TimingManager`, or use the
				/// default `DefaultTimingManager` implementation in MLIR. Implementations
				/// override the various protected virtual functions to create, nest, start, and
				/// stop timers. A common pattern is for subclasses to provide a custom timer
				rriddle: (quoting "/// Implementations override this method to provide access to the top-level timer.") This method isn't virtual, what do you mean by the comment above?
				fabianschuiki: Oops, my bad -- that's a leftover from when `rootTimer()` below wasn't a thing.
				/// class and simply pass pointers to instances of this class around as the
				/// opaque timer handle. The manager itself can then forward callbacks to
				/// this class. Alternatively, external timing libraries may return their own
				/// opaque handles for timing scopes.
				///
				/// For example:
				/// ```
				/// void doWork(TimingManager &tm) {
				/// auto root = tm.getRootScope();
				///
				/// {
				/// auto scope = root.nest("First");
				/// doSomeWork();
				/// // <-- "First" timer stops here
				/// }
				///
				/// auto scope = root.nest("Second");
				/// doEvenMoreWork();
				/// scope.stop(); // <-- "Second" timer stops here
				///
				/// // <-- Root timer stops here
				/// }
				/// ```
				class TimingManager {
				public:
				explicit TimingManager();
				rriddle: nit: Just spell out auto here.
				virtual ~TimingManager();

				/// Get the root timer of this timing manager. The returned timer must be
				/// started and stopped manually. Execution time can be measured by nesting
				/// timers within this root timer and starting/stopping them as appropriate.
				/// Use this function only if you need access to the timer itself. Otherwise
				/// consider the more convenient `getRootScope()` which offers an RAII-style
				/// wrapper around the timer.
				rriddle: This display mode looks specific to the current pass manager implementation, I'm not sure this is generally applicable to the other potential timer backends. Couple of related general questions:
				- Do we need a general TimingConfig anymore? This was necessary before because of how the pass manager hid the internal timing instrumentation, but I'm not sure that it is really necessary anymore. How much of it would be shared across different potential timer backends?
				- Can you rename TimingManagerBase to just TimingManager? I don't think the default implementation should be called TimingManager, it should likely be called something else (not sure the name though). This would establish that TimingManager (as you currently have it) is just an implementation and not the interface.
				- Can you add comment blocks? I find these to help separate different concepts within a source file (e.g. https://github.com/llvm/llvm-project/blob/a7f33d720b74a92031958fce0e209794d296393d/mlir/include/mlir/Pass/Pass.h#L310).
				- Do you have an idea of what supporting another timing backend would look like? Not that you have to do the work, but when I've thought about this before ideally I would want to be able to support something like (1) or (2) without needing a new API/interface. Ideally these would all just be implementations that a user can choose from and pass the general infra to use for timer reporting.
				(1): https://github.com/llvm/llvm-project/blob/main/llvm/include/llvm/Support/TimeProfiler.h
				(2): https://github.com/wolfpld/tracy
				Timer getRootTimer();

				/// Get the root timer of this timing manager wrapped in a `TimingScope` for
				/// convenience. Automatically starts the timer and stops it as soon as the
				/// `TimingScope` is destroyed, e.g. when it goes out of scope.
				TimingScope getRootScope();

				protected:
				// Allow `Timer` access to the protected callbacks.
				friend class Timer;

				//===--------------------------------------------------------------------===//
				// Callbacks
				//
				// See the corresponding functions in `Timer` for additional details.

				/// Return the root timer. Implementations should return `llvm::None` if the
				/// collection of timing samples is disabled. This will cause the timers
				/// constructed from the manager to be tombstones which can be skipped
				/// quickly.
				virtual Optional<void *> rootTimer() = 0;

				/// Start the timer with the given handle.
				virtual void startTimer(void *handle) = 0;

				/// Stop the timer with the given handle.
				virtual void stopTimer(void *handle) = 0;

				/// Create a child timer nested within the one with the given handle. The `id`
				/// parameter is used to uniquely identify the timer within its parent.
				/// Multiple calls to this function with the same `handle` and `id` should
				/// return the same timer, or at least cause the samples of the returned
				/// timers to be combined for the final timing results.
				virtual void *nestTimer(void *handle, const void *id,
				std::function<std::string()> &&nameBuilder) = 0;

				/// Hide the timer in timing reports and directly show its children. This is
				/// merely a hint that implementations are free to ignore.
				rriddle: nit: Please make bool conversion operators explicit.
				virtual void hideTimer(void *handle) {}

				lattner: I agree with River's comment that it would be great to eliminate this class.
				protected:
				const std::unique_ptr<detail::TimingManagerImpl> impl;

				// Allow `TimingIdentifier::get` access to the private impl details.
				friend class TimingIdentifier;

				private:
				// Disallow copying the manager.
				TimingManager(const TimingManager &) = delete;
				void operator=(const TimingManager &) = delete;
				};

				//===----------------------------------------------------------------------===//
				// Timer
				//===----------------------------------------------------------------------===//

				/// A handle for a timer in a `TimingManager`.
				///
				/// This class encapsulates a pointer to a `TimingManager` and an opaque handle
				/// to a timer running within that manager. Libraries and infrastructure code
				/// operate on `Timer` rather than any concrete classes handed out by custom
				/// manager implementations.
				class Timer {
				public:
				Timer() {}
				Timer(const Timer &other) : tm(other.tm), handle(other.handle) {}
				Timer(Timer &&other) : Timer(other) {
				other.tm = nullptr;
				other.handle = nullptr;
				}

				Timer &operator=(Timer &&other) {
				tm = other.tm;
				handle = other.handle;
				other.tm = nullptr;
				other.handle = nullptr;
				return *this;
				}

				/// Returns whether this is a valid timer handle. Invalid timer handles are
				/// used when timing is disabled in the `TimingManager` to keep the impact on
				/// performance low.
				explicit operator bool() const { return tm != nullptr; }

				/// Start the timer. This must be accompanied by a corresponding call to
				/// `stop()` at a later point.
				void start() {
				if (tm)
				tm->startTimer(handle);
				}

				/// Stop the timer. This must have been preceded by a corresponding call to
				/// `start()` at an earlier point.
				void stop() {
				if (tm)
				tm->stopTimer(handle);
				}

				/// Create a child timer nested within this one. Multiple calls to this
				/// function with the same unique identifier `id` will return the same child
				/// timer. The timer must have been started when calling this function.
				///
				/// This function can be called from other threads, as long as this timer
				/// is not stopped before any uses of the child timer on the other thread are
				/// stopped.
				///
				/// The `nameBuilder` function is not guaranteed to be called.
				Timer nest(const void *id, std::function<std::string()> &&nameBuilder) {
				rriddle: Can you change all of the uses of `std::function<> &&` to function_ref? std::function generally ends up allocating memory, which we don't really want to do. Given that we aren't storing the function, we only really need a reference anyways.
				return tm ? Timer(*tm, tm->nestTimer(handle, id, std::move(nameBuilder)))
				: Timer();
				}

				/// See above.
				Timer nest(TimingIdentifier name) {
				return tm ? nest(name.getAsOpaquePointer(), [=]() { return name.str(); })
				: Timer();
				}

				/// See above.
				Timer nest(StringRef name) {
				return tm ? nest(TimingIdentifier::get(name, *tm)) : Timer();
				}

				/// Hide the timer in timing reports and directly show its children.
				void hide() {
				if (tm)
				tm->hideTimer(handle);
				}

				protected:
				Timer(TimingManager &tm, void *handle) : tm(&tm), handle(handle) {}

				// Allow the `TimingManager` access to the above constructor.
				friend class TimingManager;

				private:
				/// The associated timing manager.
				TimingManager *tm = nullptr;
				/// An opaque handle that identifies the timer in the timing manager
				/// implementation.
				void *handle = nullptr;
				};

				//===----------------------------------------------------------------------===//
				// TimingScope
				//===----------------------------------------------------------------------===//

				/// An RAII-style wrapper around a timer that ensures the timer is properly
				/// started and stopped.
				class TimingScope {
				public:
				TimingScope() : timer() {}
				TimingScope(const Timer &other) : timer(other) {
				if (timer)
				timer.start();
				}
				TimingScope(Timer &&other) : timer(std::move(other)) {
				if (timer)
				timer.start();
				}
				TimingScope(TimingScope &&other) : timer(std::move(other.timer)) {}
				~TimingScope() { stop(); }

				TimingScope &operator=(TimingScope &&other) {
				stop();
				timer = std::move(other.timer);
				return *this;
				}

				/// Check if the timing scope actually contains a valid timer.
				explicit operator bool() const { return bool(timer); }

				// Disable copying of the `TimingScope`.
				TimingScope(const TimingScope &) = delete;
				TimingScope &operator=(const TimingScope &) = delete;

				/// Manually stop the timer early.
				void stop() {
				timer.stop();
				timer = Timer();
				}

				/// Create a nested timing scope.
				///
				/// This returns a new `TimingScope` with a timer nested within the current
				/// scope. In this fashion, the time in this scope may be further subdivided
				/// in a more fine-grained fashion.
				template <typename... Args>
				TimingScope nest(Args... args) {
				return TimingScope(std::move(timer.nest(std::forward<Args>(args)...)));
				}

				/// Hide the timer in timing reports and directly show its children.
				void hide() { timer.hide(); }

				private:
				/// The wrapped timer.
				Timer timer;
				};

				//===----------------------------------------------------------------------===//
				// DefaultTimingManager
				//===----------------------------------------------------------------------===//

				/// Facilities for time measurement and report printing to an output stream.
				///
				/// This is MLIR's default implementation of a `TimingManager`. Prints an
				/// execution time report upon destruction, or manually through `print()`. By
				/// default the results are printed in `DisplayMode::Tree` mode to stderr.
				/// Use `setEnabled(true)` to enable collection of timing samples; it is
				/// disabled by default.
				rriddle: nit: I don't think the llvm:: here is necessary if you include LLVM.h
				///
				/// You should only instantiate a `DefaultTimingManager` if you are writing a
				/// tool and want to pass a timing manager to the remaining infrastructure. If
				/// you are writing library or infrastructure code, you should rather accept
				/// the `TimingManager` base class to allow for users of your code to substitute
				/// their own timing implementations. Also, if you only intend to collect time
				/// samples, consider accepting a `Timer` or `TimingScope` instead.
				class DefaultTimingManager : public TimingManager {
				public:
				/// The different display modes for printing the timers.
				enum class DisplayMode {
				/// In this mode the results are displayed in a list sorted by total time,
				/// with timers aggregated into one unique result per timer name.
				List,

				/// In this mode the results are displayed in a tree view, with child timers
				/// nested under their parents.
				Tree,
				};

				DefaultTimingManager();
				DefaultTimingManager(DefaultTimingManager &&rhs);
				virtual ~DefaultTimingManager();

				// Disable copying of the `DefaultTimingManager`.
				DefaultTimingManager(const DefaultTimingManager &rhs) = delete;
				DefaultTimingManager &operator=(const DefaultTimingManager &rhs) = delete;

				/// Enable or disable execution time sampling.
				void setEnabled(bool enabled);

				/// Return whether execution time sampling is enabled.
				bool isEnabled() const;

				/// Change the display mode.
				void setDisplayMode(DisplayMode displayMode);

				/// Return the current display mode.
				DisplayMode getDisplayMode() const;

				/// Change the stream where the output will be printed to.
				void setOutput(raw_ostream &os);

				/// Return the current output stream where the output will be printed to.
				raw_ostream &getOutput() const;

				/// Print and clear the timing results. Only call this when there are no more
				/// references to nested timers around, as printing post-processes and clears
				/// the timers.
				void print();

				/// Clear the timing results. Only call this when there are no more references
				/// to nested timers around, as clearing invalidates them.
				void clear();

				/// Debug print the timer data structures to an output stream.
				void dumpTimers(raw_ostream &os = llvm::errs());

				/// Debug print the timers as a list. Only call this when there are no more
				/// references to nested timers around.
				void dumpAsList(raw_ostream &os = llvm::errs());

				/// Debug print the timers as a tree. Only call this when there are no
				/// more references to nested timers around.
				void dumpAsTree(raw_ostream &os = llvm::errs());

				protected:
				// `TimingManager` callbacks
				Optional<void *> rootTimer() override;
				rriddle: Is this something we could make non-virtual? I think it would be nice if we could avoid an indirect call in the case where timing is disabled.
				fabianschuiki: The way you use the TM at the moment is roughly `DefaultTimingManager tm; TimingScope ts = tm.getRootScope();`, with all further timing calls going through `ts`. If timing is disabled, `ts` is marked as disabled through a nullptr in its `tm` field, which causes all subsequent interaction to short-circuit into no-ops. So in its current implementation, if timing is disabled there is just one virtual function call for the duration of the program, which should not be visible in practice. I do agree that it would be nicer if the setup-related interaction with the TM would be non-virtual. Especially if we start to keep per-thread stacks of active timing scopes, that should all be directly accessible. We could promote `setEnabled` and `isEnabled` into the `TimingManager` base class such that it can just never ask for the `rootTimer` if disabled. Pretty sure that all TM impls can get behind the concept of enabling/disabling. Do you want to tackle this in this diff, or push that off to a follow-up diff alongside tracking a TM in the context?
				rriddle: Followup is alright with me, thanks!
				void startTimer(void *handle) override;
				void stopTimer(void *handle) override;
				void *nestTimer(void *handle, const void *id,
				std::function<std::string()> &&nameBuilder) override;
				void hideTimer(void *handle) override;

				private:
				const std::unique_ptr<detail::DefaultTimingManagerImpl> impl;
				};

				/// Register a set of useful command-line options that can be used to configure
				/// a `DefaultTimingManager`. The values of these options can be applied via the
				/// `applyDefaultTimingManagerCLOptions` method.
				void registerDefaultTimingManagerCLOptions();

				/// Apply any values that were registered with
				/// 'registerDefaultTimingManagerCLOptions' to a `DefaultTimingManager`.
				void applyDefaultTimingManagerCLOptions(DefaultTimingManager &tm);

				} // namespace mlir

				#endif // MLIR_SUPPORT_TIMING_H
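
The inline discussion on `rootTimer()` describes a disabled timing scope as one that holds a null pointer, so that every later call short-circuits into a no-op. The following is a minimal standalone sketch of that idea, not the MLIR implementation: `SketchManager`, `SketchScope`, `rootOrNull` and `runSketch` are hypothetical names invented for illustration, and the real `TimingManager`/`TimingScope` classes differ in detail.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical stand-in for a timing manager (not the MLIR API). It simply
// remembers the names of the scopes that were opened while it was enabled.
class SketchManager {
public:
  explicit SketchManager(bool enabled) : enabled(enabled) {}

  // Returns nullptr when timing is disabled, so scopes created from the
  // result never touch the manager again.
  SketchManager *rootOrNull() { return enabled ? this : nullptr; }

  void record(const std::string &name) { names.push_back(name); }
  const std::vector<std::string> &recorded() const { return names; }

private:
  bool enabled;
  std::vector<std::string> names;
};

// Hypothetical stand-in for a timing scope. A null manager pointer marks the
// scope as disabled.
class SketchScope {
public:
  explicit SketchScope(SketchManager *tm) : tm(tm) {}

  // True if this scope is backed by an enabled manager.
  explicit operator bool() const { return tm != nullptr; }

  // Open a nested scope. On a disabled scope this is a no-op that hands back
  // another disabled scope.
  SketchScope nest(const std::string &name) {
    if (!tm)
      return SketchScope(nullptr);
    tm->record(name);
    return SketchScope(tm);
  }

private:
  SketchManager *tm;
};

// Opens two nested scopes and reports how many the manager actually saw.
std::size_t runSketch(bool enabled) {
  SketchManager tm(enabled);
  SketchScope root(tm.rootOrNull()); // the only query made to a disabled tm
  SketchScope pipeline = root.nest("pipeline");
  SketchScope pass = pipeline.nest("pass");
  (void)pass;
  return tm.recorded().size();
}
```

In this sketch the manager is consulted once when the root scope is created; after that, a disabled scope only performs a pointer check per call, which mirrors the "one virtual call for the duration of the program" argument made in the review thread.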

mlir/lib/Pass/Pass.cpp

OpPassManager::OpPassManager(Identifier name, Nesting nesting)		OpPassManager::OpPassManager(Identifier name, Nesting nesting)
: impl(new OpPassManagerImpl(name, nesting)) {}		: impl(new OpPassManagerImpl(name, nesting)) {}
OpPassManager::OpPassManager(StringRef name, Nesting nesting)		OpPassManager::OpPassManager(StringRef name, Nesting nesting)
: impl(new OpPassManagerImpl(name, nesting)) {}		: impl(new OpPassManagerImpl(name, nesting)) {}
OpPassManager::OpPassManager(OpPassManager &&rhs) : impl(std::move(rhs.impl)) {}		OpPassManager::OpPassManager(OpPassManager &&rhs) : impl(std::move(rhs.impl)) {}
OpPassManager::OpPassManager(const OpPassManager &rhs) { *this = rhs; }		OpPassManager::OpPassManager(const OpPassManager &rhs) { *this = rhs; }
OpPassManager &OpPassManager::operator=(const OpPassManager &rhs) {		OpPassManager &OpPassManager::operator=(const OpPassManager &rhs) {
impl.reset(new OpPassManagerImpl(rhs.impl->name, rhs.impl->nesting));		impl.reset(new OpPassManagerImpl(rhs.impl->name, rhs.impl->nesting));
impl->initializationGeneration = rhs.impl->initializationGeneration;		impl->initializationGeneration = rhs.impl->initializationGeneration;
for (auto &pass : rhs.impl->passes)		for (auto &pass : rhs.impl->passes) {
impl->passes.emplace_back(pass->clone());		auto newPass = pass->clone();
		newPass->threadingSibling = pass.get();
		impl->passes.push_back(std::move(newPass));
		}
return *this;		return *this;
}		}

OpPassManager::~OpPassManager() {}		OpPassManager::~OpPassManager() {}

OpPassManager::pass_iterator OpPassManager::begin() {		OpPassManager::pass_iterator OpPassManager::begin() {
return MutableArrayRef<std::unique_ptr<Pass>>{impl->passes}.begin();		return MutableArrayRef<std::unique_ptr<Pass>>{impl->passes}.begin();
}		}

mlir/lib/Pass/PassManagerOptions.cpp

//===- PassManagerOptions.cpp - PassManager Command Line Options ----------===//		//===- PassManagerOptions.cpp - PassManager Command Line Options ----------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "mlir/Pass/Pass.h"		#include "mlir/Pass/Pass.h"
#include "mlir/Pass/PassManager.h"		#include "mlir/Pass/PassManager.h"
#include "mlir/Pass/PassRegistry.h"		#include "mlir/Pass/PassRegistry.h"
		#include "mlir/Support/Timing.h"
#include "llvm/Support/CommandLine.h"		#include "llvm/Support/CommandLine.h"
#include "llvm/Support/ManagedStatic.h"		#include "llvm/Support/ManagedStatic.h"

using namespace mlir;		using namespace mlir;

namespace {		namespace {
struct PassManagerOptions {		struct PassManagerOptions {
//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
llvm::cl::opt<bool> printModuleScope{		llvm::cl::opt<bool> printModuleScope{
llvm::cl::desc("When printing IR for print-ir-[before|after]{-all} "		llvm::cl::desc("When printing IR for print-ir-[before|after]{-all} "
"always print the top-level operation"),		"always print the top-level operation"),
llvm::cl::init(false)};		llvm::cl::init(false)};

/// Add an IR printing instrumentation if enabled by any 'print-ir' flags.		/// Add an IR printing instrumentation if enabled by any 'print-ir' flags.
void addPrinterInstrumentation(PassManager &pm);		void addPrinterInstrumentation(PassManager &pm);

//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
// Pass Timing
//===--------------------------------------------------------------------===//
llvm::cl::opt<bool> passTiming{
"pass-timing",
llvm::cl::desc("Display the execution times of each pass")};
llvm::cl::opt<PassDisplayMode> passTimingDisplayMode{
"pass-timing-display",
llvm::cl::desc("Display method for pass timing data"),
llvm::cl::init(PassDisplayMode::Pipeline),
llvm::cl::values(
clEnumValN(PassDisplayMode::List, "list",
"display the results in a list sorted by total time"),
clEnumValN(PassDisplayMode::Pipeline, "pipeline",
"display the results with a nested pipeline view"))};

//===--------------------------------------------------------------------===//
// Pass Statistics		// Pass Statistics
//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
llvm::cl::opt<bool> passStatistics{		llvm::cl::opt<bool> passStatistics{
"pass-statistics", llvm::cl::desc("Display the statistics of each pass")};		"pass-statistics", llvm::cl::desc("Display the statistics of each pass")};
llvm::cl::opt<PassDisplayMode> passStatisticsDisplayMode{		llvm::cl::opt<PassDisplayMode> passStatisticsDisplayMode{
"pass-statistics-display",		"pass-statistics-display",
llvm::cl::desc("Display method for pass statistics"),		llvm::cl::desc("Display method for pass statistics"),
llvm::cl::init(PassDisplayMode::Pipeline),		llvm::cl::init(PassDisplayMode::Pipeline),
void PassManagerOptions::addPrinterInstrumentation(PassManager &pm) {		void PassManagerOptions::addPrinterInstrumentation(PassManager &pm) {
if (!shouldPrintBeforePass && !shouldPrintAfterPass)		if (!shouldPrintBeforePass && !shouldPrintAfterPass)
return;		return;

// Otherwise, add the IR printing instrumentation.		// Otherwise, add the IR printing instrumentation.
pm.enableIRPrinting(shouldPrintBeforePass, shouldPrintAfterPass,		pm.enableIRPrinting(shouldPrintBeforePass, shouldPrintAfterPass,
printModuleScope, printAfterChange, llvm::errs());		printModuleScope, printAfterChange, llvm::errs());
}		}

/// Add a pass timing instrumentation if enabled by 'pass-timing' flags.
void PassManagerOptions::addTimingInstrumentation(PassManager &pm) {
if (passTiming)
pm.enableTiming(
std::make_unique<PassManager::PassTimingConfig>(passTimingDisplayMode));
}

void mlir::registerPassManagerCLOptions() {		void mlir::registerPassManagerCLOptions() {
// Make sure that the options struct has been constructed.		// Make sure that the options struct has been constructed.
*options;		*options;
}		}

void mlir::applyPassManagerCLOptions(PassManager &pm) {		void mlir::applyPassManagerCLOptions(PassManager &pm) {
if (!options.isConstructed())		if (!options.isConstructed())
return;		return;

// Generate a reproducer on crash/failure.		// Generate a reproducer on crash/failure.
if (options->reproducerFile.getNumOccurrences())		if (options->reproducerFile.getNumOccurrences())
pm.enableCrashReproducerGeneration(options->reproducerFile,		pm.enableCrashReproducerGeneration(options->reproducerFile,
options->localReproducer);		options->localReproducer);

// Enable statistics dumping.		// Enable statistics dumping.
if (options->passStatistics)		if (options->passStatistics)
pm.enableStatistics(options->passStatisticsDisplayMode);		pm.enableStatistics(options->passStatisticsDisplayMode);

// Add the IR printing instrumentation.		// Add the IR printing instrumentation.
options->addPrinterInstrumentation(pm);		options->addPrinterInstrumentation(pm);
		}

// Note: The pass timing instrumentation should be added last to avoid any		void mlir::applyDefaultTimingPassManagerCLOptions(PassManager &pm) {
// potential "ghost" timing from other instrumentations being unintentionally		// Create a temporary timing manager for the PM to own, apply its CL options,
// included in the timing results.		// and pass it to the PM.
options->addTimingInstrumentation(pm);		auto tm = std::make_unique<DefaultTimingManager>();
		applyDefaultTimingManagerCLOptions(*tm);
		pm.enableTiming(std::move(tm));
}		}

mlir/lib/Pass/PassTiming.cpp

	//===- PassTiming.cpp -----------------------------------------------------===//			//===- PassTiming.cpp -----------------------------------------------------===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "PassDetail.h"			#include "PassDetail.h"
	#include "mlir/Pass/PassManager.h"			#include "mlir/Pass/PassManager.h"
	#include "llvm/ADT/MapVector.h"
	#include "llvm/ADT/STLExtras.h"
	#include "llvm/ADT/SmallVector.h"			#include "llvm/ADT/SmallVector.h"
	#include "llvm/ADT/Statistic.h"
	#include "llvm/Support/Format.h"
	#include "llvm/Support/FormatVariadic.h"
	#include "llvm/Support/Threading.h"			#include "llvm/Support/Threading.h"

	#include <chrono>			#include <chrono>

	using namespace mlir;			using namespace mlir;
	using namespace mlir::detail;			using namespace mlir::detail;

	constexpr StringLiteral kPassTimingDescription =			//===----------------------------------------------------------------------===//
	"... Pass execution timing report ...";			// PassTiming
				//===----------------------------------------------------------------------===//

	namespace {			namespace {
	/// Simple record class to record timing information.
	struct TimeRecord {
	TimeRecord(double wall = 0.0, double user = 0.0) : wall(wall), user(user) {}

	TimeRecord &operator+=(const TimeRecord &other) {
	wall += other.wall;
	user += other.user;
	return *this;
	}

	/// Print the current time record to 'os', with a breakdown showing
	/// contributions to the given 'total' time record.
	void print(raw_ostream &os, const TimeRecord &total) {
	if (total.user != total.wall)
	os << llvm::format(" %7.4f (%5.1f%%) ", user,
	100.0 * user / total.user);
	os << llvm::format(" %7.4f (%5.1f%%) ", wall, 100.0 * wall / total.wall);
	}

	double wall, user;
	};

	/// An enumeration of the different types of timers.
	enum class TimerKind {
	/// This timer represents an ordered collection of pass timers, corresponding
	/// to a pass pipeline.
	Pipeline,

	/// This timer represents a collection of pipeline timers.
	PipelineCollection,

	/// This timer represents an analysis or pass timer.
	PassOrAnalysis
	};

	struct Timer {
	explicit Timer(std::string &&name, TimerKind kind)
	: name(std::move(name)), kind(kind) {}

	/// Start the timer.
	void start() { startTime = std::chrono::system_clock::now(); }

	/// Stop the timer.
	void stop() {
	auto newTime = std::chrono::system_clock::now() - startTime;
	wallTime += newTime;
	userTime += newTime;
	}

	/// Get or create a child timer with the provided name and id.
	Timer *getChildTimer(const void *id, TimerKind kind,
	std::function<std::string()> &&nameBuilder) {
	auto &child = children[id];
	if (!child)
	child = std::make_unique<Timer>(nameBuilder(), kind);
	return child.get();
	}

	/// Returns the total time for this timer in seconds.
	TimeRecord getTotalTime() {
	// If this is a pass or analysis timer, use the recorded time directly.
	if (kind == TimerKind::PassOrAnalysis) {
	return TimeRecord(
	std::chrono::duration_cast<std::chrono::duration<double>>(wallTime)
	.count(),
	std::chrono::duration_cast<std::chrono::duration<double>>(userTime)
	.count());
	}

	// Otherwise, accumulate the timing from each of the children.
	TimeRecord totalTime;
	for (auto &child : children)
	totalTime += child.second->getTotalTime();
	return totalTime;
	}

	/// A map of unique identifiers to child timers.
	using ChildrenMap = llvm::MapVector<const void *, std::unique_ptr<Timer>>;

	/// Merge the timing data from 'other' into this timer.
	void merge(Timer &&other) {
	if (wallTime < other.wallTime)
	wallTime = other.wallTime;
	userTime += other.userTime;
	mergeChildren(std::move(other.children));
	}

	/// Merge the timer children in 'otherChildren' with the children of this
	/// timer.
	void mergeChildren(ChildrenMap &&otherChildren) {
	// Check for an empty children list.
	if (children.empty()) {
	children = std::move(otherChildren);
	return;
	}

	// Pipeline merges are handled separately as the children are merged
	// lexicographically.
	if (kind == TimerKind::Pipeline) {
	assert(children.size() == otherChildren.size() &&
	"pipeline merge requires the same number of children");
	for (auto it : llvm::zip(children, otherChildren))
	std::get<0>(it).second->merge(std::move(*std::get<1>(it).second));
	return;
	}

	// Otherwise, we merge children based upon their timer key.
	for (auto &otherChild : otherChildren)
	mergeChild(std::move(otherChild));
	}

	/// Merge in the given child timer and id into this timer.
	void mergeChild(ChildrenMap::value_type &&childIt) {
	auto &child = children[childIt.first];
	if (!child)
	child = std::move(childIt.second);
	else
	child->merge(std::move(*childIt.second));
	}

	/// Raw timing information.
	std::chrono::time_point<std::chrono::system_clock> startTime;
	std::chrono::nanoseconds wallTime = std::chrono::nanoseconds(0);
	std::chrono::nanoseconds userTime = std::chrono::nanoseconds(0);

	/// A map of unique identifiers to child timers.
	ChildrenMap children;

	/// A descriptive name for this timer.
	std::string name;

	/// The type of timer this instance represents.
	TimerKind kind;
	};

	struct PassTiming : public PassInstrumentation {			struct PassTiming : public PassInstrumentation {
	PassTiming(std::unique_ptr<PassManager::PassTimingConfig> config)			PassTiming(TimingScope &timingScope) : rootScope(timingScope) {}
	: config(std::move(config)) {}			PassTiming(std::unique_ptr<TimingManager> tm)
	~PassTiming() override { print(); }			: ownedTimingManager(std::move(tm)),
				ownedTimingScope(ownedTimingManager->getRootScope()),
				rootScope(ownedTimingScope) {}

				/// If a pass can spawn additional work on other threads, it records the
				/// index to its currently active timer here. Passes that run on a
				/// newly-forked thread will check this list to find the active timer of the
				/// parent thread into which the new thread should be nested.
				DenseMap<PipelineParentInfo, unsigned> parentTimerIndices;

				/// A stack of the currently active timing scopes per thread.
				DenseMap<uint64_t, SmallVector<TimingScope, 4>> activeThreadTimers;

				/// The timing manager owned by this instrumentation (in case timing was
				/// enabled by the user on the pass manager without providing an external
				/// timing manager). This must appear before the `ownedTimingScope` to
				/// ensure the timing manager is destroyed after the scope, since the latter
				/// may hold a timer that points into the former.
				std::unique_ptr<TimingManager> ownedTimingManager;
				TimingScope ownedTimingScope;

				/// The root timing scope into which timing is reported.
				TimingScope &rootScope;

				//===--------------------------------------------------------------------===//
				// Pipeline
				//===--------------------------------------------------------------------===//

	/// Setup the instrumentation hooks.
	void runBeforePipeline(Identifier name,			void runBeforePipeline(Identifier name,
	const PipelineParentInfo &parentInfo) override;			const PipelineParentInfo &parentInfo) override {
	void runAfterPipeline(Identifier name,
	const PipelineParentInfo &parentInfo) override;
	void runBeforePass(Pass *pass, Operation *) override { startPassTimer(pass); }
	void runAfterPass(Pass *pass, Operation *) override;
	void runAfterPassFailed(Pass *pass, Operation *op) override {
	runAfterPass(pass, op);
	}
	void runBeforeAnalysis(StringRef name, TypeID id, Operation *) override {
	startAnalysisTimer(name, id);
	}
	void runAfterAnalysis(StringRef, TypeID, Operation *) override;

	/// Print and clear the timing results.
	void print();

	/// Start a new timer for the given pass.
	void startPassTimer(Pass *pass);

	/// Start a new timer for the given analysis.
	void startAnalysisTimer(StringRef name, TypeID id);

	/// Pop the last active timer for the current thread.
	Timer *popLastActiveTimer() {
	auto tid = llvm::get_threadid();			auto tid = llvm::get_threadid();
	auto &activeTimers = activeThreadTimers[tid];			auto &activeTimers = activeThreadTimers[tid];
	assert(!activeTimers.empty() && "expected active timer");
	return activeTimers.pop_back_val();
	}

	/// Print the timing result in list mode.
	void printResultsAsList(raw_ostream &os, Timer *root, TimeRecord totalTime);

	/// Print the timing result in pipeline mode.
	void printResultsAsPipeline(raw_ostream &os, Timer *root,
	TimeRecord totalTime);

	/// Returns a timer for the provided identifier and name.
	Timer *getTimer(const void *id, TimerKind kind,
	std::function<std::string()> &&nameBuilder) {
	auto tid = llvm::get_threadid();

	// If there is no active timer then add to the root timer.			TimingScope *parentScope;
	auto &activeTimers = activeThreadTimers[tid];
	Timer *parentTimer;
	if (activeTimers.empty()) {			if (activeTimers.empty()) {
	auto &rootTimer = rootTimers[tid];			auto it = parentTimerIndices.find(parentInfo);
	if (!rootTimer)			if (it != parentTimerIndices.end())
	rootTimer = std::make_unique<Timer>("root", TimerKind::Pipeline);			parentScope =
	parentTimer = rootTimer.get();			&activeThreadTimers[parentInfo.parentThreadID][it->second];
				else
				parentScope = &rootScope;
	} else {			} else {
	// Otherwise, add this to the active timer.			parentScope = &activeTimers.back();
	parentTimer = activeTimers.back();
	}			}
				activeTimers.push_back(parentScope->nest(name.getAsOpaquePointer(), [name] {
	auto timer = parentTimer->getChildTimer(id, kind, std::move(nameBuilder));			return ("'" + name.strref() + "' Pipeline").str();
	activeTimers.push_back(timer);			}));
	return timer;
	}			}

	/// The root top level timers for each thread.			void runAfterPipeline(Identifier, const PipelineParentInfo &) override {
	DenseMap<uint64_t, std::unique_ptr<Timer>> rootTimers;			auto &activeTimers = activeThreadTimers[llvm::get_threadid()];
				assert(!activeTimers.empty() && "expected active timer");
	/// A stack of the currently active pass timers per thread.			activeTimers.pop_back();
	DenseMap<uint64_t, SmallVector<Timer *, 4>> activeThreadTimers;			}

	/// The configuration object to use when printing the timing results.			//===--------------------------------------------------------------------===//
	std::unique_ptr<PassManager::PassTimingConfig> config;			// Pass
				//===--------------------------------------------------------------------===//

	/// A mapping of pipeline timers that need to be merged into the parent			void runBeforePass(Pass *pass, Operation *) override {
	/// collection. The timers are mapped to the parent info to merge into.			auto tid = llvm::get_threadid();
	DenseMap<PipelineParentInfo, SmallVector<Timer::ChildrenMap::value_type, 4>>			auto &activeTimers = activeThreadTimers[tid];
	pipelinesToMerge;			auto &parentScope = activeTimers.empty() ? rootScope : activeTimers.back();
	};
	} // end anonymous namespace

	void PassTiming::runBeforePipeline(Identifier name,			if (auto *adaptor = dyn_cast<OpToOpPassAdaptor>(pass)) {
	const PipelineParentInfo &parentInfo) {			parentTimerIndices[{tid, pass}] = activeTimers.size();
	// We don't actually want to time the pipelines, they gather their total			auto scope =
	// from their held passes.			parentScope.nest(pass->getThreadingSiblingOrThis(),
	getTimer(name.getAsOpaquePointer(), TimerKind::Pipeline,			[adaptor]() { return adaptor->getAdaptorName(); });
	[&] { return ("'" + name.strref() + "' Pipeline").str(); });			if (adaptor->getPassManagers().size() <= 1)
				scope.hide();
				activeTimers.push_back(std::move(scope));
				} else {
				activeTimers.push_back(
				parentScope.nest(pass->getThreadingSiblingOrThis(),
				[pass]() { return std::string(pass->getName()); }));
				}
	}			}

	void PassTiming::runAfterPipeline(Identifier name,			void runAfterPass(Pass *pass, Operation *) override {
	const PipelineParentInfo &parentInfo) {
	// Pop the timer for the pipeline.
	auto tid = llvm::get_threadid();			auto tid = llvm::get_threadid();
				if (isa<OpToOpPassAdaptor>(pass))
				rriddle: nit: Drop the trivial braces here.
				parentTimerIndices.erase({tid, pass});
	auto &activeTimers = activeThreadTimers[tid];			auto &activeTimers = activeThreadTimers[tid];
	assert(!activeTimers.empty() && "expected active timer");			assert(!activeTimers.empty() && "expected active timer");
	activeTimers.pop_back();			activeTimers.pop_back();

	// If the current thread is the same as the parent, there is nothing left to
	// do.
	if (tid == parentInfo.parentThreadID)
	return;

	// Otherwise, mark the pipeline timer for merging into the correct parent
	// thread.
	assert(activeTimers.empty() && "expected parent timer to be root");
	auto *parentTimer = rootTimers[tid].get();
	assert(parentTimer->children.size() == 1 &&
	parentTimer->children.count(name.getAsOpaquePointer()) &&
	"expected a single pipeline timer");
	pipelinesToMerge[parentInfo].push_back(
	std::move(*parentTimer->children.begin()));
	rootTimers.erase(tid);
	}

	/// Start a new timer for the given pass.
	void PassTiming::startPassTimer(Pass *pass) {
	auto kind = isa<OpToOpPassAdaptor>(pass) ? TimerKind::PipelineCollection
	: TimerKind::PassOrAnalysis;
	Timer *timer = getTimer(pass, kind, [pass]() -> std::string {
	if (auto *adaptor = dyn_cast<OpToOpPassAdaptor>(pass))
	return adaptor->getAdaptorName();
	return std::string(pass->getName());
	});

	// We don't actually want to time the adaptor passes, they gather their total
	// from their held passes.
	if (!isa<OpToOpPassAdaptor>(pass))
	timer->start();
	}

	/// Start a new timer for the given analysis.
	void PassTiming::startAnalysisTimer(StringRef name, TypeID id) {
	Timer *timer = getTimer(id.getAsOpaquePointer(), TimerKind::PassOrAnalysis,
	[name] { return "(A) " + name.str(); });
	timer->start();
	}

	/// Stop a pass timer.
	void PassTiming::runAfterPass(Pass *pass, Operation *) {
	Timer *timer = popLastActiveTimer();

	// Check to see if we need to merge in the timing data for the pipelines
	// running on other threads.
	auto toMerge = pipelinesToMerge.find({llvm::get_threadid(), pass});
	if (toMerge != pipelinesToMerge.end()) {
	for (auto &it : toMerge->second)
	timer->mergeChild(std::move(it));
	pipelinesToMerge.erase(toMerge);
	}

	timer->stop();
	}

	/// Stop a timer.
	void PassTiming::runAfterAnalysis(StringRef, TypeID, Operation *) {
	popLastActiveTimer()->stop();
	}

	/// Utility to print the timer heading information.
	static void printTimerHeader(raw_ostream &os, TimeRecord total) {
	os << "===" << std::string(73, '-') << "===\n";
	// Figure out how many spaces to indent the description.
	unsigned padding = (80 - kPassTimingDescription.size()) / 2;
	os.indent(padding) << kPassTimingDescription << '\n';
	os << "===" << std::string(73, '-') << "===\n";

	// Print the total time followed by the section headers.
	os << llvm::format(" Total Execution Time: %5.4f seconds\n\n", total.wall);
	if (total.user != total.wall)
	os << " ---User Time---";
	os << " ---Wall Time--- --- Name ---\n";
	}

	/// Utility to print a single line entry in the timer output.
	static void printTimeEntry(raw_ostream &os, unsigned indent, StringRef name,
	TimeRecord time, TimeRecord totalTime) {
	time.print(os, totalTime);
	os.indent(indent) << name << "\n";
	}

	/// Print out the current timing information.
	void PassTiming::print() {
	// Don't print anything if there is no timing data.
	if (rootTimers.empty())
	return;

	assert(rootTimers.size() == 1 && "expected one remaining root timer");

	auto printCallback = [&](raw_ostream &os) {
	auto &rootTimer = rootTimers.begin()->second;
	// Print the timer header.
	TimeRecord totalTime = rootTimer->getTotalTime();
	printTimerHeader(os, totalTime);
	// Defer to a specialized printer for each display mode.
	switch (config->getDisplayMode()) {
	case PassDisplayMode::List:
	printResultsAsList(os, rootTimer.get(), totalTime);
	break;
	case PassDisplayMode::Pipeline:
	printResultsAsPipeline(os, rootTimer.get(), totalTime);
	break;
	}
	printTimeEntry(os, 0, "Total", totalTime, totalTime);
	os.flush();

	// Reset root timers.
	rootTimers.clear();
	activeThreadTimers.clear();
	};

	config->printTiming(printCallback);
	}			}

	// The default implementation for printTiming uses			void runAfterPassFailed(Pass *pass, Operation *op) override {
	// `llvm::CreateInfoOutputFile()` as stream, it can be overridden by clients			runAfterPass(pass, op);
	// to customize the output.
	void PassManager::PassTimingConfig::printTiming(PrintCallbackFn printCallback) {
	printCallback(*llvm::CreateInfoOutputFile());
	}

	/// Print the timing result in list mode.
	void PassTiming::printResultsAsList(raw_ostream &os, Timer *root,
	TimeRecord totalTime) {
	llvm::StringMap<TimeRecord> mergedTimings;

	std::function<void(Timer *)> addTimer = [&](Timer *timer) {
	// Only add timing information for passes and analyses.
	if (timer->kind == TimerKind::PassOrAnalysis)
	mergedTimings[timer->name] += timer->getTotalTime();
	for (auto &children : timer->children)
	addTimer(children.second.get());
	};

	// Add each of the top level timers.
	for (auto &topLevelTimer : root->children)
	addTimer(topLevelTimer.second.get());

	// Sort the timing information by wall time.
	std::vector<std::pair<StringRef, TimeRecord>> timerNameAndTime;
	for (auto &it : mergedTimings)
	timerNameAndTime.emplace_back(it.first(), it.second);
	llvm::array_pod_sort(timerNameAndTime.begin(), timerNameAndTime.end(),
	[](const std::pair<StringRef, TimeRecord> *lhs,
	const std::pair<StringRef, TimeRecord> *rhs) {
	return llvm::array_pod_sort_comparator<double>(
	&rhs->second.wall, &lhs->second.wall);
	});

	// Print the timing information sequentially.
	for (auto &timeData : timerNameAndTime)
	printTimeEntry(os, 0, timeData.first, timeData.second, totalTime);
	}

	/// Print the timing result in pipeline mode.
	void PassTiming::printResultsAsPipeline(raw_ostream &os, Timer *root,
	TimeRecord totalTime) {
	std::function<void(unsigned, Timer *)> printTimer = [&](unsigned indent,
	Timer *timer) {
	// If this is a timer for a pipeline collection and the collection only has
	// one pipeline child, then only print the child.
	if (timer->kind == TimerKind::PipelineCollection &&
	timer->children.size() == 1)
	return printTimer(indent, timer->children.begin()->second.get());

	printTimeEntry(os, indent, timer->name, timer->getTotalTime(), totalTime);

	// If this timer is a pipeline, then print the children in-order.
	if (timer->kind == TimerKind::Pipeline) {
	for (auto &child : timer->children)
	printTimer(indent + 2, child.second.get());
	return;
	}			}

	// Otherwise, sort the children by name to give a deterministic ordering			//===--------------------------------------------------------------------===//
	// when emitting the time.			// Analysis
	SmallVector<Timer *, 4> children;			//===--------------------------------------------------------------------===//
	children.reserve(timer->children.size());
	for (auto &child : timer->children)
	children.push_back(child.second.get());
	llvm::array_pod_sort(children.begin(), children.end(),
	[](Timer *const *lhs, Timer *const *rhs) {
	return (*lhs)->name.compare((*rhs)->name);
	});
	for (auto &child : children)
	printTimer(indent + 2, child);
	};

	// Print each of the top level timers.			void runBeforeAnalysis(StringRef name, TypeID id, Operation *) override {
	for (auto &topLevelTimer : root->children)			auto tid = llvm::get_threadid();
	printTimer(0, topLevelTimer.second.get());			auto &activeTimers = activeThreadTimers[tid];
				auto &parentScope = activeTimers.empty() ? rootScope : activeTimers.back();
				activeTimers.push_back(parentScope.nest(
				id.getAsOpaquePointer(), [name] { return "(A) " + name.str(); }));
	}			}

	// Out-of-line as key function.			void runAfterAnalysis(StringRef, TypeID, Operation *) override {
	PassManager::PassTimingConfig::~PassTimingConfig() {}			auto &activeTimers = activeThreadTimers[llvm::get_threadid()];
				assert(!activeTimers.empty() && "expected active timer");
				activeTimers.pop_back();
				}
				};
				} // namespace

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// PassManager			// PassManager
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	/// Add an instrumentation to time the execution of passes and the computation			/// Add an instrumentation to time the execution of passes and the computation
	/// of analyses.			/// of analyses.
	void PassManager::enableTiming(std::unique_ptr<PassTimingConfig> config) {			void PassManager::enableTiming(TimingScope &timingScope) {
	// Check if pass timing is already enabled.			if (!timingScope)
	if (passTiming)
	return;			return;
	if (!config)			addInstrumentation(std::make_unique<PassTiming>(timingScope));
	config = std::make_unique<PassManager::PassTimingConfig>();			}
	addInstrumentation(std::make_unique<PassTiming>(std::move(config)));
	passTiming = true;			/// Add an instrumentation to time the execution of passes and the computation
				/// of analyses.
				void PassManager::enableTiming(std::unique_ptr<TimingManager> tm) {
				if (!tm->getRootTimer())
				return; // no need to keep the timing manager around if it's disabled
				addInstrumentation(std::make_unique<PassTiming>(std::move(tm)));
				}

				/// Add an instrumentation to time the execution of passes and the computation
				/// of analyses.
				void PassManager::enableTiming() {
				auto tm = std::make_unique<DefaultTimingManager>();
				tm->setEnabled(true);
				enableTiming(std::move(tm));
	}			}
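
The new `runBeforeAnalysis` / `runAfterAnalysis` hooks above keep one stack of active timers per thread and nest each new timer under the innermost active one, falling back to the root scope. The following is a minimal standalone sketch of that bookkeeping, not the MLIR implementation: `SketchInstrumentation` and its members are hypothetical names, it records timer paths instead of measuring time, and the mutex is an assumption of the sketch rather than something shown in the diff.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <mutex>
#include <string>
#include <thread>
#include <vector>

// Hypothetical instrumentation; it records timer paths instead of timing them.
class SketchInstrumentation {
public:
  // Open a timer nested under the innermost open timer of the calling thread,
  // or under the root if this thread has nothing open.
  void runBefore(const std::string &name) {
    std::lock_guard<std::mutex> lock(mutex);
    std::vector<std::string> &stack = active[std::this_thread::get_id()];
    const std::string parent = stack.empty() ? std::string("root") : stack.back();
    stack.push_back(parent + "/" + name);
    opened.push_back(stack.back());
  }

  // Close the innermost open timer of the calling thread.
  void runAfter() {
    std::lock_guard<std::mutex> lock(mutex);
    std::vector<std::string> &stack = active[std::this_thread::get_id()];
    assert(!stack.empty() && "expected active timer");
    stack.pop_back();
  }

  // Number of timers the calling thread still has open.
  std::size_t depth() {
    std::lock_guard<std::mutex> lock(mutex);
    return active[std::this_thread::get_id()].size();
  }

  // Paths of all timers opened so far, in opening order.
  std::vector<std::string> openedPaths() {
    std::lock_guard<std::mutex> lock(mutex);
    return opened;
  }

private:
  std::mutex mutex; // guards both members below (an assumption of this sketch)
  std::map<std::thread::id, std::vector<std::string>> active;
  std::vector<std::string> opened;
};
```

Calling `runBefore("Pipeline")` followed by `runBefore("(A) DominanceInfo")` on one thread would record the second timer as a child of the first, which mirrors how the analysis timers in the diff end up below the pass that requested them.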

mlir/lib/Support/CMakeLists.txt

	set(LLVM_OPTIONAL_SOURCES			set(LLVM_OPTIONAL_SOURCES
	DebugCounter.cpp			DebugCounter.cpp
	FileUtilities.cpp			FileUtilities.cpp
	IndentedOstream.cpp			IndentedOstream.cpp
	MlirOptMain.cpp			MlirOptMain.cpp
	StorageUniquer.cpp			StorageUniquer.cpp
				Timing.cpp
	ToolUtilities.cpp			ToolUtilities.cpp
	)			)

	add_mlir_library(MLIRSupport			add_mlir_library(MLIRSupport
	DebugCounter.cpp			DebugCounter.cpp
	FileUtilities.cpp			FileUtilities.cpp
	StorageUniquer.cpp			StorageUniquer.cpp
				Timing.cpp
	ToolUtilities.cpp			ToolUtilities.cpp

	ADDITIONAL_HEADER_DIRS			ADDITIONAL_HEADER_DIRS
	${MLIR_MAIN_INCLUDE_DIR}/mlir/Support			${MLIR_MAIN_INCLUDE_DIR}/mlir/Support

	LINK_LIBS PUBLIC			LINK_LIBS PUBLIC
	${LLVM_PTHREAD_LIB})			${LLVM_PTHREAD_LIB})

	Show All 22 Lines

mlir/lib/Support/MlirOptMain.cpp

Show All 18 Lines
#include "mlir/IR/Dialect.h"		#include "mlir/IR/Dialect.h"
#include "mlir/IR/Location.h"		#include "mlir/IR/Location.h"
#include "mlir/IR/MLIRContext.h"		#include "mlir/IR/MLIRContext.h"
#include "mlir/Parser.h"		#include "mlir/Parser.h"
#include "mlir/Pass/Pass.h"		#include "mlir/Pass/Pass.h"
#include "mlir/Pass/PassManager.h"		#include "mlir/Pass/PassManager.h"
#include "mlir/Support/DebugCounter.h"		#include "mlir/Support/DebugCounter.h"
#include "mlir/Support/FileUtilities.h"		#include "mlir/Support/FileUtilities.h"
		#include "mlir/Support/Timing.h"
#include "mlir/Support/ToolUtilities.h"		#include "mlir/Support/ToolUtilities.h"
#include "llvm/Support/CommandLine.h"		#include "llvm/Support/CommandLine.h"
#include "llvm/Support/FileUtilities.h"		#include "llvm/Support/FileUtilities.h"
#include "llvm/Support/InitLLVM.h"		#include "llvm/Support/InitLLVM.h"
#include "llvm/Support/Regex.h"		#include "llvm/Support/Regex.h"
#include "llvm/Support/SourceMgr.h"		#include "llvm/Support/SourceMgr.h"
#include "llvm/Support/StringSaver.h"		#include "llvm/Support/StringSaver.h"
#include "llvm/Support/ToolOutputFile.h"		#include "llvm/Support/ToolOutputFile.h"

using namespace mlir;		using namespace mlir;
using namespace llvm;		using namespace llvm;
using llvm::SMLoc;		using llvm::SMLoc;

/// Perform the actions on the input file indicated by the command line flags		/// Perform the actions on the input file indicated by the command line flags
/// within the specified context.		/// within the specified context.
///		///
/// This typically parses the main source file, runs zero or more optimization		/// This typically parses the main source file, runs zero or more optimization
/// passes, then prints the output.		/// passes, then prints the output.
///		///
static LogicalResult performActions(raw_ostream &os, bool verifyDiagnostics,		static LogicalResult performActions(raw_ostream &os, bool verifyDiagnostics,
bool verifyPasses, SourceMgr &sourceMgr,		bool verifyPasses, SourceMgr &sourceMgr,
MLIRContext *context,		MLIRContext *context,
const PassPipelineCLParser &passPipeline) {		const PassPipelineCLParser &passPipeline) {
		DefaultTimingManager tm;
		rriddle: I'm not sure about this type of integration. This seems to always enforce timing, which isn't something that I would expect. If we start integrating timing more deeply into the compiler, I would expect it to do nothing unless explicitly enabled.
		fabianschuiki (author): The timing is disabled in the manager upon construction. The call to `applyDefaultTimingManagerCLOptions(tm)` on the next line enables timing if the options are present on the command line. If the timing remains disabled, `getRootScope()` below returns a disabled `TimingScope` that has a fast path to do nothing (just a pointer comparison). With some integration into `MLIRContext` and a stack of running timers, you could remove the `timing` variable below and instead have things like:
		    DefaultTimingManager tm;
		    applyDefaultTimingManagerCLOptions(tm);
		    context->setTimingManager(tm); // or some ownership transfer
		    auto parserTiming = context->getTimingManager()->addTimingScope("Parser");
		    parserTiming.stop();
		    auto outputTiming = context->getTimingManager()->addTimingScope("Output");
		rriddle: Ah, I completely missed that. Thanks for pointing it out! I like what you have here.
		applyDefaultTimingManagerCLOptions(tm);
		TimingScope timing = tm.getRootScope();

// Disable multi-threading when parsing the input file. This removes the		// Disable multi-threading when parsing the input file. This removes the
// unnecessary/costly context synchronization when parsing.		// unnecessary/costly context synchronization when parsing.
bool wasThreadingEnabled = context->isMultithreadingEnabled();		bool wasThreadingEnabled = context->isMultithreadingEnabled();
context->disableMultithreading();		context->disableMultithreading();

// Parse the input file and reset the context threading state.		// Parse the input file and reset the context threading state.
		TimingScope parserTiming = timing.nest("Parser");
OwningModuleRef module(parseSourceFile(sourceMgr, context));		OwningModuleRef module(parseSourceFile(sourceMgr, context));
context->enableMultithreading(wasThreadingEnabled);		context->enableMultithreading(wasThreadingEnabled);
if (!module)		if (!module)
return failure();		return failure();
		parserTiming.stop();

// Apply any pass manager command line options.		// Apply any pass manager command line options.
PassManager pm(context, OpPassManager::Nesting::Implicit);		PassManager pm(context, OpPassManager::Nesting::Implicit);
pm.enableVerifier(verifyPasses);		pm.enableVerifier(verifyPasses);
applyPassManagerCLOptions(pm);		applyPassManagerCLOptions(pm);
		pm.enableTiming(timing);

auto errorHandler = [&](const Twine &msg) {		auto errorHandler = [&](const Twine &msg) {
emitError(UnknownLoc::get(context)) << msg;		emitError(UnknownLoc::get(context)) << msg;
return failure();		return failure();
};		};

// Build the provided pipeline.		// Build the provided pipeline.
if (failed(passPipeline.addToPipeline(pm, errorHandler)))		if (failed(passPipeline.addToPipeline(pm, errorHandler)))
return failure();		return failure();

// Run the pipeline.		// Run the pipeline.
if (failed(pm.run(*module)))		if (failed(pm.run(*module)))
return failure();		return failure();

// Print the output.		// Print the output.
		TimingScope outputTiming = timing.nest("Output");
module->print(os);		module->print(os);
os << '\n';		os << '\n';
return success();		return success();
}		}

/// Parses the memory buffer. If successful, run a series of passes against		/// Parses the memory buffer. If successful, run a series of passes against
/// it and print the result.		/// it and print the result.
static LogicalResult processBuffer(raw_ostream &os,		static LogicalResult processBuffer(raw_ostream &os,
▲ Show 20 Lines • Show All 102 Lines • ▼ Show 20 Lines	static cl::opt<bool> runRepro(
cl::init(false));		cl::init(false));

InitLLVM y(argc, argv);		InitLLVM y(argc, argv);

// Register any command line options.		// Register any command line options.
registerAsmPrinterCLOptions();		registerAsmPrinterCLOptions();
registerMLIRContextCLOptions();		registerMLIRContextCLOptions();
registerPassManagerCLOptions();		registerPassManagerCLOptions();
		registerDefaultTimingManagerCLOptions();
DebugCounter::registerCLOptions();		DebugCounter::registerCLOptions();
PassPipelineCLParser passPipeline("", "Compiler passes to run");		PassPipelineCLParser passPipeline("", "Compiler passes to run");

// Build the list of dialects as a header for the --help message.		// Build the list of dialects as a header for the --help message.
std::string helpHeader = (toolName + "\nAvailable Dialects: ").str();		std::string helpHeader = (toolName + "\nAvailable Dialects: ").str();
{		{
llvm::raw_string_ostream os(helpHeader);		llvm::raw_string_ostream os(helpHeader);
MLIRContext context;		MLIRContext context;
▲ Show 20 Lines • Show All 54 Lines • Show Last 20 Lines
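
The review thread in `performActions` rests on one property: a disabled manager hands out a null scope, so `nest()` and `stop()` cost only a pointer check. A minimal standalone sketch of that pattern follows. It is not the MLIR `TimingScope`; `SketchTimer` and `SketchScope` are hypothetical names, and the sketch uses `std::chrono::steady_clock` as its own choice of clock.

```cpp
#include <cassert>
#include <chrono>
#include <memory>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for a timer node; not the MLIR `TimerImpl`.
struct SketchTimer {
  explicit SketchTimer(std::string name) : name(std::move(name)) {}
  std::string name;
  int starts = 0; // how often the timer was started
  std::chrono::nanoseconds total{0};
  std::chrono::steady_clock::time_point startTime;
  std::vector<std::unique_ptr<SketchTimer>> children;
};

// RAII scope around an optional timer. A null timer means "timing disabled".
class SketchScope {
public:
  SketchScope() = default;
  explicit SketchScope(SketchTimer *t) : timer(t) {
    if (timer) {
      ++timer->starts;
      timer->startTime = std::chrono::steady_clock::now();
    }
  }
  SketchScope(SketchScope &&other) noexcept : timer(other.timer) {
    other.timer = nullptr;
  }
  SketchScope(const SketchScope &) = delete;
  SketchScope &operator=(const SketchScope &) = delete;
  SketchScope &operator=(SketchScope &&) = delete;
  ~SketchScope() { stop(); }

  // True if this scope wraps a running timer.
  explicit operator bool() const { return timer != nullptr; }

  // Stop the timer; a no-op for a disabled or already stopped scope.
  void stop() {
    if (!timer)
      return;
    timer->total += std::chrono::duration_cast<std::chrono::nanoseconds>(
        std::chrono::steady_clock::now() - timer->startTime);
    timer = nullptr;
  }

  // Start a child timer; a disabled scope returns another disabled scope.
  SketchScope nest(const std::string &name) {
    if (!timer)
      return SketchScope();
    timer->children.push_back(std::make_unique<SketchTimer>(name));
    return SketchScope(timer->children.back().get());
  }

private:
  SketchTimer *timer = nullptr;
};
```

With this shape the calling code looks the same whether or not timing is enabled, which is the point fabianschuiki makes in the thread: the caller never has to branch on an "enabled" flag.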

mlir/lib/Support/Timing.cpp

This file was added.

				//===- Timing.cpp - Execution time measurement facilities -----------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// Facilities to measure and provide statistics on execution time.
				//
				//===----------------------------------------------------------------------===//

				#include "mlir/Support/Timing.h"
				#include "mlir/Support/ThreadLocalCache.h"
				#include "llvm/ADT/MapVector.h"
				#include "llvm/ADT/Statistic.h"
				#include "llvm/ADT/StringMap.h"
				#include "llvm/ADT/StringSet.h"
				#include "llvm/Support/Allocator.h"
				#include "llvm/Support/CommandLine.h"
				#include "llvm/Support/Format.h"
				#include "llvm/Support/FormatVariadic.h"
				#include "llvm/Support/ManagedStatic.h"
				#include "llvm/Support/RWMutex.h"
				#include "llvm/Support/Threading.h"
				#include "llvm/Support/raw_ostream.h"

				#include <atomic>
				#include <chrono>

				using namespace mlir;
				using namespace detail;
				using llvm::StringRef;
				rriddle: This shouldn't be necessary, StringRef is exported in the mlir namespace (see LLVM.h)
				using DisplayMode = DefaultTimingManager::DisplayMode;

				constexpr llvm::StringLiteral kTimingDescription =
				"... Execution time report ...";

				//===----------------------------------------------------------------------===//
				// TimingManager
				//===----------------------------------------------------------------------===//

				namespace mlir {
				namespace detail {
				/// Private implementation details of the `TimingManager`.
				class TimingManagerImpl {
				public:
				// Identifier allocator, map, and mutex for thread safety.
				llvm::BumpPtrAllocator identifierAllocator;
				llvm::StringSet<llvm::BumpPtrAllocator &> identifiers;
				llvm::sys::SmartRWMutex<true> identifierMutex;

				/// A thread local cache of identifiers to reduce lock contention.
				ThreadLocalCache<llvm::StringMap<llvm::StringMapEntry<llvm::NoneType> *>>
				localIdentifierCache;

				TimingManagerImpl() : identifiers(identifierAllocator) {}
				};
				} // namespace detail
				} // namespace mlir

				TimingManager::TimingManager() : impl(std::make_unique<TimingManagerImpl>()) {}

				TimingManager::~TimingManager() {}

				/// Get the root timer of this timing manager.
				Timer TimingManager::getRootTimer() {
				auto rt = rootTimer();
				return rt.hasValue() ? Timer(*this, rt.getValue()) : Timer();
				}

				/// Get the root timer of this timing manager wrapped in a `TimingScope`.
				TimingScope TimingManager::getRootScope() {
				return TimingScope(getRootTimer());
				}

				//===----------------------------------------------------------------------===//
				// Identifier uniquing
				//===----------------------------------------------------------------------===//

				/// Return an identifier for the specified string.
				TimingIdentifier TimingIdentifier::get(StringRef str, TimingManager &tm) {
				// Check for an existing instance in the local cache.
				auto &impl = *tm.impl;
				auto &localEntry = (*impl.localIdentifierCache)[str];
				if (localEntry)
				return TimingIdentifier(localEntry);

				// Check for an existing identifier in read-only mode.
				{
				llvm::sys::SmartScopedReader<true> contextLock(impl.identifierMutex);
				auto it = impl.identifiers.find(str);
				if (it != impl.identifiers.end()) {
				localEntry = &*it;
				return TimingIdentifier(localEntry);
				}
				}

				// Acquire a writer-lock so that we can safely create the new instance.
				llvm::sys::SmartScopedWriter<true> contextLock(impl.identifierMutex);
				auto it = impl.identifiers.insert(str).first;
				localEntry = &*it;
				return TimingIdentifier(localEntry);
				}

				//===----------------------------------------------------------------------===//
				// Helpers for time record printing
				//===----------------------------------------------------------------------===//
				lattner: As clang tidy suggests please declare this as a class. The difference matters for MSVC

				namespace {

				/// Simple record class to record timing information.
				struct TimeRecord {
				TimeRecord(double wall = 0.0, double user = 0.0) : wall(wall), user(user) {}

				TimeRecord &operator+=(const TimeRecord &other) {
				wall += other.wall;
				user += other.user;
				return *this;
				}

				TimeRecord &operator-=(const TimeRecord &other) {
				wall -= other.wall;
				user -= other.user;
				return *this;
				}

				/// Print the current time record to 'os', with a breakdown showing
				/// contributions to the given 'total' time record.
				void print(raw_ostream &os, const TimeRecord &total) {
				if (total.user != total.wall)
				os << llvm::format(" %8.4f (%5.1f%%)", user, 100.0 * user / total.user);
				os << llvm::format(" %8.4f (%5.1f%%) ", wall, 100.0 * wall / total.wall);
				}

				rriddle: The virtual isn't necessary when you have override.
				double wall, user;
				};

				} // namespace

				rriddle: nit: Drop the else after a return.
				/// Utility to print a single line entry in the timer output.
				static void printTimeEntry(raw_ostream &os, unsigned indent, StringRef name,
				TimeRecord time, TimeRecord total) {
				time.print(os, total);
				os.indent(indent) << name << "\n";
				}

				/// Utility to print the timer heading information.
				static void printTimeHeader(raw_ostream &os, TimeRecord total) {
				// Figure out how many spaces are needed to center the description.
				unsigned padding = (80 - kTimingDescription.size()) / 2;
				os << "===" << std::string(73, '-') << "===\n";
				os.indent(padding) << kTimingDescription << '\n';
				os << "===" << std::string(73, '-') << "===\n";

				// Print the total time followed by the section headers.
				os << llvm::format(" Total Execution Time: %.4f seconds\n\n", total.wall);
				if (total.user != total.wall)
				os << " ----User Time----";
				os << " ----Wall Time---- ----Name----\n";
				}

				//===----------------------------------------------------------------------===//
				// Timer Implementation for DefaultTimingManager
				//===----------------------------------------------------------------------===//

				namespace {

				/// A timer used to sample execution time.
				///
				/// Separately tracks wall time and user time to account for parallel threads of
				/// execution. Timers are intended to be started and stopped multiple times.
				rriddle: Drop the trivial braces here.
				/// Each start and stop will add to the timer's wall and user time.
				class TimerImpl {
				public:
				using ChildrenMap = llvm::MapVector<const void *, std::unique_ptr<TimerImpl>>;
				using AsyncChildrenMap = llvm::DenseMap<uint64_t, ChildrenMap>;

				TimerImpl(std::string &&name) : threadId(llvm::get_threadid()), name(name) {}

				/// Start the timer.
				void start() { startTime = std::chrono::system_clock::now(); }

				/// Stop the timer.
				void stop() {
				auto newTime = std::chrono::system_clock::now() - startTime;
				wallTime += newTime;
				userTime += newTime;
				}

				/// Create a child timer nested within this one. Multiple calls to this
				/// function with the same unique identifier `id` will return the same child
				/// timer.
				///
				/// This function can be called from other threads, as long as this timer
				/// outlives any uses of the child timer on the other thread.
				TimerImpl *nest(const void *id, std::function<std::string()> &&nameBuilder) {
				auto tid = llvm::get_threadid();
				if (tid == threadId)
				return nestTail(children[id], std::move(nameBuilder));
				std::unique_lock<std::mutex> lock(asyncMutex);
				return nestTail(asyncChildren[tid][id], std::move(nameBuilder));
				}

				/// Tail-called from `nest()`.
				TimerImpl *nestTail(std::unique_ptr<TimerImpl> &child,
				std::function<std::string()> &&nameBuilder) {
				if (!child)
				child = std::make_unique<TimerImpl>(nameBuilder());
				return child.get();
				}

				/// Finalize this timer and all its children.
				///
				/// If this timer has async children, which happens if `nest()` was called
				/// from another thread, this function merges the async child timers into the
				/// main list of child timers.
				///
				/// Caution: Call this function only after all nested timers running on other
				/// threads no longer need their timers!
				void finalize() {
				addAsyncUserTime();
				mergeAsyncChildren();
				}

				/// Add the user time of all async children to this timer's user time. This is
				/// necessary since the user time already contains all regular child timers,
				/// but not the asynchronous ones (by the nesting nature of the timers).
				std::chrono::nanoseconds addAsyncUserTime() {
				auto added = std::chrono::nanoseconds(0);
				for (auto &child : children)
				added += child.second->addAsyncUserTime();
				for (auto &thread : asyncChildren) {
				for (auto &child : thread.second) {
				child.second->addAsyncUserTime();
				added += child.second->userTime;
				}
				}
				userTime += added;
				return added;
				}

				/// Ensure that this timer and recursively all its children have their async
				/// children folded into the main map of children.
				void mergeAsyncChildren() {
				for (auto &child : children)
				child.second->mergeAsyncChildren();
				mergeChildren(std::move(asyncChildren));
				assert(asyncChildren.empty());
				}

				/// Merge multiple child timers into this timer.
				///
				/// Children in `other` are added as children to this timer, or, if this timer
				/// already contains a child with the corresponding unique identifier, are
				/// merged into the existing child.
				void mergeChildren(ChildrenMap &&other) {
				if (children.empty()) {
				children = std::move(other);
				for (auto &child : children)
				child.second->mergeAsyncChildren();
				} else {
				for (auto &child : other)
				mergeChild(child.first, std::move(child.second));
				other.clear();
				}
				}

				/// See above.
				void mergeChildren(AsyncChildrenMap &&other) {
				for (auto &thread : other) {
				mergeChildren(std::move(thread.second));
				assert(thread.second.empty());
				}
				other.clear();
				}

				/// Merge a child timer into this timer for a given unique identifier.
				///
				/// Moves all child and async child timers of `other` into this timer's child
				/// for the given unique identifier.
				void mergeChild(const void *id, std::unique_ptr<TimerImpl> &&other) {
				auto &into = children[id];
				if (!into) {
				into = std::move(other);
				into->mergeAsyncChildren();
				} else {
				into->wallTime = std::max(into->wallTime, other->wallTime);
				into->userTime += other->userTime;
				into->mergeChildren(std::move(other->children));
				into->mergeChildren(std::move(other->asyncChildren));
				other.reset();
				}
				}

				/// Dump a human-readable tree representation of the timer and its children.
				/// This is useful for debugging the timing mechanisms and structure of the
				/// timers.
				void dump(raw_ostream &os, unsigned indent = 0, unsigned markThreadId = 0) {
				auto time = getTimeRecord();
				os << std::string(indent * 2, ' ') << name << " [" << threadId << "]"
				<< llvm::format(" %7.4f / %7.4f", time.user, time.wall);
				if (threadId != markThreadId && markThreadId != 0)
				os << " (*)";
				os << "\n";
				for (auto &child : children)
				child.second->dump(os, indent + 1, threadId);
				for (auto &thread : asyncChildren)
				for (auto &child : thread.second)
				child.second->dump(os, indent + 1, threadId);
				}

				/// Returns the time for this timer in seconds.
				TimeRecord getTimeRecord() {
				return TimeRecord(
				std::chrono::duration_cast<std::chrono::duration<double>>(wallTime)
				.count(),
				std::chrono::duration_cast<std::chrono::duration<double>>(userTime)
				.count());
				}

				/// Print the timing result in list mode.
				void printAsList(raw_ostream &os, TimeRecord total) {
				// Flatten the leaf timers in the tree and merge them by name.
				llvm::StringMap<TimeRecord> mergedTimers;
				std::function<void(TimerImpl *)> addTimer = [&](TimerImpl *timer) {
				mergedTimers[timer->name] += timer->getTimeRecord();
				for (auto &children : timer->children)
				addTimer(children.second.get());
				};
				addTimer(this);

				// Sort the timing information by wall time.
				std::vector<std::pair<StringRef, TimeRecord>> timerNameAndTime;
				for (auto &it : mergedTimers)
				timerNameAndTime.emplace_back(it.first(), it.second);
				llvm::array_pod_sort(timerNameAndTime.begin(), timerNameAndTime.end(),
				[](const std::pair<StringRef, TimeRecord> *lhs,
				const std::pair<StringRef, TimeRecord> *rhs) {
				return llvm::array_pod_sort_comparator<double>(
				&rhs->second.wall, &lhs->second.wall);
				});

				// Print the timing information sequentially.
				for (auto &timeData : timerNameAndTime)
				printTimeEntry(os, 0, timeData.first, timeData.second, total);
				}

				/// Print the timing result in tree mode.
				void printAsTree(raw_ostream &os, TimeRecord total, unsigned indent = 0) {
				unsigned childIndent = indent;
				if (!hidden) {
				printTimeEntry(os, indent, name, getTimeRecord(), total);
				childIndent += 2;
				}
				for (auto &child : children) {
				child.second->printAsTree(os, total, childIndent);
				}
				}

				/// Print the current timing information.
				void print(raw_ostream &os, DisplayMode displayMode) {
				// Print the banner.
				auto total = getTimeRecord();
				printTimeHeader(os, total);

				// Defer to a specialized printer for each display mode.
				switch (displayMode) {
				case DisplayMode::List:
				printAsList(os, total);
				break;
				case DisplayMode::Tree:
				printAsTree(os, total);
				break;
				}

				// Print the top-level time not accounted for by child timers, and the
				// total.
				auto rest = total;
				for (auto &child : children)
				rest -= child.second->getTimeRecord();
				printTimeEntry(os, 0, "Rest", rest, total);
				printTimeEntry(os, 0, "Total", total, total);
				os.flush();
				}

				/// The last time instant at which the timer was started.
				std::chrono::time_point<std::chrono::system_clock> startTime;

				/// Accumulated wall time. If multiple threads of execution are merged into
				/// this timer, the wall time will hold the maximum wall time of each thread
				/// of execution.
				std::chrono::nanoseconds wallTime = std::chrono::nanoseconds(0);

				/// Accumulated user time. If multiple threads of execution are merged into
				/// this timer, each thread's user time is added here.
				std::chrono::nanoseconds userTime = std::chrono::nanoseconds(0);

				/// The thread on which this timer is running.
				uint64_t threadId;

				/// A descriptive name for this timer.
				std::string name;

				/// Whether to omit this timer from reports and directly show its children.
				bool hidden = false;

				/// Child timers on the same thread as the timer itself. We keep at most one
				/// timer per unique identifier.
				ChildrenMap children;

				/// Child timers on other threads. We keep at most one timer per unique
				/// identifier.
				AsyncChildrenMap asyncChildren;

				/// Mutex for the async children.
				std::mutex asyncMutex;
				};

				} // namespace

				//===----------------------------------------------------------------------===//
				// DefaultTimingManager
				//===----------------------------------------------------------------------===//

				namespace mlir {
				namespace detail {

				/// Implementation details of the `DefaultTimingManager`.
				class DefaultTimingManagerImpl {
				public:
				/// Whether we should do our work or not.
				bool enabled = false;

				/// The configured display mode.
				DisplayMode displayMode = DisplayMode::Tree;

				/// The stream where we should print our output. This will always be non-null.
				raw_ostream *output = &llvm::errs();

				/// The root timer.
				std::unique_ptr<TimerImpl> rootTimer;
				};

				} // namespace detail
				} // namespace mlir

				DefaultTimingManager::DefaultTimingManager()
				: impl(std::make_unique<DefaultTimingManagerImpl>()) {
				clear(); // initializes the root timer
				}

				DefaultTimingManager::~DefaultTimingManager() { print(); }

				/// Enable or disable execution time sampling.
				void DefaultTimingManager::setEnabled(bool enabled) { impl->enabled = enabled; }

				/// Return whether execution time sampling is enabled.
				bool DefaultTimingManager::isEnabled() const { return impl->enabled; }

				/// Change the display mode.
				void DefaultTimingManager::setDisplayMode(DisplayMode displayMode) {
				impl->displayMode = displayMode;
				}

				/// Return the current display mode.
				DefaultTimingManager::DisplayMode DefaultTimingManager::getDisplayMode() const {
				return impl->displayMode;
				}

				/// Change the stream where the output will be printed to.
				void DefaultTimingManager::setOutput(raw_ostream &os) { impl->output = &os; }

				/// Return the current output stream where the output will be printed to.
				raw_ostream &DefaultTimingManager::getOutput() const {
				assert(impl->output);
				return *impl->output;
				}

				/// Print and clear the timing results.
				void DefaultTimingManager::print() {
				if (impl->enabled) {
				impl->rootTimer->finalize();
				impl->rootTimer->print(*impl->output, impl->displayMode);
				}
				clear();
				}

				/// Clear the timing results.
				void DefaultTimingManager::clear() {
				impl->rootTimer = std::make_unique<TimerImpl>("root");
				impl->rootTimer->hidden = true;
				}

				/// Debug print the timer data structures to an output stream.
				void DefaultTimingManager::dumpTimers(raw_ostream &os) {
				impl->rootTimer->dump(os);
				}

				/// Debug print the timers as a list.
				void DefaultTimingManager::dumpAsList(raw_ostream &os) {
				impl->rootTimer->finalize();
				impl->rootTimer->print(os, DisplayMode::List);
				}

				/// Debug print the timers as a tree.
				void DefaultTimingManager::dumpAsTree(raw_ostream &os) {
				impl->rootTimer->finalize();
				impl->rootTimer->print(os, DisplayMode::Tree);
				}

				Optional<void *> DefaultTimingManager::rootTimer() {
				if (impl->enabled)
				return impl->rootTimer.get();
				return llvm::None;
				}

				void DefaultTimingManager::startTimer(void *handle) {
				static_cast<TimerImpl *>(handle)->start();
				}

				void DefaultTimingManager::stopTimer(void *handle) {
				static_cast<TimerImpl *>(handle)->stop();
				}

				void *
				DefaultTimingManager::nestTimer(void *handle, const void *id,
				std::function<std::string()> &&nameBuilder) {
				return static_cast<TimerImpl *>(handle)->nest(id, std::move(nameBuilder));
				}

				void DefaultTimingManager::hideTimer(void *handle) {
				static_cast<TimerImpl *>(handle)->hidden = true;
				}

				//===----------------------------------------------------------------------===//
				// DefaultTimingManager Command Line Options
				//===----------------------------------------------------------------------===//

				namespace {
				struct DefaultTimingManagerOptions {
				llvm::cl::opt<bool> timing{"timing",
				rriddle: Can you prefix these with `mlir-`? I'm always hesitant about adding very commonly named things to the global command line namespace.
				fabianschuiki (author): Yeah that's a good idea. `--timing` did seem awfully brief.
				llvm::cl::desc("Display execution times"),
				llvm::cl::init(false)};
				llvm::cl::opt<DisplayMode> displayMode{
				"timing-display", llvm::cl::desc("Display method for timing data"),
				llvm::cl::init(DisplayMode::Tree),
				llvm::cl::values(
				clEnumValN(DisplayMode::List, "list",
				"display the results in a list sorted by total time"),
				clEnumValN(DisplayMode::Tree, "tree",
				"display the results in a nested tree view"))};
				};
				} // end anonymous namespace

				static llvm::ManagedStatic<DefaultTimingManagerOptions> options;

				void mlir::registerDefaultTimingManagerCLOptions() {
				// Make sure that the options struct has been constructed.
				*options;
				}

				void mlir::applyDefaultTimingManagerCLOptions(DefaultTimingManager &tm) {
				if (!options.isConstructed())
				return;
				tm.setEnabled(options->timing);
				tm.setDisplayMode(options->displayMode);
				}
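
The member comments on `wallTime` and `userTime`, together with `mergeChild`, describe how timers from several threads are folded into one entry: wall time takes the maximum over the threads, user time takes the sum. A minimal standalone sketch of that rule follows. `SketchRecord`, `mergeThread` and `mergeAll` are hypothetical names used only for illustration, and milliseconds are used purely to keep the numbers readable.

```cpp
#include <algorithm>
#include <cassert>
#include <chrono>
#include <vector>

// Hypothetical record of one timer as seen on one thread.
struct SketchRecord {
  explicit SketchRecord(std::chrono::milliseconds wall = std::chrono::milliseconds(0),
                        std::chrono::milliseconds user = std::chrono::milliseconds(0))
      : wall(wall), user(user) {}
  std::chrono::milliseconds wall;
  std::chrono::milliseconds user;
};

// Fold the record of one thread into the accumulated record. Threads overlap
// in real time, so wall time is the longest thread, while user time is the
// work summed over all threads.
inline void mergeThread(SketchRecord &into, const SketchRecord &other) {
  into.wall = std::max(into.wall, other.wall);
  into.user += other.user;
}

// Merge the records of all threads that ran the same timer.
inline SketchRecord mergeAll(const std::vector<SketchRecord> &threads) {
  SketchRecord merged;
  for (const SketchRecord &thread : threads)
    mergeThread(merged, thread);
  return merged;
}
```

For three threads that each spent 30, 50 and 20 ms in the same pass, this rule reports 50 ms of wall time and 100 ms of user time. A gap of that kind is what makes `TimeRecord::print` emit the separate user time column, since it only does so when `total.user != total.wall`.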

mlir/test/Pass/pass-timing.mlir

	// RUN: mlir-opt %s -mlir-disable-threading=true -verify-each=true -pass-pipeline='func(cse,canonicalize,cse)' -pass-timing -pass-timing-display=list 2>&1 | FileCheck -check-prefix=LIST %s			// RUN: mlir-opt %s -mlir-disable-threading=true -verify-each=true -pass-pipeline='func(cse,canonicalize,cse)' -timing -timing-display=list 2>&1 | FileCheck -check-prefix=LIST %s
	// RUN: mlir-opt %s -mlir-disable-threading=true -verify-each=true -pass-pipeline='func(cse,canonicalize,cse)' -pass-timing -pass-timing-display=pipeline 2>&1 | FileCheck -check-prefix=PIPELINE %s			// RUN: mlir-opt %s -mlir-disable-threading=true -verify-each=true -pass-pipeline='func(cse,canonicalize,cse)' -timing -timing-display=tree 2>&1 | FileCheck -check-prefix=PIPELINE %s
	// RUN: mlir-opt %s -mlir-disable-threading=false -verify-each=true -pass-pipeline='func(cse,canonicalize,cse)' -pass-timing -pass-timing-display=list 2>&1 | FileCheck -check-prefix=MT_LIST %s			// RUN: mlir-opt %s -mlir-disable-threading=false -verify-each=true -pass-pipeline='func(cse,canonicalize,cse)' -timing -timing-display=list 2>&1 | FileCheck -check-prefix=MT_LIST %s
	// RUN: mlir-opt %s -mlir-disable-threading=false -verify-each=true -pass-pipeline='func(cse,canonicalize,cse)' -pass-timing -pass-timing-display=pipeline 2>&1 | FileCheck -check-prefix=MT_PIPELINE %s			// RUN: mlir-opt %s -mlir-disable-threading=false -verify-each=true -pass-pipeline='func(cse,canonicalize,cse)' -timing -timing-display=tree 2>&1 | FileCheck -check-prefix=MT_PIPELINE %s
	// RUN: mlir-opt %s -mlir-disable-threading=false -verify-each=false -test-pm-nested-pipeline -pass-timing -pass-timing-display=pipeline 2>&1 | FileCheck -check-prefix=NESTED_MT_PIPELINE %s			// RUN: mlir-opt %s -mlir-disable-threading=false -verify-each=false -test-pm-nested-pipeline -timing -timing-display=tree 2>&1 | FileCheck -check-prefix=NESTED_MT_PIPELINE %s

	// LIST: Pass execution timing report			// LIST: Execution time report
	// LIST: Total Execution Time:			// LIST: Total Execution Time:
	// LIST: Name			// LIST: Name
	// LIST-DAG: Canonicalizer			// LIST-DAG: Canonicalizer
	// LIST-DAG: CSE			// LIST-DAG: CSE
	// LIST-DAG: DominanceInfo			// LIST-DAG: DominanceInfo
	// LIST: Total			// LIST: Total

	// PIPELINE: Pass execution timing report			// PIPELINE: Execution time report
	// PIPELINE: Total Execution Time:			// PIPELINE: Total Execution Time:
	// PIPELINE: Name			// PIPELINE: Name
				// PIPELINE-NEXT: Parser
	// PIPELINE-NEXT: 'func' Pipeline			// PIPELINE-NEXT: 'func' Pipeline
	// PIPELINE-NEXT: CSE			// PIPELINE-NEXT: CSE
	// PIPELINE-NEXT: (A) DominanceInfo			// PIPELINE-NEXT: (A) DominanceInfo
	// PIPELINE-NEXT: Canonicalizer			// PIPELINE-NEXT: Canonicalizer
	// PIPELINE-NEXT: CSE			// PIPELINE-NEXT: CSE
	// PIPELINE-NEXT: (A) DominanceInfo			// PIPELINE-NEXT: (A) DominanceInfo
				// PIPELINE-NEXT: Output
				// PIPELINE-NEXT: Rest
	// PIPELINE-NEXT: Total			// PIPELINE-NEXT: Total

	// MT_LIST: Pass execution timing report			// MT_LIST: Execution time report
	// MT_LIST: Total Execution Time:			// MT_LIST: Total Execution Time:
	// MT_LIST: Name			// MT_LIST: Name
	// MT_LIST-DAG: Canonicalizer			// MT_LIST-DAG: Canonicalizer
	// MT_LIST-DAG: CSE			// MT_LIST-DAG: CSE
	// MT_LIST-DAG: DominanceInfo			// MT_LIST-DAG: DominanceInfo
	// MT_LIST: Total			// MT_LIST: Total

	// MT_PIPELINE: Pass execution timing report			// MT_PIPELINE: Execution time report
	// MT_PIPELINE: Total Execution Time:			// MT_PIPELINE: Total Execution Time:
	// MT_PIPELINE: Name			// MT_PIPELINE: Name
				// MT_PIPELINE-NEXT: Parser
	// MT_PIPELINE-NEXT: 'func' Pipeline			// MT_PIPELINE-NEXT: 'func' Pipeline
	// MT_PIPELINE-NEXT: CSE			// MT_PIPELINE-NEXT: CSE
	// MT_PIPELINE-NEXT: (A) DominanceInfo			// MT_PIPELINE-NEXT: (A) DominanceInfo
	// MT_PIPELINE-NEXT: Canonicalizer			// MT_PIPELINE-NEXT: Canonicalizer
	// MT_PIPELINE-NEXT: CSE			// MT_PIPELINE-NEXT: CSE
	// MT_PIPELINE-NEXT: (A) DominanceInfo			// MT_PIPELINE-NEXT: (A) DominanceInfo
				// MT_PIPELINE-NEXT: Output
				// MT_PIPELINE-NEXT: Rest
	// MT_PIPELINE-NEXT: Total			// MT_PIPELINE-NEXT: Total

	// NESTED_MT_PIPELINE: Pass execution timing report			// NESTED_MT_PIPELINE: Execution time report
	// NESTED_MT_PIPELINE: Total Execution Time:			// NESTED_MT_PIPELINE: Total Execution Time:
	// NESTED_MT_PIPELINE: Name			// NESTED_MT_PIPELINE: Name
				// NESTED_MT_PIPELINE-NEXT: Parser
	// NESTED_MT_PIPELINE-NEXT: Pipeline Collection : ['func', 'module']			// NESTED_MT_PIPELINE-NEXT: Pipeline Collection : ['func', 'module']
	// NESTED_MT_PIPELINE-NEXT: 'func' Pipeline			// NESTED_MT_PIPELINE-NEXT: 'func' Pipeline
	// NESTED_MT_PIPELINE-NEXT: TestFunctionPass			// NESTED_MT_PIPELINE-NEXT: TestFunctionPass
	// NESTED_MT_PIPELINE-NEXT: 'module' Pipeline			// NESTED_MT_PIPELINE-NEXT: 'module' Pipeline
	// NESTED_MT_PIPELINE-NEXT: TestModulePass			// NESTED_MT_PIPELINE-NEXT: TestModulePass
	// NESTED_MT_PIPELINE-NEXT: 'func' Pipeline			// NESTED_MT_PIPELINE-NEXT: 'func' Pipeline
	// NESTED_MT_PIPELINE-NEXT: TestFunctionPass			// NESTED_MT_PIPELINE-NEXT: TestFunctionPass
				// NESTED_MT_PIPELINE-NEXT: Output
				// NESTED_MT_PIPELINE-NEXT: Rest
	// NESTED_MT_PIPELINE-NEXT: Total			// NESTED_MT_PIPELINE-NEXT: Total

	func @foo() {			func @foo() {
	return			return
	}			}

	func @bar() {			func @bar() {
	return			return
	[19 lines not shown]
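
The updated tree checks above expect `Parser`, `Output`, and `Rest` rows alongside the pass pipeline rows. As a rough illustration of how such a report could be assembled, the sketch below models timers as a tree and derives a `Rest` value as the root's time that no child accounts for. This is an assumption-laden sketch, not the implementation in Support/Timing.cpp: `TimerNode`, `restTime`, and `treeRows` are hypothetical names, and the row layout is simplified to plain indentation.

```cpp
#include <string>
#include <vector>

// Hypothetical timer record: a name, its wall time, and nested child timers.
struct TimerNode {
  std::string name;
  double seconds = 0.0;
  std::vector<TimerNode> children;
};

// Time spent in `node` that none of its direct children account for.
double restTime(const TimerNode &node) {
  double childSum = 0.0;
  for (const TimerNode &child : node.children)
    childSum += child.seconds;
  return node.seconds - childSum;
}

// Append one row per descendant, indenting by nesting depth.
void appendRows(const TimerNode &node, int depth,
                std::vector<std::string> &rows) {
  for (const TimerNode &child : node.children) {
    rows.push_back(std::string(2 * depth, ' ') + child.name);
    appendRows(child, depth + 1, rows);
  }
}

// Tree-style report: nested rows, then "Rest" and "Total" for the root.
std::vector<std::string> treeRows(const TimerNode &root) {
  std::vector<std::string> rows;
  appendRows(root, 0, rows);
  rows.push_back("Rest");
  rows.push_back("Total");
  return rows;
}
```

With a root timer that wraps parsing, the pass pipeline, and output, the resulting row order mirrors the sequence of `PIPELINE-NEXT` checks: the nested timers first, then `Rest`, then `Total`.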

mlir/test/Pass/pipeline-parsing.mlir

	// RUN: mlir-opt %s -pass-pipeline='module(test-module-pass,func(test-function-pass)),func(test-function-pass)' -pass-pipeline="func(cse,canonicalize)" -verify-each=false -pass-timing -pass-timing-display=pipeline 2>&1 | FileCheck %s			// RUN: mlir-opt %s -pass-pipeline='module(test-module-pass,func(test-function-pass)),func(test-function-pass)' -pass-pipeline="func(cse,canonicalize)" -verify-each=false -timing -timing-display=tree 2>&1 | FileCheck %s
	// RUN: mlir-opt %s -test-textual-pm-nested-pipeline -verify-each=false -pass-timing -pass-timing-display=pipeline 2>&1 | FileCheck %s --check-prefix=TEXTUAL_CHECK			// RUN: mlir-opt %s -test-textual-pm-nested-pipeline -verify-each=false -timing -timing-display=tree 2>&1 | FileCheck %s --check-prefix=TEXTUAL_CHECK
	// RUN: not mlir-opt %s -pass-pipeline='module(test-module-pass' 2>&1 | FileCheck --check-prefix=CHECK_ERROR_1 %s			// RUN: not mlir-opt %s -pass-pipeline='module(test-module-pass' 2>&1 | FileCheck --check-prefix=CHECK_ERROR_1 %s
	// RUN: not mlir-opt %s -pass-pipeline='module(test-module-pass))' 2>&1 | FileCheck --check-prefix=CHECK_ERROR_2 %s			// RUN: not mlir-opt %s -pass-pipeline='module(test-module-pass))' 2>&1 | FileCheck --check-prefix=CHECK_ERROR_2 %s
	// RUN: not mlir-opt %s -pass-pipeline='module()(' 2>&1 | FileCheck --check-prefix=CHECK_ERROR_3 %s			// RUN: not mlir-opt %s -pass-pipeline='module()(' 2>&1 | FileCheck --check-prefix=CHECK_ERROR_3 %s
	// RUN: not mlir-opt %s -pass-pipeline=',' 2>&1 | FileCheck --check-prefix=CHECK_ERROR_4 %s			// RUN: not mlir-opt %s -pass-pipeline=',' 2>&1 | FileCheck --check-prefix=CHECK_ERROR_4 %s
	// RUN: not mlir-opt %s -pass-pipeline='func(test-module-pass)' 2>&1 | FileCheck --check-prefix=CHECK_ERROR_5 %s			// RUN: not mlir-opt %s -pass-pipeline='func(test-module-pass)' 2>&1 | FileCheck --check-prefix=CHECK_ERROR_5 %s

	// CHECK_ERROR_1: encountered unbalanced parentheses while parsing pipeline			// CHECK_ERROR_1: encountered unbalanced parentheses while parsing pipeline
	// CHECK_ERROR_2: encountered extra closing ')' creating unbalanced parentheses while parsing pipeline			// CHECK_ERROR_2: encountered extra closing ')' creating unbalanced parentheses while parsing pipeline
	[31 lines not shown]

Diff 342966
