Diff 102671

docs/BranchWeightMetadata.rst

	Show First 20 Lines • Show All 58 Lines • ▼ Show 20 Lines
	.. code-block:: none			.. code-block:: none

	!0 = metadata !{			!0 = metadata !{
	metadata !"branch_weights",			metadata !"branch_weights",
	i32 <LABEL_BRANCH_WEIGHT>			i32 <LABEL_BRANCH_WEIGHT>
	[ , i32 <LABEL_BRANCH_WEIGHT> ... ]			[ , i32 <LABEL_BRANCH_WEIGHT> ... ]
	}			}

				``CallInst``
				^^^^^^^^^^^^^^^^^^

				Calls may have branch weight metadata, containing the execution count of
				reamesUnsubmitted Not Done Reply Inline Actions Please describe the semantics of the metadata separately from the usage. I have an alternate profiling source and need to know how the metadata is used, not just where you assume it comes from. reames: Please describe the semantics of the metadata separately from the usage. I have an alternate…
				tejohnsonAuthorUnsubmitted Not Done Reply Inline Actions Right now it is only used in SamplePGO mode (used to detect hot calls since profile counts on the function entry etc may not be accurate in sampling mode). I've updated the wording to indicate this. tejohnson: Right now it is only used in SamplePGO mode (used to detect hot calls since profile counts on…
				the call. It is currently used in SamplePGO mode only, to augment the
				block and entry counts which may not be accurate with sampling.

				.. code-block:: none

				!0 = metadata !{
				metadata !"branch_weights",
				i32 <CALL_BRANCH_WEIGHT>
				}

	Other			Other
	^^^^^			^^^^^

	Other terminator instructions are not allowed to contain Branch Weight Metadata.			Other terminator instructions are not allowed to contain Branch Weight Metadata.

	.. _\__builtin_expect:			.. _\__builtin_expect:

	Built-in ``expect`` Instructions			Built-in ``expect`` Instructions
	▲ Show 20 Lines • Show All 76 Lines • Show Last 20 Lines

docs/LangRef.rst

This file is larger than 256 KB, so syntax highlighting is disabled by default.

	Show First 20 Lines • Show All 5,186 Lines • ▼ Show 20 Lines
	.. code-block:: llvm			.. code-block:: llvm

	$a = comdat any			$a = comdat any
	@a = global i32 1, comdat $a			@a = global i32 1, comdat $a
	@b = internal global i32 2, comdat $a, section "abc", !associated !0			@b = internal global i32 2, comdat $a, section "abc", !associated !0
	!0 = !{i32* @a}			!0 = !{i32* @a}


				'``prof``' Metadata
				^^^^^^^^^^^^^^^^^^^

				The ``prof`` metadata is used to record profile data in the IR.
				The first operand of the metadata node indicates the profile metadata
				type. There are currently 3 types:
				:ref:`branch_weights<prof_node_branch_weights>`,
				:ref:`function_entry_count<prof_node_function_entry_count>`, and
				:ref:`VP<prof_node_VP>`.

				.. _prof_node_branch_weights:

				branch_weights
				""""""""""""""

				Branch weight metadata attached to a branch, select, switch or call instruction
				davidxlUnsubmitted Not Done Reply Inline Actions Does it attach to the call instruction? It can be attached to SELECT instruction which should be mentioned. davidxl: Does it attach to the call instruction? It can be attached to SELECT instruction which should…
				tejohnsonAuthorUnsubmitted Not Done Reply Inline Actions It attached to direct call instructions in SamplePGO mode. And according to the linked older documentation I found (http://llvm.org/docs/BranchWeightMetadata.html#indirectbrinst) it attaches to indirect calls, although I wonder if that is stale since we now have VP metadata for indirect calls? Will add "select" to the list of instruction types here. tejohnson: It attached to direct call instructions in SamplePGO mode. And according to the linked older…
				davidxlUnsubmitted Done Reply Inline Actions Need to document the direct call usage. indirectbr is not for indirect call, but computed gotos. So may be expand branch into conditional branch or indirect branches? Unconditional direct branch does not need this meta data. davidxl: Need to document the direct call usage. indirectbr is not for indirect call, but computed…
				represents the likeliness of the associated branch being taken.
				For more information, see :doc:`BranchWeightMetadata`.

				.. _prof_node_function_entry_count:

				function_entry_count
				""""""""""""""""""""

				Function entry count metadata can be attached to function definitions
				to record the number of times the function is called. Used with BFI
				davidxlUnsubmitted Not Done Reply Inline Actions to record the number of times the function is called. Used with BFI information, it is also used to derive basic block profile count. davidxl: to record the number of times the function is called. Used with BFI information, it is also…
				tejohnsonAuthorUnsubmitted Not Done Reply Inline Actions will update. tejohnson: will update.
				information, it is also used to derive the basic block profile count.
				For more information, see :doc:`BranchWeightMetadata`.

				.. _prof_node_VP:

				VP
				""

				VP (value profile) metadata can be attached to instructions that have
				value profile information. Currently this is indirect calls (where it
				records the hottest callees) and calls to memory intrinsics such as memcpy,
				davidxlUnsubmitted Not Done Reply Inline Actions memcpy -- mempy like calls (including memset, etc) davidxl: memcpy -- mempy like calls (including memset, etc)
				tejohnsonAuthorUnsubmitted Not Done Reply Inline Actions will change to "memory intrinsics such as memcpy, memmove and memset" tejohnson: will change to "memory intrinsics such as memcpy, memmove and memset"
				memmove, and memset (where it records the hottest byte lengths).

				Each VP metadata node contains "VP" string, then a uint32_t value for the value
				profiling kind, a uint64_t value for the total number of times the instruction
				is executed, followed by uint64_t value and execution count pairs.
				The value profiling kind is 0 for indirect call targets and 1 for memory
				reamesUnsubmitted Not Done Reply Inline Actions It would have been more clear to represent the types as strings in the metadata. Document what's there for the moment, but might be worth changing this. Clarification: Is it legal for the sum of the pairs to not add up to the total? (It should be.) Clarification: Is it required that the counts be exact executio counts? Or are they solely relative weightings? (Should be the later.) reames: It would have been more clear to represent the types as strings in the metadata. Document…
				tejohnsonAuthorUnsubmitted Not Done Reply Inline Actions It would have been more clear to represent the types as strings in the metadata. Document what's there for the moment, but might be worth changing this. David may be able to give some context on why it is an int and not a string. But as a user I agree a string would be clearer. Clarification: Is it legal for the sum of the pairs to not add up to the total? (It should be.) Yes. In fact, we only profile the top N (N is configurable, default is 8), so the value counts often don't add up to the total. I have clarified this. Clarification: Is it required that the counts be exact execution counts? Or are they solely relative weightings (Should be the later.) Relative weightings should work fine. In fact, the total may not be equal to the BB execution count in a multi-threaded profile collection environment since counter updates are non-atomic. tejohnson: > It would have been more clear to represent the types as strings in the metadata. Document…
				operations. For indirect call targets, each profile value is a hash
				of the callee function name, and for memory operations each value is the
				byte length.

				Note that the value counts do not need to add up to the total count
				listed in the third operand (in practice only the top hottest values
				are tracked and reported).

				Indirect call example:

				.. code-block:: llvm

				call void %f(), !prof !1
				!1 = !{!"VP", i32 0, i64 1600, i64 7651369219802541373, i64 1030, i64 -4377547752858689819, i64 410}

				Note that the VP type is 0 (the second operand), which indicates this is
				an indirect call value profile data. The third operand indicates that the
				indirect call executed 1600 times. The 4th and 6th operands give the
				hashes of the 2 hottest target functions' names (this is the same hash used
				to represent function names in the profile database), and the 5th and 7th
				operands give the execution count that each of the respective prior target
				functions was called.

	Module Flags Metadata			Module Flags Metadata
	=====================			=====================

	Information about the module as a whole is difficult to convey to LLVM's			Information about the module as a whole is difficult to convey to LLVM's
	subsystems. The LLVM IR isn't sufficient to transmit this information.			subsystems. The LLVM IR isn't sufficient to transmit this information.
	The ``llvm.module.flags`` named metadata exists in order to facilitate			The ``llvm.module.flags`` named metadata exists in order to facilitate
	this. These flags are in the form of key / value pairs --- much like a			this. These flags are in the form of key / value pairs --- much like a
	dictionary --- making it easy for any subsystem who cares about a flag to			dictionary --- making it easy for any subsystem who cares about a flag to
	▲ Show 20 Lines • Show All 8,871 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[Doc] Document prof metadata in LangRef
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 102671

docs/BranchWeightMetadata.rst

docs/LangRef.rst

This is an archive of the discontinued LLVM Phabricator instance.

[Doc] Document prof metadata in LangRefClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 102671

docs/BranchWeightMetadata.rst

docs/LangRef.rst

[Doc] Document prof metadata in LangRef
ClosedPublic