docs/BlockFrequencyTerminology.rst - llvm - Git at Google

 ================================
 LLVM Block Frequency Terminology
 ================================

 .. contents::
    :local:

 Introduction
 ============

 Block Frequency is a metric for estimating the relative frequency of different
 basic blocks.  This document describes the terminology that the
 ``BlockFrequencyInfo`` and ``MachineBlockFrequencyInfo`` analysis passes use.

 Branch Probability
 ==================

 Blocks with multiple successors have probabilities associated with each
 outgoing edge.  These are called branch probabilities.  For a given block, the
 sum of its outgoing branch probabilities should be 1.0.

 Branch Weight
 =============

 Rather than storing fractions on each edge, we store an integer weight.
 Weights are relative to the other edges of a given predecessor block.  The
 branch probability associated with a given edge is its own weight divided by
 the sum of the weights on the predecessor's outgoing edges.

 For example, consider this IR:

 .. code-block:: llvm

    define void @foo() {
        ; ...
        A:
            br i1 %cond, label %B, label %C, !prof !0
        ; ...
    }
    !0 = metadata !{metadata !"branch_weights", i32 7, i32 8}

 and this simple graph representation::

    A -> B  (edge-weight: 7)
    A -> C  (edge-weight: 8)

 The probability of branching from block A to block B is 7/15, and the
 probability of branching from block A to block C is 8/15.

 See :doc:`BranchWeightMetadata` for details about the branch weight IR
 representation.

 Block Frequency
 ===============

 Block frequency is a relative metric that represents the number of times a
 block executes.  The ratio of a block frequency to the entry block frequency is
 the expected number of times the block will execute per entry to the function.

 Block frequency is the main output of the ``BlockFrequencyInfo`` and
 ``MachineBlockFrequencyInfo`` analysis passes.

 Implementation: a series of DAGs
 ================================

 The implementation of the block frequency calculation analyses each loop,
 bottom-up, ignoring backedges; i.e., as a DAG.  After each loop is processed,
 it's packaged up to act as a pseudo-node in its parent loop's (or the
 function's) DAG analysis.

 Block Mass
 ==========

 For each DAG, the entry node is assigned a mass of ``UINT64_MAX`` and mass is
 distributed to successors according to branch weights.  Block Mass uses a
 fixed-point representation where ``UINT64_MAX`` represents ``1.0`` and ``0``
 represents a number just above ``0.0``.

 After mass is fully distributed, in any cut of the DAG that separates the exit
 nodes from the entry node, the sum of the block masses of the nodes succeeded
 by a cut edge should equal ``UINT64_MAX``.  In other words, mass is conserved
 as it "falls" through the DAG.

 If a function's basic block graph is a DAG, then block masses are valid block
 frequencies.  This works poorly in practise though, since downstream users rely
 on adding block frequencies together without hitting the maximum.

 Loop Scale
 ==========

 Loop scale is a metric that indicates how many times a loop iterates per entry.
 As mass is distributed through the loop's DAG, the (otherwise ignored) backedge
 mass is collected.  This backedge mass is used to compute the exit frequency,
 and thus the loop scale.

 Implementation: Getting from mass and scale to frequency
 ========================================================

 After analysing the complete series of DAGs, each block has a mass (local to
 its containing loop, if any), and each loop pseudo-node has a loop scale and
 its own mass (from its parent's DAG).

 We can get an initial frequency assignment (with entry frequency of 1.0) by
 multiplying these masses and loop scales together.  A given block's frequency
 is the product of its mass, the mass of containing loops' pseudo nodes, and the
 containing loops' loop scales.

 Since downstream users need integers (not floating point), this initial
 frequency assignment is shifted as necessary into the range of ``uint64_t``.

 Block Bias
 ==========

 Block bias is a proposed *absolute* metric to indicate a bias toward or away
 from a given block during a function's execution.  The idea is that bias can be
 used in isolation to indicate whether a block is relatively hot or cold, or to
 compare two blocks to indicate whether one is hotter or colder than the other.

 The proposed calculation involves calculating a *reference* block frequency,
 where:

 * every branch weight is assumed to be 1 (i.e., every branch probability
   distribution is even) and

 * loop scales are ignored.

 This reference frequency represents what the block frequency would be in an
 unbiased graph.

 The bias is the ratio of the block frequency to this reference block frequency.
	================================
	LLVM Block Frequency Terminology
	================================

	.. contents::
	:local:

	Introduction
	============

	Block Frequency is a metric for estimating the relative frequency of different
	basic blocks. This document describes the terminology that the
	``BlockFrequencyInfo`` and ``MachineBlockFrequencyInfo`` analysis passes use.

	Branch Probability
	==================

	Blocks with multiple successors have probabilities associated with each
	outgoing edge. These are called branch probabilities. For a given block, the
	sum of its outgoing branch probabilities should be 1.0.

	Branch Weight
	=============

	Rather than storing fractions on each edge, we store an integer weight.
	Weights are relative to the other edges of a given predecessor block. The
	branch probability associated with a given edge is its own weight divided by
	the sum of the weights on the predecessor's outgoing edges.

	For example, consider this IR:

	.. code-block:: llvm

	define void @foo() {
	; ...
	A:
	br i1 %cond, label %B, label %C, !prof !0
	; ...
	}
	!0 = metadata !{metadata !"branch_weights", i32 7, i32 8}

	and this simple graph representation::

	A -> B (edge-weight: 7)
	A -> C (edge-weight: 8)

	The probability of branching from block A to block B is 7/15, and the
	probability of branching from block A to block C is 8/15.

	See :doc:`BranchWeightMetadata` for details about the branch weight IR
	representation.

	Block Frequency
	===============

	Block frequency is a relative metric that represents the number of times a
	block executes. The ratio of a block frequency to the entry block frequency is
	the expected number of times the block will execute per entry to the function.

	Block frequency is the main output of the ``BlockFrequencyInfo`` and
	``MachineBlockFrequencyInfo`` analysis passes.

	Implementation: a series of DAGs
	================================

	The implementation of the block frequency calculation analyses each loop,
	bottom-up, ignoring backedges; i.e., as a DAG. After each loop is processed,
	it's packaged up to act as a pseudo-node in its parent loop's (or the
	function's) DAG analysis.

	Block Mass
	==========

	For each DAG, the entry node is assigned a mass of ``UINT64_MAX`` and mass is
	distributed to successors according to branch weights. Block Mass uses a
	fixed-point representation where ``UINT64_MAX`` represents ``1.0`` and ``0``
	represents a number just above ``0.0``.

	After mass is fully distributed, in any cut of the DAG that separates the exit
	nodes from the entry node, the sum of the block masses of the nodes succeeded
	by a cut edge should equal ``UINT64_MAX``. In other words, mass is conserved
	as it "falls" through the DAG.

	If a function's basic block graph is a DAG, then block masses are valid block
	frequencies. This works poorly in practise though, since downstream users rely
	on adding block frequencies together without hitting the maximum.

	Loop Scale
	==========

	Loop scale is a metric that indicates how many times a loop iterates per entry.
	As mass is distributed through the loop's DAG, the (otherwise ignored) backedge
	mass is collected. This backedge mass is used to compute the exit frequency,
	and thus the loop scale.

	Implementation: Getting from mass and scale to frequency
	========================================================

	After analysing the complete series of DAGs, each block has a mass (local to
	its containing loop, if any), and each loop pseudo-node has a loop scale and
	its own mass (from its parent's DAG).

	We can get an initial frequency assignment (with entry frequency of 1.0) by
	multiplying these masses and loop scales together. A given block's frequency
	is the product of its mass, the mass of containing loops' pseudo nodes, and the
	containing loops' loop scales.

	Since downstream users need integers (not floating point), this initial
	frequency assignment is shifted as necessary into the range of ``uint64_t``.

	Block Bias
	==========

	Block bias is a proposed absolute metric to indicate a bias toward or away
	from a given block during a function's execution. The idea is that bias can be
	used in isolation to indicate whether a block is relatively hot or cold, or to
	compare two blocks to indicate whether one is hotter or colder than the other.

	The proposed calculation involves calculating a reference block frequency,
	where:

	* every branch weight is assumed to be 1 (i.e., every branch probability
	distribution is even) and

	* loop scales are ignored.

	This reference frequency represents what the block frequency would be in an
	unbiased graph.

	The bias is the ratio of the block frequency to this reference block frequency.