llvm/docs/GlobalISel/GMIR.rst - llvm-project - Git at Google

 .. _gmir:

 Generic Machine IR
 ==================

 .. contents::
    :local:

 Generic MIR (gMIR) is an intermediate representation that shares the same data
 structures as :doc:`MachineIR (MIR) <../MIRLangRef>` but has more relaxed
 constraints. As the compilation pipeline proceeds, these constraints are
 gradually tightened until gMIR has become MIR.

 The rest of this document will assume that you are familiar with the concepts
 in :doc:`MachineIR (MIR) <../MIRLangRef>` and will highlight the differences
 between MIR and gMIR.

 .. _gmir-instructions:

 Generic Machine Instructions
 ----------------------------

 .. note::

   This section expands on :ref:`mir-instructions` from the MIR Language
   Reference.

 Whereas MIR deals largely in Target Instructions and only has a small set of
 target independent opcodes such as ``COPY``, ``PHI``, and ``REG_SEQUENCE``,
 gMIR defines a rich collection of ``Generic Opcodes`` which are target
 independent and describe operations which are typically supported by targets.
 One example is ``G_ADD`` which is the generic opcode for an integer addition.
 More information on each of the generic opcodes can be found at
 :doc:`GenericOpcode`.

 The ``MachineIRBuilder`` class wraps the ``MachineInstrBuilder`` and provides
 a convenient way to create these generic instructions.

 .. _gmir-gvregs:

 Generic Virtual Registers
 -------------------------

 .. note::

   This section expands on :ref:`mir-registers` from the MIR Language
   Reference.

 Generic virtual registers are like virtual registers but they are not assigned a
 Register Class constraint. Instead, generic virtual registers have less strict
 constraints starting with a :ref:`gmir-llt` and then further constrained to a
 :ref:`gmir-regbank`. Eventually they will be constrained to a register class
 at which point they become normal virtual registers.

 Generic virtual registers can be used with all the virtual register API's
 provided by ``MachineRegisterInfo``. In particular, the def-use chain API's can
 be used without needing to distinguish them from non-generic virtual registers.

 For simplicity, most generic instructions only accept virtual registers (both
 generic and non-generic). There are some exceptions to this but in general:

 * instead of immediates, they use a generic virtual register defined by an
   instruction that materializes the immediate value (see
   :ref:`irtranslator-constants`). Typically this is a G_CONSTANT or a
   G_FCONSTANT. One example of an exception to this rule is G_SEXT_INREG where
   having an immediate is mandatory.
 * instead of physical register, they use a generic virtual register that is
   either defined by a ``COPY`` from the physical register or used by a ``COPY``
   that defines the physical register.

 .. admonition:: Historical Note

   We started with an alternative representation, where MRI tracks a size for
   each generic virtual register, and instructions have lists of types.
   That had two flaws: the type and size are redundant, and there was no generic
   way of getting a given operand's type (as there was no 1:1 mapping between
   instruction types and operands).
   We considered putting the type in some variant of MCInstrDesc instead:
   See `PR26576 <https://llvm.org/PR26576>`_: [GlobalISel] Generic MachineInstrs
   need a type but this increases the memory footprint of the related objects

 .. _gmir-regbank:

 Register Bank
 -------------

 A Register Bank is a set of register classes defined by the target. This
 definition is rather loose so let's talk about what they can achieve.

 Suppose we have a processor that has two register files, A and B. These are
 equal in every way and support the same instructions for the same cost. They're
 just physically stored apart and each instruction can only access registers from
 A or B but never a mix of the two. If we want to perform an operation on data
 that's in split between the two register files, we must first copy all the data
 into a single register file.

 Given a processor like this, we would benefit from clustering related data
 together into one register file so that we minimize the cost of copying data
 back and forth to satisfy the (possibly conflicting) requirements of all the
 instructions. Register Banks are a means to constrain the register allocator to
 use a particular register file for a virtual register.

 In practice, register files A and B are rarely equal. They can typically store
 the same data but there's usually some restrictions on what operations you can
 do on each register file. A fairly common pattern is for one of them to be
 accessible to integer operations and the other accessible to floating point
 operations. To accommodate this, let's rename A and B to GPR (general purpose
 registers) and FPR (floating point registers).

 We now have some additional constraints that limit us. An operation like G_FMUL
 has to happen in FPR and G_ADD has to happen in GPR. However, even though this
 prescribes a lot of the assignments we still have some freedom. A G_LOAD can
 happen in both GPR and FPR, and which we want depends on who is going to consume
 the loaded data. Similarly, G_FNEG can happen in both GPR and FPR. If we assign
 it to FPR, then we'll use floating point negation. However, if we assign it to
 GPR then we can equivalently G_XOR the sign bit with 1 to invert it.

 In summary, Register Banks are a means of disambiguating between seemingly
 equivalent choices based on some analysis of the differences when each choice
 is applied in a given context.

 To give some concrete examples:

 AArch64

   AArch64 has three main banks. GPR for integer operations, FPR for floating
   point and also for the NEON vector instruction set. The third is CCR and
   describes the condition code register used for predication.

 MIPS

   MIPS has five main banks of which many programs only really use one or two.
   GPR is the general purpose bank for integer operations. FGR or CP1 is for
   the floating point operations as well as the MSA vector instructions and a
   few other application specific extensions. CP0 is for system registers and
   few programs will use it. CP2 and CP3 are for any application specific
   coprocessors that may be present in the chip. Arguably, there is also a sixth
   for the LO and HI registers but these are only used for the result of a few
   operations and it's of questionable value to model distinctly from GPR.

 X86

   X86 can be seen as having 3 main banks: general-purpose, x87, and
   vector (which could be further split into a bank per domain for single vs
   double precision instructions). It also looks like there's arguably a few
   more potential banks such as one for the AVX512 Mask Registers.

 Register banks are described by a target-provided API,
 :ref:`RegisterBankInfo <api-registerbankinfo>`.

 .. _gmir-llt:

 Low Level Type
 --------------

 Additionally, every generic virtual register has a type, represented by an
 instance of the ``LLT`` class.

 Like ``EVT``/``MVT``/``Type``, it has no distinction between unsigned and signed
 integer types.  Furthermore, it also has no distinction between integer and
 floating-point types: it mainly conveys absolutely necessary information, such
 as size and number of vector lanes:

 * ``sN`` for scalars
 * ``pN`` for pointers
 * ``<N x sM>`` for vectors

 ``LLT`` is intended to replace the usage of ``EVT`` in SelectionDAG.

 Here are some LLT examples and their ``EVT`` and ``Type`` equivalents:

    =============  =========  ======================================
    LLT            EVT        IR Type
    =============  =========  ======================================
    ``s1``         ``i1``     ``i1``
    ``s8``         ``i8``     ``i8``
    ``s32``        ``i32``    ``i32``
    ``s32``        ``f32``    ``float``
    ``s17``        ``i17``    ``i17``
    ``s16``        N/A        ``{i8, i8}`` [#abi-dependent]_
    ``s32``        N/A        ``[4 x i8]`` [#abi-dependent]_
    ``p0``         ``iPTR``   ``i8*``, ``i32*``, ``%opaque*``
    ``p2``         ``iPTR``   ``i8 addrspace(2)*``
    ``<4 x s32>``  ``v4f32``  ``<4 x float>``
    ``s64``        ``v1f64``  ``<1 x double>``
    ``<3 x s32>``  ``v3i32``  ``<3 x i32>``
    =============  =========  ======================================


 Rationale: instructions already encode a specific interpretation of types
 (e.g., ``add`` vs. ``fadd``, or ``sdiv`` vs. ``udiv``).  Also encoding that
 information in the type system requires introducing bitcast with no real
 advantage for the selector.

 Pointer types are distinguished by address space.  This matches IR, as opposed
 to SelectionDAG where address space is an attribute on operations.
 This representation better supports pointers having different sizes depending
 on their addressspace.

 .. note::

   .. caution::

     Is this still true? I thought we'd removed the 1-element vector concept.
     Hypothetically, it could be distinct from a scalar but I think we failed to
     find a real occurrence.

   Currently, LLT requires at least 2 elements in vectors, but some targets have
   the concept of a '1-element vector'.  Representing them as their underlying
   scalar type is a nice simplification.

 .. rubric:: Footnotes

 .. [#abi-dependent] This mapping is ABI dependent. Here we've assumed no additional padding is required.

 Generic Opcode Reference
 ------------------------

 The Generic Opcodes that are available are described at :doc:`GenericOpcode`.
	.. _gmir:

	Generic Machine IR
	==================

	.. contents::
	:local:

	Generic MIR (gMIR) is an intermediate representation that shares the same data
	structures as :doc:`MachineIR (MIR) <../MIRLangRef>` but has more relaxed
	constraints. As the compilation pipeline proceeds, these constraints are
	gradually tightened until gMIR has become MIR.

	The rest of this document will assume that you are familiar with the concepts
	in :doc:`MachineIR (MIR) <../MIRLangRef>` and will highlight the differences
	between MIR and gMIR.

	.. _gmir-instructions:

	Generic Machine Instructions
	----------------------------

	.. note::

	This section expands on :ref:`mir-instructions` from the MIR Language
	Reference.

	Whereas MIR deals largely in Target Instructions and only has a small set of
	target independent opcodes such as ``COPY``, ``PHI``, and ``REG_SEQUENCE``,
	gMIR defines a rich collection of ``Generic Opcodes`` which are target
	independent and describe operations which are typically supported by targets.
	One example is ``G_ADD`` which is the generic opcode for an integer addition.
	More information on each of the generic opcodes can be found at
	:doc:`GenericOpcode`.

	The ``MachineIRBuilder`` class wraps the ``MachineInstrBuilder`` and provides
	a convenient way to create these generic instructions.

	.. _gmir-gvregs:

	Generic Virtual Registers
	-------------------------

	.. note::

	This section expands on :ref:`mir-registers` from the MIR Language
	Reference.

	Generic virtual registers are like virtual registers but they are not assigned a
	Register Class constraint. Instead, generic virtual registers have less strict
	constraints starting with a :ref:`gmir-llt` and then further constrained to a
	:ref:`gmir-regbank`. Eventually they will be constrained to a register class
	at which point they become normal virtual registers.

	Generic virtual registers can be used with all the virtual register API's
	provided by ``MachineRegisterInfo``. In particular, the def-use chain API's can
	be used without needing to distinguish them from non-generic virtual registers.

	For simplicity, most generic instructions only accept virtual registers (both
	generic and non-generic). There are some exceptions to this but in general:

	* instead of immediates, they use a generic virtual register defined by an
	instruction that materializes the immediate value (see
	:ref:`irtranslator-constants`). Typically this is a G_CONSTANT or a
	G_FCONSTANT. One example of an exception to this rule is G_SEXT_INREG where
	having an immediate is mandatory.
	* instead of physical register, they use a generic virtual register that is
	either defined by a ``COPY`` from the physical register or used by a ``COPY``
	that defines the physical register.

	.. admonition:: Historical Note

	We started with an alternative representation, where MRI tracks a size for
	each generic virtual register, and instructions have lists of types.
	That had two flaws: the type and size are redundant, and there was no generic
	way of getting a given operand's type (as there was no 1:1 mapping between
	instruction types and operands).
	We considered putting the type in some variant of MCInstrDesc instead:
	See `PR26576 <https://llvm.org/PR26576>`_: [GlobalISel] Generic MachineInstrs
	need a type but this increases the memory footprint of the related objects

	.. _gmir-regbank:

	Register Bank
	-------------

	A Register Bank is a set of register classes defined by the target. This
	definition is rather loose so let's talk about what they can achieve.

	Suppose we have a processor that has two register files, A and B. These are
	equal in every way and support the same instructions for the same cost. They're
	just physically stored apart and each instruction can only access registers from
	A or B but never a mix of the two. If we want to perform an operation on data
	that's in split between the two register files, we must first copy all the data
	into a single register file.

	Given a processor like this, we would benefit from clustering related data
	together into one register file so that we minimize the cost of copying data
	back and forth to satisfy the (possibly conflicting) requirements of all the
	instructions. Register Banks are a means to constrain the register allocator to
	use a particular register file for a virtual register.

	In practice, register files A and B are rarely equal. They can typically store
	the same data but there's usually some restrictions on what operations you can
	do on each register file. A fairly common pattern is for one of them to be
	accessible to integer operations and the other accessible to floating point
	operations. To accommodate this, let's rename A and B to GPR (general purpose
	registers) and FPR (floating point registers).

	We now have some additional constraints that limit us. An operation like G_FMUL
	has to happen in FPR and G_ADD has to happen in GPR. However, even though this
	prescribes a lot of the assignments we still have some freedom. A G_LOAD can
	happen in both GPR and FPR, and which we want depends on who is going to consume
	the loaded data. Similarly, G_FNEG can happen in both GPR and FPR. If we assign
	it to FPR, then we'll use floating point negation. However, if we assign it to
	GPR then we can equivalently G_XOR the sign bit with 1 to invert it.

	In summary, Register Banks are a means of disambiguating between seemingly
	equivalent choices based on some analysis of the differences when each choice
	is applied in a given context.

	To give some concrete examples:

	AArch64

	AArch64 has three main banks. GPR for integer operations, FPR for floating
	point and also for the NEON vector instruction set. The third is CCR and
	describes the condition code register used for predication.

	MIPS

	MIPS has five main banks of which many programs only really use one or two.
	GPR is the general purpose bank for integer operations. FGR or CP1 is for
	the floating point operations as well as the MSA vector instructions and a
	few other application specific extensions. CP0 is for system registers and
	few programs will use it. CP2 and CP3 are for any application specific
	coprocessors that may be present in the chip. Arguably, there is also a sixth
	for the LO and HI registers but these are only used for the result of a few
	operations and it's of questionable value to model distinctly from GPR.

	X86

	X86 can be seen as having 3 main banks: general-purpose, x87, and
	vector (which could be further split into a bank per domain for single vs
	double precision instructions). It also looks like there's arguably a few
	more potential banks such as one for the AVX512 Mask Registers.

	Register banks are described by a target-provided API,
	:ref:`RegisterBankInfo <api-registerbankinfo>`.

	.. _gmir-llt:

	Low Level Type
	--------------

	Additionally, every generic virtual register has a type, represented by an
	instance of the ``LLT`` class.

	Like ``EVT``/``MVT``/``Type``, it has no distinction between unsigned and signed
	integer types. Furthermore, it also has no distinction between integer and
	floating-point types: it mainly conveys absolutely necessary information, such
	as size and number of vector lanes:

	* ``sN`` for scalars
	* ``pN`` for pointers
	* ``<N x sM>`` for vectors

	``LLT`` is intended to replace the usage of ``EVT`` in SelectionDAG.

	Here are some LLT examples and their ``EVT`` and ``Type`` equivalents:

	============= ========= ======================================
	LLT EVT IR Type
	============= ========= ======================================
	``s1`` ``i1`` ``i1``
	``s8`` ``i8`` ``i8``
	``s32`` ``i32`` ``i32``
	``s32`` ``f32`` ``float``
	``s17`` ``i17`` ``i17``
	``s16`` N/A ``{i8, i8}`` [#abi-dependent]_
	``s32`` N/A ``[4 x i8]`` [#abi-dependent]_
	``p0`` ``iPTR`` ``i8``, ``i32``, ``%opaque*``
	``p2`` ``iPTR`` ``i8 addrspace(2)*``
	``<4 x s32>`` ``v4f32`` ``<4 x float>``
	``s64`` ``v1f64`` ``<1 x double>``
	``<3 x s32>`` ``v3i32`` ``<3 x i32>``
	============= ========= ======================================


	Rationale: instructions already encode a specific interpretation of types
	(e.g., ``add`` vs. ``fadd``, or ``sdiv`` vs. ``udiv``). Also encoding that
	information in the type system requires introducing bitcast with no real
	advantage for the selector.

	Pointer types are distinguished by address space. This matches IR, as opposed
	to SelectionDAG where address space is an attribute on operations.
	This representation better supports pointers having different sizes depending
	on their addressspace.

	.. note::

	.. caution::

	Is this still true? I thought we'd removed the 1-element vector concept.
	Hypothetically, it could be distinct from a scalar but I think we failed to
	find a real occurrence.

	Currently, LLT requires at least 2 elements in vectors, but some targets have
	the concept of a '1-element vector'. Representing them as their underlying
	scalar type is a nice simplification.

	.. rubric:: Footnotes

	.. [#abi-dependent] This mapping is ABI dependent. Here we've assumed no additional padding is required.

	Generic Opcode Reference
	------------------------

	The Generic Opcodes that are available are described at :doc:`GenericOpcode`.