|  | =================== | 
|  | Variable Names Plan | 
|  | =================== | 
|  |  | 
|  | .. contents:: | 
|  | :local: | 
|  |  | 
|  | This plan is *provisional*. It is not agreed upon. It is written with the | 
|  | intention of capturing the desires and concerns of the LLVM community, and | 
|  | forming them into a plan that can be agreed upon. | 
|  | The original author is somewhat naïve in the ways of LLVM so there will | 
|  | inevitably be some details that are flawed. You can help - you can edit this | 
|  | page (preferably with a Phabricator review for larger changes) or reply to the | 
|  | `Request For Comments thread | 
|  | <http://lists.llvm.org/pipermail/llvm-dev/2019-February/130083.html>`_. | 
|  |  | 
|  | Too Long; Didn't Read | 
|  | ===================== | 
|  |  | 
|  | Improve the readability of LLVM code. | 
|  |  | 
|  | Introduction | 
|  | ============ | 
|  |  | 
|  | The current `variable naming rule | 
|  | <../CodingStandards.html#name-types-functions-variables-and-enumerators-properly>`_ | 
|  | states: | 
|  |  | 
|  | Variable names should be nouns (as they represent state). The name should be | 
|  | camel case, and start with an upper case letter (e.g. Leader or Boats). | 
|  |  | 
|  | This rule is the same as that for type names. This is a problem because the | 
|  | type name cannot be reused for a variable name [*]_. LLVM developers tend to | 
|  | work around this by either prepending ``The`` to the type name:: | 
|  |  | 
|  | Triple TheTriple; | 
|  |  | 
|  | ... or more commonly use an acronym, despite the coding standard stating "Avoid | 
|  | abbreviations unless they are well known":: | 
|  |  | 
|  | Triple T; | 
|  |  | 
|  | The proliferation of acronyms leads to hard-to-read code such as `this | 
|  | <https://github.com/llvm/llvm-project/blob/0a8bc14ad7f3209fe702d18e250194cd90188596/llvm/lib/Transforms/Vectorize/LoopVectorize.cpp#L7445>`_:: | 
|  |  | 
|  | InnerLoopVectorizer LB(L, PSE, LI, DT, TLI, TTI, AC, ORE, VF.Width, IC, | 
|  | &LVL, &CM); | 
|  |  | 
|  | Many other coding guidelines [LLDB]_ [Google]_ [WebKit]_ [Qt]_ [Rust]_ [Swift]_ | 
|  | [Python]_ require that variable names begin with a lower case letter in contrast | 
|  | to class names which begin with a capital letter. This convention means that the | 
|  | most readable variable name also requires the least thought:: | 
|  |  | 
|  | Triple triple; | 
|  |  | 
|  | There is some agreement that the current rule is broken [LattnerAgree]_ | 
|  | [ArsenaultAgree]_ [RobinsonAgree]_ and that acronyms are an obstacle to reading | 
|  | new code [MalyutinDistinguish]_ [CarruthAcronym]_ [PicusAcronym]_. There are | 
|  | some opposing views [ParzyszekAcronym2]_ [RicciAcronyms]_. | 
|  |  | 
|  | This work-in-progress proposal is to change the coding standard for variable | 
|  | names to require that they start with a lower case letter. | 
|  |  | 
|  | .. [*] In `some cases | 
|  | <https://github.com/llvm/llvm-project/blob/8b72080d4d7b13072f371712eed333f987b7a18e/llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp#L2727>`_ | 
|  | the type name *is* reused as a variable name, but this shadows the type name | 
|  | and confuses many debuggers [DenisovCamelBack]_. | 
|  |  | 
|  | Variable Names Coding Standard Options | 
|  | ====================================== | 
|  |  | 
|  | There are two main options for variable names that begin with a lower case | 
|  | letter: ``camelBack`` and ``lower_case``. (These are also known by other names | 
|  | but here we use the terminology from clang-tidy). | 
|  |  | 
|  | ``camelBack`` is consistent with [WebKit]_, [Qt]_ and [Swift]_ while | 
|  | ``lower_case`` is consistent with [LLDB]_, [Google]_, [Rust]_ and [Python]_. | 
|  |  | 
|  | ``camelBack`` is already used for function names, which may be considered an | 
|  | advantage [LattnerFunction]_ or a disadvantage [CarruthFunction]_. | 
|  |  | 
|  | Approval for ``camelBack`` was expressed by [DenisovCamelBack]_ | 
|  | [LattnerFunction]_ [IvanovicDistinguish]_. | 
|  | Opposition to ``camelBack`` was expressed by [CarruthCamelBack]_ | 
|  | [TurnerCamelBack]_. | 
|  | Approval for ``lower_case`` was expressed by [CarruthLower]_ | 
|  | [CarruthCamelBack]_ [TurnerLLDB]_. | 
|  | Opposition to ``lower_case`` was expressed by [LattnerLower]_. | 
|  |  | 
|  | Differentiating variable kinds | 
|  | ------------------------------ | 
|  |  | 
|  | An additional requested change is to distinguish between different kinds of | 
|  | variables [RobinsonDistinguish]_ [RobinsonDistinguish2]_ [JonesDistinguish]_ | 
|  | [IvanovicDistinguish]_ [CarruthDistinguish]_ [MalyutinDistinguish]_. | 
|  |  | 
|  | Others oppose this idea [HähnleDistinguish]_ [GreeneDistinguish]_ | 
|  | [HendersonPrefix]_. | 
|  |  | 
|  | A possibility is for member variables to be prefixed with ``m_`` and for global | 
|  | variables to be prefixed with ``g_`` to distinguish them from local variables. | 
|  | This is consistent with [LLDB]_. The ``m_`` prefix is consistent with [WebKit]_. | 
|  |  | 
|  | A variation is for member variables to be prefixed with ``m`` | 
|  | [IvanovicDistinguish]_ [BeylsDistinguish]_. This is consistent with [Mozilla]_. | 
|  |  | 
|  | Another option is for member variables to be suffixed with ``_`` which is | 
|  | consistent with [Google]_ and similar to [Python]_. Opposed by | 
|  | [ParzyszekDistinguish]_. | 
|  |  | 
|  | Reducing the number of acronyms | 
|  | =============================== | 
|  |  | 
|  | While switching coding standard will make it easier to use non-acronym names for | 
|  | new code, it doesn't improve the existing large body of code that uses acronyms | 
|  | extensively to the detriment of its readability. Further, it is natural and | 
|  | generally encouraged that new code be written in the style of the surrounding | 
|  | code. Therefore it is likely that much newly written code will also use | 
|  | acronyms despite what the coding standard says, much as it is today. | 
|  |  | 
|  | As well as changing the case of variable names, they could also be expanded to | 
|  | their non-acronym form e.g. ``Triple T`` → ``Triple triple``. | 
|  |  | 
|  | There is support for expanding many acronyms [CarruthAcronym]_ [PicusAcronym]_ | 
|  | but there is a preference that expanding acronyms be deferred | 
|  | [ParzyszekAcronym]_ [CarruthAcronym]_. | 
|  |  | 
|  | The consensus within the community seems to be that at least some acronyms are | 
|  | valuable [ParzyszekAcronym]_ [LattnerAcronym]_. The most commonly cited acronym | 
|  | is ``TLI`` however that is used to refer to both ``TargetLowering`` and | 
|  | ``TargetLibraryInfo`` [GreeneDistinguish]_. | 
|  |  | 
|  | The following is a list of acronyms considered sufficiently useful that the | 
|  | benefit of using them outweighs the cost of learning them. Acronyms that are | 
|  | either not on the list or are used to refer to a different type should be | 
|  | expanded. | 
|  |  | 
|  | ============================ ============= | 
|  | Class name                   Variable name | 
|  | ============================ ============= | 
|  | DeterministicFiniteAutomaton dfa | 
|  | DominatorTree                dt | 
|  | LoopInfo                     li | 
|  | MachineFunction              mf | 
|  | MachineInstr                 mi | 
|  | MachineRegisterInfo          mri | 
|  | ScalarEvolution              se | 
|  | TargetInstrInfo              tii | 
|  | TargetLibraryInfo            tli | 
|  | TargetRegisterInfo           tri | 
|  | ============================ ============= | 
|  |  | 
|  | In some cases renaming acronyms to the full type name will result in overly | 
|  | verbose code. Unlike most classes, a variable's scope is limited and therefore | 
|  | some of its purpose can implied from that scope, meaning that fewer words are | 
|  | necessary to give it a clear name. For example, in an optimization pass the reader | 
|  | can assume that a variable's purpose relates to optimization and therefore an | 
|  | ``OptimizationRemarkEmitter`` variable could be given the name ``remarkEmitter`` | 
|  | or even ``remarker``. | 
|  |  | 
|  | The following is a list of longer class names and the associated shorter | 
|  | variable name. | 
|  |  | 
|  | ========================= ============= | 
|  | Class name                Variable name | 
|  | ========================= ============= | 
|  | BasicBlock                block | 
|  | ConstantExpr              expr | 
|  | ExecutionEngine           engine | 
|  | MachineOperand            operand | 
|  | OptimizationRemarkEmitter remarker | 
|  | PreservedAnalyses         analyses | 
|  | PreservedAnalysesChecker  checker | 
|  | TargetLowering            lowering | 
|  | TargetMachine             machine | 
|  | ========================= ============= | 
|  |  | 
|  | Transition Options | 
|  | ================== | 
|  |  | 
|  | There are three main options for transitioning: | 
|  |  | 
|  | 1. Keep the current coding standard | 
|  | 2. Laissez faire | 
|  | 3. Big bang | 
|  |  | 
|  | Keep the current coding standard | 
|  | -------------------------------- | 
|  |  | 
|  | Proponents of keeping the current coding standard (i.e. not transitioning at | 
|  | all) question whether the cost of transition outweighs the benefit | 
|  | [EmersonConcern]_ [ReamesConcern]_ [BradburyConcern]_. | 
|  | The costs are that ``git blame`` will become less usable; and that merging the | 
|  | changes will be costly for downstream maintainers. See `Big bang`_ for potential | 
|  | mitigations. | 
|  |  | 
|  | Laissez faire | 
|  | ------------- | 
|  |  | 
|  | The coding standard could allow both ``CamelCase`` and ``camelBack`` styles for | 
|  | variable names [LattnerTransition]_. | 
|  |  | 
|  | A code review to implement this is at https://reviews.llvm.org/D57896. | 
|  |  | 
|  | Advantages | 
|  | ********** | 
|  |  | 
|  | * Very easy to implement initially. | 
|  |  | 
|  | Disadvantages | 
|  | ************* | 
|  |  | 
|  | * Leads to inconsistency [BradburyConcern]_ [AminiInconsistent]_. | 
|  | * Inconsistency means it will be hard to know at a guess what name a variable | 
|  | will have [DasInconsistent]_ [CarruthInconsistent]_. | 
|  | * Some large-scale renaming may happen anyway, leading to its disadvantages | 
|  | without any mitigations. | 
|  |  | 
|  | Big bang | 
|  | -------- | 
|  |  | 
|  | With this approach, variables will be renamed by an automated script in a series | 
|  | of large commits. | 
|  |  | 
|  | The principle advantage of this approach is that it minimises the cost of | 
|  | inconsistency [BradburyTransition]_ [RobinsonTransition]_. | 
|  |  | 
|  | It goes against a policy of avoiding large-scale reformatting of existing code | 
|  | [GreeneDistinguish]_. | 
|  |  | 
|  | It has been suggested that LLD would be a good starter project for the renaming | 
|  | [Ueyama]_. | 
|  |  | 
|  | Keeping git blame usable | 
|  | ************************ | 
|  |  | 
|  | ``git blame`` (or ``git annotate``) permits quickly identifying the commit that | 
|  | changed a given line in a file. After renaming variables, many lines will show | 
|  | as being changed by that one commit, requiring a further invocation of ``git | 
|  | blame`` to identify prior, more interesting commits [GreeneGitBlame]_ | 
|  | [RicciAcronyms]_. | 
|  |  | 
|  | **Mitigation**: `git-hyper-blame | 
|  | <https://commondatastorage.googleapis.com/chrome-infra-docs/flat/depot_tools/docs/html/git-hyper-blame.html>`_ | 
|  | can ignore or "look through" a given set of commits. | 
|  | A ``.git-blame-ignore-revs`` file identifying the variable renaming commits | 
|  | could be added to the LLVM git repository root directory. | 
|  | It is being `investigated | 
|  | <https://public-inbox.org/git/20190324235020.49706-1-michael@platin.gs/>`_ | 
|  | whether similar functionality could be added to ``git blame`` itself. | 
|  |  | 
|  | Minimising cost of downstream merges | 
|  | ************************************ | 
|  |  | 
|  | There are many forks of LLVM with downstream changes. Merging a large-scale | 
|  | renaming change could be difficult for the fork maintainers. | 
|  |  | 
|  | **Mitigation**: A large-scale renaming would be automated. A fork maintainer can | 
|  | merge from the commit immediately before the renaming, then apply the renaming | 
|  | script to their own branch. They can then merge again from the renaming commit, | 
|  | resolving all conflicts by choosing their own version. This could be tested on | 
|  | the [SVE]_ fork. | 
|  |  | 
|  | Provisional Plan | 
|  | ================ | 
|  |  | 
|  | This is a provisional plan for the `Big bang`_ approach. It has not been agreed. | 
|  |  | 
|  | #. Investigate improving ``git blame``. The extent to which it can be made to | 
|  | "look through" commits may impact how big a change can be made. | 
|  |  | 
|  | #. Write a script to expand acronyms. | 
|  |  | 
|  | #. Experiment and perform dry runs of the various refactoring options. | 
|  | Results can be published in forks of the LLVM Git repository. | 
|  |  | 
|  | #. Consider the evidence and agree on the new policy. | 
|  |  | 
|  | #. Agree & announce a date for the renaming of the starter project (LLD). | 
|  |  | 
|  | #. Update the `policy page <../CodingStandards.html>`_. This will explain the | 
|  | old and new rules and which projects each applies to. | 
|  |  | 
|  | #. Refactor the starter project in two commits: | 
|  |  | 
|  | 1. Add or change the project's .clang-tidy to reflect the agreed rules. | 
|  | (This is in a separate commit to enable the merging process described in | 
|  | `Minimising cost of downstream merges`_). | 
|  | Also update the project list on the policy page. | 
|  | 2. Apply ``clang-tidy`` to the project's files, with only the | 
|  | ``readability-identifier-naming`` rules enabled. ``clang-tidy`` will also | 
|  | reformat the affected lines according to the rules in ``.clang-format``. | 
|  | It is anticipated that this will be a good dog-fooding opportunity for | 
|  | clang-tidy, and bugs should be fixed in the process, likely including: | 
|  |  | 
|  | * `readability-identifier-naming incorrectly fixes lambda capture | 
|  | <https://bugs.llvm.org/show_bug.cgi?id=41119>`_. | 
|  | * `readability-identifier-naming incorrectly fixes variables which | 
|  | become keywords <https://bugs.llvm.org/show_bug.cgi?id=41120>`_. | 
|  | * `readability-identifier-naming misses fixing member variables in | 
|  | destructor <https://bugs.llvm.org/show_bug.cgi?id=41122>`_. | 
|  |  | 
|  | #. Gather feedback and refine the process as appropriate. | 
|  |  | 
|  | #. Apply the process to the following projects, with a suitable delay between | 
|  | each (at least 4 weeks after the first change, at least 2 weeks subsequently) | 
|  | to allow gathering further feedback. | 
|  | This list should exclude projects that must adhere to an externally defined | 
|  | standard e.g. libcxx. | 
|  | The list is roughly in chronological order of renaming. | 
|  | Some items may not make sense to rename individually - it is expected that | 
|  | this list will change following experimentation: | 
|  |  | 
|  | * TableGen | 
|  | * llvm/tools | 
|  | * clang-tools-extra | 
|  | * clang | 
|  | * ARM backend | 
|  | * AArch64 backend | 
|  | * AMDGPU backend | 
|  | * ARC backend | 
|  | * AVR backend | 
|  | * BPF backend | 
|  | * Hexagon backend | 
|  | * Lanai backend | 
|  | * MIPS backend | 
|  | * NVPTX backend | 
|  | * PowerPC backend | 
|  | * RISC-V backend | 
|  | * Sparc backend | 
|  | * SystemZ backend | 
|  | * WebAssembly backend | 
|  | * X86 backend | 
|  | * XCore backend | 
|  | * libLTO | 
|  | * Debug Information | 
|  | * Remainder of llvm | 
|  | * compiler-rt | 
|  | * libunwind | 
|  | * openmp | 
|  | * parallel-libs | 
|  | * polly | 
|  | * lldb | 
|  |  | 
|  | #. Remove the old variable name rule from the policy page. | 
|  |  | 
|  | #. Repeat many of the steps in the sequence, using a script to expand acronyms. | 
|  |  | 
|  | References | 
|  | ========== | 
|  |  | 
|  | .. [LLDB] LLDB Coding Conventions https://llvm.org/svn/llvm-project/lldb/branches/release_39/www/lldb-coding-conventions.html | 
|  | .. [Google] Google C++ Style Guide https://google.github.io/styleguide/cppguide.html#Variable_Names | 
|  | .. [WebKit] WebKit Code Style Guidelines https://webkit.org/code-style-guidelines/#names | 
|  | .. [Qt] Qt Coding Style https://wiki.qt.io/Qt_Coding_Style#Declaring_variables | 
|  | .. [Rust] Rust naming conventions https://doc.rust-lang.org/1.0.0/style/style/naming/README.html | 
|  | .. [Swift] Swift API Design Guidelines https://swift.org/documentation/api-design-guidelines/#general-conventions | 
|  | .. [Python] Style Guide for Python Code https://www.python.org/dev/peps/pep-0008/#function-and-variable-names | 
|  | .. [Mozilla] Mozilla Coding style: Prefixes https://firefox-source-docs.mozilla.org/tools/lint/coding-style/coding_style_cpp.html#prefixes | 
|  | .. [SVE] LLVM with support for SVE https://github.com/ARM-software/LLVM-SVE | 
|  | .. [AminiInconsistent] Mehdi Amini, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130329.html | 
|  | .. [ArsenaultAgree] Matt Arsenault, http://lists.llvm.org/pipermail/llvm-dev/2019-February/129934.html | 
|  | .. [BeylsDistinguish] Kristof Beyls, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130292.html | 
|  | .. [BradburyConcern] Alex Bradbury, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130266.html | 
|  | .. [BradburyTransition] Alex Bradbury, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130388.html | 
|  | .. [CarruthAcronym] Chandler Carruth, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130313.html | 
|  | .. [CarruthCamelBack] Chandler Carruth, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130214.html | 
|  | .. [CarruthDistinguish] Chandler Carruth, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130310.html | 
|  | .. [CarruthFunction] Chandler Carruth, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130309.html | 
|  | .. [CarruthInconsistent] Chandler Carruth, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130312.html | 
|  | .. [CarruthLower] Chandler Carruth, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130430.html | 
|  | .. [DasInconsistent] Sanjoy Das, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130304.html | 
|  | .. [DenisovCamelBack] Alex Denisov, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130179.html | 
|  | .. [EmersonConcern] Amara Emerson, http://lists.llvm.org/pipermail/llvm-dev/2019-February/129894.html | 
|  | .. [GreeneDistinguish] David Greene, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130425.html | 
|  | .. [GreeneGitBlame] David Greene, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130228.html | 
|  | .. [HendersonPrefix] James Henderson, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130465.html | 
|  | .. [HähnleDistinguish] Nicolai Hähnle, http://lists.llvm.org/pipermail/llvm-dev/2019-February/129923.html | 
|  | .. [IvanovicDistinguish] Nemanja Ivanovic, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130249.html | 
|  | .. [JonesDistinguish] JD Jones, http://lists.llvm.org/pipermail/llvm-dev/2019-February/129926.html | 
|  | .. [LattnerAcronym] Chris Lattner, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130353.html | 
|  | .. [LattnerAgree] Chris Latter, http://lists.llvm.org/pipermail/llvm-dev/2019-February/129907.html | 
|  | .. [LattnerFunction] Chris Lattner, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130630.html | 
|  | .. [LattnerLower] Chris Lattner, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130629.html | 
|  | .. [LattnerTransition] Chris Lattner, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130355.html | 
|  | .. [MalyutinDistinguish] Danila Malyutin, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130320.html | 
|  | .. [ParzyszekAcronym] Krzysztof Parzyszek, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130306.html | 
|  | .. [ParzyszekAcronym2] Krzysztof Parzyszek, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130323.html | 
|  | .. [ParzyszekDistinguish] Krzysztof Parzyszek, http://lists.llvm.org/pipermail/llvm-dev/2019-February/129941.html | 
|  | .. [PicusAcronym] Diana Picus, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130318.html | 
|  | .. [ReamesConcern] Philip Reames, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130181.html | 
|  | .. [RicciAcronyms] Bruno Ricci, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130328.html | 
|  | .. [RobinsonAgree] Paul Robinson, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130111.html | 
|  | .. [RobinsonDistinguish] Paul Robinson, http://lists.llvm.org/pipermail/llvm-dev/2019-February/129920.html | 
|  | .. [RobinsonDistinguish2] Paul Robinson, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130229.html | 
|  | .. [RobinsonTransition] Paul Robinson, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130415.html | 
|  | .. [TurnerCamelBack] Zachary Turner, https://reviews.llvm.org/D57896#1402264 | 
|  | .. [TurnerLLDB] Zachary Turner, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130213.html | 
|  | .. [Ueyama] Rui Ueyama, http://lists.llvm.org/pipermail/llvm-dev/2019-February/130435.html |