|  | <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN" | 
|  | "http://www.w3.org/TR/html4/strict.dtd"> | 
|  | <html> | 
|  | <head> | 
|  | <meta http-equiv="content-type" content="text/html; charset=iso-8859-1"> | 
|  | <title>Clang - Expressive Diagnostics</title> | 
|  | <link type="text/css" rel="stylesheet" href="menu.css"> | 
|  | <link type="text/css" rel="stylesheet" href="content.css"> | 
|  | <style type="text/css"> | 
|  | .loc { font-weight: bold; } | 
|  | .err { color:red; font-weight: bold; } | 
|  | .warn { color:magenta; font-weight: bold; } | 
|  | .note { color:gray; font-weight: bold; } | 
|  | .msg { font-weight: bold; } | 
|  | .cmd { font-style: italic; } | 
|  | .snip { } | 
|  | .point { color:green; font-weight: bold; } | 
|  | </style> | 
|  | </head> | 
|  | <body> | 
|  |  | 
|  | <!--#include virtual="menu.html.incl"--> | 
|  |  | 
|  | <div id="content"> | 
|  |  | 
|  |  | 
|  | <!--=======================================================================--> | 
|  | <h1>Expressive Diagnostics</h1> | 
|  | <!--=======================================================================--> | 
|  |  | 
|  | <p>In addition to being fast and functional, we aim to make Clang extremely user | 
|  | friendly.  As far as a command-line compiler goes, this basically boils down to | 
|  | making the diagnostics (error and warning messages) generated by the compiler | 
|  | be as useful as possible.  There are several ways that we do this.  This section | 
|  | talks about the experience provided by the command line compiler, contrasting | 
|  | Clang output to GCC 4.9's output in some cases. | 
|  | </p> | 
|  |  | 
|  | <h2>Column Numbers and Caret Diagnostics</h2> | 
|  |  | 
|  | <p>First, all diagnostics produced by clang include full column number | 
|  | information. The clang command-line compiler driver uses this information | 
|  | to print "point diagnostics". | 
|  | (IDEs can use the information to display in-line error markup.) | 
|  | This is nice because it makes it very easy to understand exactly | 
|  | what is wrong in a particular piece of code.</p> | 
|  |  | 
|  | <p>The point (the green "^" character) exactly shows where the problem is, even | 
|  | inside of a string.  This makes it really easy to jump to the problem and | 
|  | helps when multiple instances of the same character occur on a line. (We'll | 
|  | revisit this more in following examples.)</p> | 
|  |  | 
|  | <pre> | 
|  | $ <span class="cmd">clang -fsyntax-only format-strings.c</span> | 
|  | <span class="loc">format-strings.c:91:13:</span> <span class="warn">warning:</span> <span class="msg">'.*' specified field precision is missing a matching 'int' argument</span> | 
|  | <span class="snip" >  printf("%.*d");</span> | 
|  | <span class="point">            ^</span> | 
|  | </pre> | 
|  |  | 
|  | <p>Note that modern versions of GCC have followed Clang's lead, and are | 
|  | now able to give a column for a diagnostic, and include a snippet of source | 
|  | text in the result. However, Clang's column number is much more accurate, | 
|  | pointing at the problematic format specifier, rather than the <tt>)</tt> | 
|  | character the parser had reached when the problem was detected. | 
|  | Also, Clang's diagnostic is colored by default, making it easier to | 
|  | distinguish from nearby text.</p> | 
|  |  | 
|  | <h2>Range Highlighting for Related Text</h2> | 
|  |  | 
|  | <p>Clang captures and accurately tracks range information for expressions, | 
|  | statements, and other constructs in your program and uses this to make | 
|  | diagnostics highlight related information.  In the following somewhat | 
|  | nonsensical example you can see that you don't even need to see the original source code to | 
|  | understand what is wrong based on the Clang error. Because clang prints a | 
|  | point, you know exactly <em>which</em> plus it is complaining about.  The range | 
|  | information highlights the left and right side of the plus which makes it | 
|  | immediately obvious what the compiler is talking about. | 
|  | Range information is very useful for | 
|  | cases involving precedence issues and many other cases.</p> | 
|  |  | 
|  | <pre> | 
|  | $ <span class="cmd">gcc-4.9 -fsyntax-only t.c</span> | 
|  | t.c: In function 'int f(int, int)': | 
|  | t.c:7:39: error: invalid operands to binary + (have 'int' and 'struct A') | 
|  | return y + func(y ? ((SomeA.X + 40) + SomeA) / 42 + SomeA.X : SomeA.X); | 
|  | ^ | 
|  | $ <span class="cmd">clang -fsyntax-only t.c</span> | 
|  | <span class="loc">t.c:7:39:</span> <span class="err">error:</span> <span class="msg">invalid operands to binary expression ('int' and 'struct A')</span> | 
|  | <span class="snip" >  return y + func(y ? ((SomeA.X + 40) + SomeA) / 42 + SomeA.X : SomeA.X);</span> | 
|  | <span class="point">                       ~~~~~~~~~~~~~~ ^ ~~~~~</span> | 
|  | </pre> | 
|  |  | 
|  | <h2>Precision in Wording</h2> | 
|  |  | 
|  | <p>A detail is that we have tried really hard to make the diagnostics that come | 
|  | out of clang contain exactly the pertinent information about what is wrong and | 
|  | why.  In the example above, we tell you what the inferred types are for | 
|  | the left and right hand sides, and we don't repeat what is obvious from the | 
|  | point (e.g., that this is a "binary +").</p> | 
|  |  | 
|  | <p>Many other examples abound. In the following example, not only do we tell you | 
|  | that there is a problem with the <tt>*</tt> | 
|  | and point to it, we say exactly why and tell you what the type is (in case it is | 
|  | a complicated subexpression, such as a call to an overloaded function).  This | 
|  | sort of attention to detail makes it much easier to understand and fix problems | 
|  | quickly.</p> | 
|  |  | 
|  | <pre> | 
|  | $ <span class="cmd">gcc-4.9 -fsyntax-only t.c</span> | 
|  | t.c:5:11: error: invalid type argument of unary '*' (have 'int') | 
|  | return *SomeA.X; | 
|  | ^ | 
|  | $ <span class="cmd">clang -fsyntax-only t.c</span> | 
|  | <span class="loc">t.c:5:11:</span> <span class="err">error:</span> <span class="msg">indirection requires pointer operand ('int' invalid)</span> | 
|  | <span class="snip" >  int y = *SomeA.X;</span> | 
|  | <span class="point">          ^~~~~~~~</span> | 
|  | </pre> | 
|  |  | 
|  | <h2>Typedef Preservation and Selective Unwrapping</h2> | 
|  |  | 
|  | <p>Many programmers use high-level user defined types, typedefs, and other | 
|  | syntactic sugar to refer to types in their program.  This is useful because they | 
|  | can abbreviate otherwise very long types and it is useful to preserve the | 
|  | typename in diagnostics.  However, sometimes very simple typedefs can wrap | 
|  | trivial types and it is important to strip off the typedef to understand what | 
|  | is going on.  Clang aims to handle both cases well.<p> | 
|  |  | 
|  | <p>The following example shows where it is important to preserve | 
|  | a typedef in C.</p> | 
|  |  | 
|  | <pre> | 
|  | $ <span class="cmd">clang -fsyntax-only t.c</span> | 
|  | <span class="loc">t.c:15:11:</span> <span class="err">error:</span> <span class="msg">can't convert between vector values of different size ('__m128' and 'int const *')</span> | 
|  | <span class="snip">  myvec[1]/P;</span> | 
|  | <span class="point">  ~~~~~~~~^~</span> | 
|  | </pre> | 
|  |  | 
|  | <p>The following example shows where it is useful for the compiler to expose | 
|  | underlying details of a typedef. If the user was somehow confused about how the | 
|  | system "pid_t" typedef is defined, Clang helpfully displays it with "aka".</p> | 
|  |  | 
|  | <pre> | 
|  | $ <span class="cmd">clang -fsyntax-only t.c</span> | 
|  | <span class="loc">t.c:13:9:</span> <span class="err">error:</span> <span class="msg">member reference base type 'pid_t' (aka 'int') is not a structure or union</span> | 
|  | <span class="snip">  myvar = myvar.x;</span> | 
|  | <span class="point">          ~~~~~ ^</span> | 
|  | </pre> | 
|  |  | 
|  | <p>In C++, type preservation includes retaining any qualification written into type names. For example, if we take a small snippet of code such as: | 
|  |  | 
|  | <blockquote> | 
|  | <pre> | 
|  | namespace services { | 
|  | struct WebService {  }; | 
|  | } | 
|  | namespace myapp { | 
|  | namespace servers { | 
|  | struct Server {  }; | 
|  | } | 
|  | } | 
|  |  | 
|  | using namespace myapp; | 
|  | void addHTTPService(servers::Server const &server, ::services::WebService const *http) { | 
|  | server += http; | 
|  | } | 
|  | </pre> | 
|  | </blockquote> | 
|  |  | 
|  | <p>and then compile it, we see that Clang is both providing accurate information and is retaining the types as written by the user (e.g., "servers::Server", "::services::WebService"): | 
|  |  | 
|  | <pre> | 
|  | $ <span class="cmd">clang -fsyntax-only t.cpp</span> | 
|  | <span class="loc">t.cpp:9:10:</span> <span class="err">error:</span> <span class="msg">invalid operands to binary expression ('servers::Server const' and '::services::WebService const *')</span> | 
|  | <span class="snip">server += http;</span> | 
|  | <span class="point">~~~~~~ ^  ~~~~</span> | 
|  | </pre> | 
|  |  | 
|  | <p>Naturally, type preservation extends to uses of templates, and Clang retains information about how a particular template specialization (like <code>std::vector<Real></code>) was spelled within the source code. For example:</p> | 
|  |  | 
|  | <pre> | 
|  | $ <span class="cmd">clang -fsyntax-only t.cpp</span> | 
|  | <span class="loc">t.cpp:12:7:</span> <span class="err">error:</span> <span class="msg">incompatible type assigning 'vector<Real>', expected 'std::string' (aka 'class std::basic_string<char>')</span> | 
|  | <span class="snip">str = vec</span>; | 
|  | <span class="point">^ ~~~</span> | 
|  | </pre> | 
|  |  | 
|  | <h2>Fix-it Hints</h2> | 
|  |  | 
|  | <p>"Fix-it" hints provide advice for fixing small, localized problems | 
|  | in source code. When Clang produces a diagnostic about a particular | 
|  | problem that it can work around (e.g., non-standard or redundant | 
|  | syntax, missing keywords, common mistakes, etc.), it may also provide | 
|  | specific guidance in the form of a code transformation to correct the | 
|  | problem. In the following example, Clang warns about the use of a GCC | 
|  | extension that has been considered obsolete since 1993. The underlined | 
|  | code should be removed, then replaced with the code below the | 
|  | point line (".x =" or ".y =", respectively).</p> | 
|  |  | 
|  | <pre> | 
|  | $ <span class="cmd">clang t.c</span> | 
|  | <span class="loc">t.c:5:28:</span> <span class="warn">warning:</span> <span class="msg">use of GNU old-style field designator extension</span> | 
|  | <span class="snip">struct point origin = { x: 0.0, y: 0.0 };</span> | 
|  | <span class="err">~~</span> <span class="msg"><span class="point">^</span></span> | 
|  | <span class="snip">.x = </span> | 
|  | <span class="loc">t.c:5:36:</span> <span class="warn">warning:</span> <span class="msg">use of GNU old-style field designator extension</span> | 
|  | <span class="snip">struct point origin = { x: 0.0, y: 0.0 };</span> | 
|  | <span class="err">~~</span> <span class="msg"><span class="point">^</span></span> | 
|  | <span class="snip">.y = </span> | 
|  | </pre> | 
|  |  | 
|  | <p>"Fix-it" hints are most useful for | 
|  | working around common user errors and misconceptions. For example, C++ users | 
|  | commonly forget the syntax for explicit specialization of class templates, | 
|  | as in the error in the following example. Again, after describing the problem, | 
|  | Clang provides the fix--add <code>template<></code>--as part of the | 
|  | diagnostic.<p> | 
|  |  | 
|  | <pre> | 
|  | $ <span class="cmd">clang t.cpp</span> | 
|  | <span class="loc">t.cpp:9:3:</span> <span class="err">error:</span> <span class="msg">template specialization requires 'template<>'</span> | 
|  | struct iterator_traits<file_iterator> { | 
|  | <span class="point">^</span> | 
|  | <span class="snip">template<> </span> | 
|  | </pre> | 
|  |  | 
|  | <h2>Template Type Diffing</h2> | 
|  |  | 
|  | <p>Templates types can be long and difficult to read.  More so when part of an | 
|  | error message.  Instead of just printing out the type name, Clang has enough | 
|  | information to remove the common elements and highlight the differences.  To | 
|  | show the template structure more clearly, the templated type can also be | 
|  | printed as an indented text tree.</p> | 
|  |  | 
|  | Default: template diff with type elision | 
|  | <pre> | 
|  | <span class="loc">t.cc:4:5:</span> <span class="note">note:</span> candidate function not viable: no known conversion from 'vector<map<[...], <span class="template-highlight">float</span>>>' to 'vector<map<[...], <span class="template-highlight">double</span>>>' for 1st argument; | 
|  | </pre> | 
|  | -fno-elide-type: template diff without elision | 
|  | <pre> | 
|  | <span class="loc">t.cc:4:5:</span> <span class="note">note:</span> candidate function not viable: no known conversion from 'vector<map<int, <span class="template-highlight">float</span>>>' to 'vector<map<int, <span class="template-highlight">double</span>>>' for 1st argument; | 
|  | </pre> | 
|  | -fdiagnostics-show-template-tree: template tree printing with elision | 
|  | <pre> | 
|  | <span class="loc">t.cc:4:5:</span> <span class="note">note:</span> candidate function not viable: no known conversion for 1st argument; | 
|  | vector< | 
|  | map< | 
|  | [...], | 
|  | [<span class="template-highlight">float</span> != <span class="template-highlight">double</span>]>> | 
|  | </pre> | 
|  | -fdiagnostics-show-template-tree -fno-elide-type: template tree printing with no elision | 
|  | <pre> | 
|  | <span class="loc">t.cc:4:5:</span> <span class="note">note:</span> candidate function not viable: no known conversion for 1st argument; | 
|  | vector< | 
|  | map< | 
|  | int, | 
|  | [<span class="template-highlight">float</span> != <span class="template-highlight">double</span>]>> | 
|  | </pre> | 
|  |  | 
|  | <h2>Automatic Macro Expansion</h2> | 
|  |  | 
|  | <p>Many errors happen in macros that are sometimes deeply nested.  With | 
|  | traditional compilers, you need to dig deep into the definition of the macro to | 
|  | understand how you got into trouble.  The following simple example shows how | 
|  | Clang helps you out by automatically printing instantiation information and | 
|  | nested range information for diagnostics as they are instantiated through macros | 
|  | and also shows how some of the other pieces work in a bigger example.</p> | 
|  |  | 
|  | <pre> | 
|  | $ <span class="cmd">clang -fsyntax-only t.c</span> | 
|  | <span class="loc">t.c:80:3:</span> <span class="err">error:</span> <span class="msg">invalid operands to binary expression ('typeof(P)' (aka 'struct mystruct') and 'typeof(F)' (aka 'float'))</span> | 
|  | <span class="snip">  X = MYMAX(P, F);</span> | 
|  | <span class="point">      ^~~~~~~~~~~</span> | 
|  | <span class="loc">t.c:76:94:</span> <span class="note">note:</span> expanded from: | 
|  | <span class="snip">#define MYMAX(A,B)    __extension__ ({ __typeof__(A) __a = (A); __typeof__(B) __b = (B); __a < __b ? __b : __a; })</span> | 
|  | <span class="point">                                                                                         ~~~ ^ ~~~</span> | 
|  | </pre> | 
|  |  | 
|  | <p>Here's another real world warning that occurs in the "window" Unix package (which | 
|  | implements the "wwopen" class of APIs):</p> | 
|  |  | 
|  | <pre> | 
|  | $ <span class="cmd">clang -fsyntax-only t.c</span> | 
|  | <span class="loc">t.c:22:2:</span> <span class="warn">warning:</span> <span class="msg">type specifier missing, defaults to 'int'</span> | 
|  | <span class="snip">        ILPAD();</span> | 
|  | <span class="point">        ^</span> | 
|  | <span class="loc">t.c:17:17:</span> <span class="note">note:</span> expanded from: | 
|  | <span class="snip">#define ILPAD() PAD((NROW - tt.tt_row) * 10)    /* 1 ms per char */</span> | 
|  | <span class="point">                ^</span> | 
|  | <span class="loc">t.c:14:2:</span> <span class="note">note:</span> expanded from: | 
|  | <span class="snip">        register i; \</span> | 
|  | <span class="point">        ^</span> | 
|  | </pre> | 
|  |  | 
|  | <p>In practice, we've found that Clang's treatment of macros is actually more useful in multiply nested | 
|  | macros than in simple ones.</p> | 
|  |  | 
|  | <h2>Quality of Implementation and Attention to Detail</h2> | 
|  |  | 
|  | <p>Finally, we have put a lot of work polishing the little things, because | 
|  | little things add up over time and contribute to a great user experience.</p> | 
|  |  | 
|  | <p>The following example shows that we recover from the simple case of | 
|  | forgetting a ; after a struct definition much better than GCC.</p> | 
|  |  | 
|  | <pre> | 
|  | $ <span class="cmd">cat t.cc</span> | 
|  | template<class T> | 
|  | class a {}; | 
|  | struct b {} | 
|  | a<int> c; | 
|  | $ <span class="cmd">gcc-4.9 t.cc</span> | 
|  | t.cc:4:8: error: invalid declarator before 'c' | 
|  | a<int> c; | 
|  | ^ | 
|  | $ <span class="cmd">clang t.cc</span> | 
|  | <span class="loc">t.cc:3:12:</span> <span class="err">error:</span> <span class="msg">expected ';' after struct</span> | 
|  | <span class="snip" >struct b {}</span> | 
|  | <span class="point">           ^</span> | 
|  | <span class="point">           ;</span> | 
|  | </pre> | 
|  |  | 
|  | <p>The following example shows that we diagnose and recover from a missing | 
|  | <tt>typename</tt> keyword well, even in complex circumstances where GCC | 
|  | cannot cope.</p> | 
|  |  | 
|  | <pre> | 
|  | $ <span class="cmd">cat t.cc</span> | 
|  | template<class T> void f(T::type) { } | 
|  | struct A { }; | 
|  | void g() | 
|  | { | 
|  | A a; | 
|  | f<A>(a); | 
|  | } | 
|  | $ <span class="cmd">gcc-4.9 t.cc</span> | 
|  | t.cc:1:33: error: variable or field 'f' declared void | 
|  | template<class T> void f(T::type) { } | 
|  | ^ | 
|  | t.cc: In function 'void g()': | 
|  | t.cc:6:5: error: 'f' was not declared in this scope | 
|  | f<A>(a); | 
|  | ^ | 
|  | t.cc:6:8: error: expected primary-expression before '>' token | 
|  | f<A>(a); | 
|  | ^ | 
|  | $ <span class="cmd">clang t.cc</span> | 
|  | <span class="loc">t.cc:1:26:</span> <span class="err">error:</span> <span class="msg">missing 'typename' prior to dependent type name 'T::type'</span> | 
|  | <span class="snip" >template<class T> void f(T::type) { }</span> | 
|  | <span class="point">                         ^~~~~~~</span> | 
|  | <span class="point">                         typename </span> | 
|  | <span class="loc">t.cc:6:5:</span> <span class="err">error:</span> <span class="msg">no matching function for call to 'f'</span> | 
|  | <span class="snip" >    f<A>(a);</span> | 
|  | <span class="point">    ^~~~</span> | 
|  | <span class="loc">t.cc:1:24:</span> <span class="note">note:</span> <span class="msg">candidate template ignored: substitution failure [with T = A]: no type named 'type' in 'A'</span> | 
|  | <span class="snip" >template<class T> void f(T::type) { }</span> | 
|  | <span class="point">                       ^    ~~~~</span> | 
|  | </pre> | 
|  |  | 
|  |  | 
|  |  | 
|  | <p>While each of these details is minor, we feel that they all add up to provide | 
|  | a much more polished experience.</p> | 
|  |  | 
|  | </div> | 
|  | </body> | 
|  | </html> |