This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
include/clang/
-
clang/
-
Driver/
-
CC1Options.td
-
Options.td
-
Frontend/
-
CompilerInstance.h
-
CompilerInvocation.h
-
FrontendActions.h
-
OptReport.h
-
lib/
-
CodeGen/
4
CodeGenAction.cpp
-
Driver/
-
Tools.cpp
-
Frontend/
-
CMakeLists.txt
-
CompilerInvocation.cpp
-
OptReport.cpp
-
FrontendTool/
-
ExecuteCompilerInvocation.cpp
-
test/
-
CodeGen/
-
opt-report.c
-
Driver/
-
opt-report.c

Differential D19678

Annotated-source optimization reports (a.k.a. "listing" files)
AbandonedPublic

Authored by hfinkel on Apr 28 2016, 12:29 PM.

Download Raw Diff

Details

Reviewers

spatel
silviu.baranga
rengolin
anemet
chandlerc
rcox2
mzolotukhin
rjmccall
delena
jmolloy
echristo
zaks.anna
Carrot
congh
rsmith

Summary

This patch implements support for annotated-source optimization reports (a.k.a. "listing" files). Aside from optimizer improvements, this is a top feature request from my performance-engineering team. Most HPC-relevant compilers have some kind of capability along these lines. The DiagnosticInfo infrastructure at the IR level was designed specifically to support the development of this kind of feature, by allowing diagnostic messages to be subclass carrying arbitrary additional payload, although in terms of optimizer feedback, we currently only use this with -Rpass and friends. -Rpass and related options are very useful, but they can generate a lot of output, and that output lacks significant context, making it hard to see if the compiler is really doing what the user expects.

For this optimizer report, I focused on making the output as succinct as possible while providing information on inlining and loop transformations. The goal here is that the source code should still be easily readable in the report. My primary inspiration here is the reports generated by Cray's tools (http://docs.cray.com/books/S-2496-4101/html-S-2496-4101/z1112823641oswald.html). These reports are highly regarded within the HPC community. Intel's compiler, for example, also has an optimization-report capability (https://software.intel.com/sites/default/files/managed/55/b1/new-compiler-optimization-reports.pdf).

$ cat /tmp/v.c
void bar();
void foo() { bar(); }

void Test(int *res, int *c, int *d, int *p, int n) {
  int i;

#pragma clang loop vectorize(assume_safety)
  for (i = 0; i < 1600; i++) {
    res[i] = (p[i] == 0) ? res[i] : res[i] + d[i];
  }

  for (i = 0; i < 16; i++) {
    res[i] = (p[i] == 0) ? res[i] : res[i] + d[i];
  }

  foo();

  foo(); bar(); foo();
}

The patch -flisting and -flisting=filename. For the first form, where the file name is not explicitly specified, the file name is computed automatically just as we do for split-debug output files.

$ clang -O3 -o /tmp/v.o -c /tmp/v.c -flisting
$ cat /tmp/v.lst

< /tmp/v.c
 1     | void bar();
 2     | void foo() { bar(); }
 3     | 
 4     | void Test(int *res, int *c, int *d, int *p, int n) {
 5     |   int i;
 6     | 
 7     | #pragma clang loop vectorize(assume_safety)
 8   V |   for (i = 0; i < 1600; i++) {
 9     |     res[i] = (p[i] == 0) ? res[i] : res[i] + d[i];
10     |   }
11     | 
12     |   for (i = 0; i < 16; i++) {
13  U  |     res[i] = (p[i] == 0) ? res[i] : res[i] + d[i];
14     |   }
15     | 
16 I   |   foo();
17     | 
18     |   foo(); bar(); foo();
   I   |   ^
   I   |                 ^
19     | }
20     |

Each source line gets a prefix giving the line number, and a few columns for important optimizations: inlining, loop unrolling and loop vectorization. An 'I' is printed next to a line where a function was inlined, a 'U' next to an unrolled loop, and 'V' next to a vectorized loop. These are printing on the relevant code line when that seems unambiguous, or on subsequent lines when multiple potential options exist (messages, both positive and negative, from the same optimization with different column numbers are taken to indicate potential ambiguity). When on subsequent lines, a '^' is output in the relevant column. The fact that the 'U' is on the wrong line is also a problem with -Rpass=loop-unroll and may be something we can fix in the backend.

Annotated source for all relevant input files are put into the listing file (each starting with '<' and then the file name).

To see what this looks like for C++ code, here's a small excerpt from CodeGenAction.cpp:

340     |   // If the SMDiagnostic has an inline asm source location, translate it.
341 I   |   FullSourceLoc Loc;
342     |   if (D.getLoc() != SMLoc())
    I   |       ^
    I   |                  ^
    I   |                     ^
343     |     Loc = ConvertBackendLocation(D, Context->getSourceManager());
    I   |           ^
    I   |                                     ^
344     | 
345     |   unsigned DiagID;
346 I   |   switch (D.getKind()) {

There's obvious bikeshedding to do here, and I'm quite open to suggestions. My engineering team often calls these things "listing files", and other tools often name this files with lst as an extension, thus the naming in the patch. Intel's option is -opt-report-file=filename.

After some backend enhancements (to turn the relevant remark types into proper subclasses), I'd like to extend this to also print the vectorization factor, interleaving factor and unrolling factor when relevant. After these enhancements, I'd l imagine the loop annotations might look like V4,2U4 for a loop vectorized with VF == 4 and interleaving by 2, and then partially unrolled by a factor of 4.

Please review.

Diff Detail

Event Timeline

hfinkel updated this revision to Diff 55446.Apr 28 2016, 12:29 PM

hfinkel retitled this revision from to Annotated-source optimization reports (a.k.a. "listing" files).

hfinkel updated this object.

hfinkel added reviewers: rsmith, chandlerc, rcox2, jmolloy, anemet, silviu.baranga, mzolotukhin, spatel, rengolin, delena, Carrot, congh, echristo.

hfinkel added a subscriber: cfe-commits.

Herald added subscribers: mehdi_amini, mcrosier. · View Herald TranscriptApr 28 2016, 12:30 PM

hfinkel added a reviewer: rjmccall.Apr 28 2016, 12:40 PM

I see what you're going for with "listing file", but I like ICC's option name much better, or at least something along those lines.

hfinkel mentioned this in D19397: Initial patch for inlining report .Apr 28 2016, 12:53 PM

My primary inspiration here is the reports generated by Cray's tools (http://docs.cray.com/books/S-2496-4101/html-S-2496-4101/z1112823641oswald.html).

http://docs.cray.com/books/S-2315-52/html-S-2315-52/fixedds0jdeh38.html is a better link.

You give this example:

343     |     Loc = ConvertBackendLocation(D, Context->getSourceManager());
    I   |           ^
    I   |                                     ^

How does this look for a case like p->Foo()->Bar() (where one or both of the calls are inlined)? Can we get the source location to point at the function name instead of the start of the expression to reduce the scope for ambiguity?

lib/CodeGen/CodeGenAction.cpp
637–733	I'd like this to be factored out and moved somewhere more appropriate (such as Frontend). It seems appropriate for CodeGen to generate the data structure here, but it should not be deciding how to format the report nor doing file IO to put it somewhere. I would hope that we can combine this report information with the static analyzer's existing support for generating syntax-highlighted, annotated source code as HTML as a future extension.

In D19678#415902, @rsmith wrote:
You give this example:
343     |     Loc = ConvertBackendLocation(D, Context->getSourceManager());
    I   |           ^
    I   |                                     ^
How does this look for a case like p->Foo()->Bar() (where one or both of the calls are inlined)? Can we get the source location to point at the function name instead of the start of the expression to reduce the scope for ambiguity?

That does not currently work very well (I assume this needs a backend fix, but I'll check).

$ cat /tmp/i.cpp
void ext();

struct Bar {
  void bar() { ext(); }
};

struct Foo {
  Bar *b;

  Bar *foo() { return b; }
};

void test(Foo *f) {
  f->foo()->bar();
}

And we get:

14 I   |   f->foo()->bar();

because both inlining remarks come from the backend with the same column number:

$ clang -O3 -c -o /tmp/i.o /tmp/i.cpp -flisting -Rpass=inline
/tmp/i.cpp:14:3: remark: _ZN3Foo3fooEv inlined into _Z4testP3Foo [-Rpass=inline]
  f->foo()->bar();
  ^
/tmp/i.cpp:14:3: remark: _ZN3Bar3barEv inlined into _Z4testP3Foo [-Rpass=inline]

lib/CodeGen/CodeGenAction.cpp
637–733	I'd like this to be factored out and moved somewhere more appropriate (such as Frontend). It seems appropriate for CodeGen to generate the data structure here, but it should not be deciding how to format the report nor doing file IO to put it somewhere. Makes sense. I would hope that we can combine this report information with the static analyzer's existing support for generating syntax-highlighted, annotated source code as HTML as a future extension. I like this idea.

In D19678#415844, @rjmccall wrote:

I see what you're going for with "listing file", but I like ICC's option name much better, or at least something along those lines.

Sounds good to me. Do you have a preference on -fopt-report vs. -foptimization-report vs. something else? Do you have an opinion on the default file-name extension for the report? Maybe I should name it .opt-report (or something like that)?

hfinkel added a reviewer: zaks.anna.Apr 28 2016, 2:51 PM

In D19678#416059, @hfinkel wrote:

In D19678#415844, @rjmccall wrote:

I see what you're going for with "listing file", but I like ICC's option name much better, or at least something along those lines.

Sounds good to me. Do you have a preference on -fopt-report vs. -foptimization-report vs. something else? Do you have an opinion on the default file-name extension for the report? Maybe I should name it .opt-report (or something like that)?

I don't think we have a consistent abbreviation for that anywhere else in the options (other than -O, I guess), so my inclination would be to spell it out as -foptimization-report.

The extension is just appended to the original filename, so that it ends up something like foo.cpp.opt-report? I don't have an objection to ".opt-report" or even ".lst".

Actually, the Intel compiler distinguishes between an optimization report (-qopt-report) and an annotated listing (-qopt-report-annotate). The optimization report lists the info for optimizations in a hierarchical fashion. To use you example,

icc -c -O3 -qopt-report=1 -qopt-report-file=stderr v.c

yields:

Report from: Interprocedural optimizations [ipo]

INLINING OPTION VALUES:

-inline-factor: 100
-inline-min-size: 20
-inline-max-size: 230
-inline-max-total-size: 2000
-inline-max-per-routine: 10000
-inline-max-per-compile: 500000

Begin optimization report for: foo()

Report from: Interprocedural optimizations [ipo]

INLINE REPORT: (foo()) [1] v.c(2,12)

Report from: Code generation optimizations [cg]

v.c(2,12):remark #34051: REGISTER ALLOCATION : [foo] v.c:2

Hardware registers
    Reserved     :    1[ esp]
    Available    :   23[ eax edx ecx ebx ebp esi edi mm0-mm7 zmm0-zmm7]
    Callee-save  :    4[ ebx ebp esi edi]
    Assigned     :    0[ reg_null]

Routine temporaries
    Total         :       4
        Global    :       0
        Local     :       4
    Regenerable   :       0
    Spilled       :       0

Routine stack
    Variables     :       0 bytes*
        Reads     :       0 [0.00e+00 ~ 0.0%]
        Writes    :       0 [0.00e+00 ~ 0.0%]
    Spills        :       0 bytes*
        Reads     :       0 [0.00e+00 ~ 0.0%]
        Writes    :       0 [0.00e+00 ~ 0.0%]

Notes

    *Non-overlapping variables and spills may share stack space,
     so the total stack size might be less than this.

Begin optimization report for: Test(int *, int *, int *, int *, int)

Report from: Interprocedural optimizations [ipo]

INLINE REPORT: (Test(int *, int *, int *, int *, int)) [2] v.c(4,52)

-> INLINE: (16,3) foo()
-> INLINE: (18,3) foo()
-> INLINE: (18,17) foo()


  Report from: Loop nest, Vector & Auto-parallelization optimizations [loop, vec, par]

LOOP BEGIN at v.c(8,8)
<Peeled loop for vectorization>
LOOP END

LOOP BEGIN at v.c(8,8)

remark #15301: SIMD LOOP WAS VECTORIZED

LOOP END

LOOP BEGIN at v.c(8,8)
<Alternate Alignment Vectorized Loop>
LOOP END

LOOP BEGIN at v.c(8,8)
<Remainder loop for vectorization>

remark #15335: remainder loop was not vectorized: vectorization possible but seems inefficient. Use vector always directive or -vec-threshold0 to override

LOOP END

LOOP BEGIN at v.c(12,3)

remark #15344: loop was not vectorized: vector dependence prevents vectorization. First dependence is shown below. Use level 5 report for details
remark #15346: vector dependence: assumed FLOW dependence between res[i] (13:5) and d[i] (13:5)
remark #25436: completely unrolled by 16

LOOP END

Report from: Code generation optimizations [cg]

v.c(4,52):remark #34051: REGISTER ALLOCATION : [Test] v.c:4

Hardware registers
    Reserved     :    1[ esp]
    Available    :   23[ eax edx ecx ebx ebp esi edi mm0-mm7 zmm0-zmm7]
    Callee-save  :    4[ ebx ebp esi edi]
    Assigned     :   15[ eax edx ecx ebx ebp esi edi zmm0-zmm7]

Routine temporaries
    Total         :     123
        Global    :      47
        Local     :      76
    Regenerable   :       5
    Spilled       :       6

Routine stack
    Variables     :       0 bytes*
        Reads     :       0 [0.00e+00 ~ 0.0%]
        Writes    :       0 [0.00e+00 ~ 0.0%]
    Spills        :       8 bytes*
        Reads     :       5 [1.41e+01 ~ 1.4%]
        Writes    :       3 [3.00e+00 ~ 0.3%]

Notes

    *Non-overlapping variables and spills may share stack space,
     so the total stack size might be less than this.

while the annotated listing looks like:

------- Annotated listing with optimization reports for "/export/iusers/rcox2/rgHF/v.c" -------

INLINING OPTION VALUES:
-inline-factor: 100
-inline-min-size: 20
-inline-max-size: 230
-inline-max-total-size: 2000
-inline-max-per-routine: 10000
-inline-max-per-compile: 500000

1 void bar();
2 void foo() { bar(); }
INLINE REPORT: (foo()) [1] /export/iusers/rcox2/rgHF/v.c(2,12)

/export/iusers/rcox2/rgHF/v.c(2,12):remark #34051: REGISTER ALLOCATION : [foo] /export/iusers/rcox2/rgHF/v.c:2

Hardware registers
Reserved : 1[ esp]
Available : 23[ eax edx ecx ebx ebp esi edi mm0-mm7 zmm0-zmm7]
Callee-save : 4[ ebx ebp esi edi]
Assigned : 0[ reg_null]

Routine temporaries
Total : 4
Global : 0
Local : 4
Regenerable : 0
Spilled : 0

Routine stack
Variables : 0 bytes*
Reads : 0 [0.00e+00 ~ 0.0%]
Writes : 0 [0.00e+00 ~ 0.0%]
Spills : 0 bytes*
Reads : 0 [0.00e+00 ~ 0.0%]
Writes : 0 [0.00e+00 ~ 0.0%]

Notes

*Non-overlapping variables and spills may share stack space,
so the total stack size might be less than this.

3
4 void Test(int *res, int *c, int *d, int *p, int n) {
INLINE REPORT: (Test(int *, int *, int *, int *, int)) [2] /export/iusers/rcox2/rgHF/v.c(4,52)
-> INLINE: (16,3) foo()
-> INLINE: (18,3) foo()
-> INLINE: (18,17) foo()

/export/iusers/rcox2/rgHF/v.c(4,52):remark #34051: REGISTER ALLOCATION : [Test] /export/iusers/rcox2/rgHF/v.c:4

Hardware registers
Reserved : 1[ esp]
Available : 23[ eax edx ecx ebx ebp esi edi mm0-mm7 zmm0-zmm7]
Callee-save : 4[ ebx ebp esi edi]
Assigned : 15[ eax edx ecx ebx ebp esi edi zmm0-zmm7]

Routine temporaries
Total : 123
Global : 47
Local : 76
Regenerable : 5
Spilled : 6

Routine stack
Variables : 0 bytes*
Reads : 0 [0.00e+00 ~ 0.0%]
Writes : 0 [0.00e+00 ~ 0.0%]
Spills : 8 bytes*
Reads : 5 [1.41e+01 ~ 1.4%]
Writes : 3 [3.00e+00 ~ 0.3%]

Notes

*Non-overlapping variables and spills may share stack space,
so the total stack size might be less than this.

5 int i;
6
7 #pragma simd
8 for (i = 0; i < 1600; i++) {

LOOP BEGIN at /export/iusers/rcox2/rgHF/v.c(8,8)
<Peeled loop for vectorization>
LOOP END

LOOP BEGIN at /export/iusers/rcox2/rgHF/v.c(8,8)
remark #15301: SIMD LOOP WAS VECTORIZED
LOOP END

LOOP BEGIN at /export/iusers/rcox2/rgHF/v.c(8,8)
<Alternate Alignment Vectorized Loop>
LOOP END

LOOP BEGIN at /export/iusers/rcox2/rgHF/v.c(8,8)
<Remainder loop for vectorization>
remark #15335: remainder loop was not vectorized: vectorization possible but seems inefficient. Use vector always directive or -vec-threshold0 to override
LOOP END
9 res[i] = (p[i] == 0) ? res[i] : res[i] + d[i];
10 }
11
12 for (i = 0; i < 16; i++) {

LOOP BEGIN at /export/iusers/rcox2/rgHF/v.c(12,3)
remark #15344: loop was not vectorized: vector dependence prevents vectorization. First dependence is shown below. Use level 5 report for details
remark #15346: vector dependence: assumed FLOW dependence between res[i] (13:5) and d[i] (13:5)
remark #25436: completely unrolled by 16
//LOOP END
13 res[i] = (p[i] == 0) ? res[i] : res[i] + d[i];
14 }
15
16 foo();
17
18 foo(); bar(); foo();
19 }

essentially, various parts of the optimization report are inserted into a listing at the appropriate line numbers.

(Note that this is just the default level. More detail can be obtained with -qopt-report=X where X>1 (up to 5 is supported)).

I believe what Hal is proposing in this patch is a very useful light-weight annotation of the source with key information. But I also believe that there is value for a stand-alone opt report with the kind of detailed information I presented in D19397 and the two follow up patches. In general, while this info can be interspersed in the source listing, I believe that for most purposes it is a bit too "busy" in text form. (The Intel compiler also supports annotated html and functionality that feeds into Visual Studio that has received great reviews.)

In D19678#416127, @rcox2 wrote:

Actually, the Intel compiler distinguishes between an optimization report (-qopt-report) and an annotated listing (-qopt-report-annotate).

Interesting; thanks for pointing this out (and for the example).

The optimization report lists the info for optimizations in a hierarchical fashion. To use you example,
icc -c -O3 -qopt-report=1 -qopt-report-file=stderr v.c
yields:
Report from: Interprocedural optimizations [ipo]
INLINING OPTION VALUES:
-inline-factor: 100
-inline-min-size: 20
-inline-max-size: 230
-inline-max-total-size: 2000
-inline-max-per-routine: 10000
-inline-max-per-compile: 500000
Begin optimization report for: foo()
Report from: Interprocedural optimizations [ipo]
INLINE REPORT: (foo()) [1] v.c(2,12)
Report from: Code generation optimizations [cg]
v.c(2,12):remark #34051: REGISTER ALLOCATION : [foo] v.c:2
Hardware registers
    Reserved     :    1[ esp]
    Available    :   23[ eax edx ecx ebx ebp esi edi mm0-mm7 zmm0-zmm7]
    Callee-save  :    4[ ebx ebp esi edi]
    Assigned     :    0[ reg_null]

Routine temporaries
    Total         :       4
        Global    :       0
        Local     :       4
    Regenerable   :       0
    Spilled       :       0

Routine stack
    Variables     :       0 bytes*
        Reads     :       0 [0.00e+00 ~ 0.0%]
        Writes    :       0 [0.00e+00 ~ 0.0%]
    Spills        :       0 bytes*
        Reads     :       0 [0.00e+00 ~ 0.0%]
        Writes    :       0 [0.00e+00 ~ 0.0%]

Notes

    *Non-overlapping variables and spills may share stack space,
     so the total stack size might be less than this.
Begin optimization report for: Test(int *, int *, int *, int *, int)
Report from: Interprocedural optimizations [ipo]
INLINE REPORT: (Test(int *, int *, int *, int *, int)) [2] v.c(4,52)
-> INLINE: (16,3) foo()
-> INLINE: (18,3) foo()
-> INLINE: (18,17) foo()


  Report from: Loop nest, Vector & Auto-parallelization optimizations [loop, vec, par]
LOOP BEGIN at v.c(8,8)
<Peeled loop for vectorization>
LOOP END

LOOP BEGIN at v.c(8,8)
remark #15301: SIMD LOOP WAS VECTORIZED
LOOP END

LOOP BEGIN at v.c(8,8)
<Alternate Alignment Vectorized Loop>
LOOP END

LOOP BEGIN at v.c(8,8)
<Remainder loop for vectorization>
remark #15335: remainder loop was not vectorized: vectorization possible but seems inefficient. Use vector always directive or -vec-threshold0 to override
LOOP END

LOOP BEGIN at v.c(12,3)
remark #15344: loop was not vectorized: vector dependence prevents vectorization. First dependence is shown below. Use level 5 report for details
remark #15346: vector dependence: assumed FLOW dependence between res[i] (13:5) and d[i] (13:5)
remark #25436: completely unrolled by 16
LOOP END
Report from: Code generation optimizations [cg]
v.c(4,52):remark #34051: REGISTER ALLOCATION : [Test] v.c:4
Hardware registers
    Reserved     :    1[ esp]
    Available    :   23[ eax edx ecx ebx ebp esi edi mm0-mm7 zmm0-zmm7]
    Callee-save  :    4[ ebx ebp esi edi]
    Assigned     :   15[ eax edx ecx ebx ebp esi edi zmm0-zmm7]

Routine temporaries
    Total         :     123
        Global    :      47
        Local     :      76
    Regenerable   :       5
    Spilled       :       6

Routine stack
    Variables     :       0 bytes*
        Reads     :       0 [0.00e+00 ~ 0.0%]
        Writes    :       0 [0.00e+00 ~ 0.0%]
    Spills        :       8 bytes*
        Reads     :       5 [1.41e+01 ~ 1.4%]
        Writes    :       3 [3.00e+00 ~ 0.3%]

Notes

    *Non-overlapping variables and spills may share stack space,
     so the total stack size might be less than this.
while the annotated listing looks like:

------- Annotated listing with optimization reports for "/export/iusers/rcox2/rgHF/v.c" -------

INLINING OPTION VALUES:
-inline-factor: 100
-inline-min-size: 20
-inline-max-size: 230
-inline-max-total-size: 2000
-inline-max-per-routine: 10000
-inline-max-per-compile: 500000

1 void bar();
2 void foo() { bar(); }
INLINE REPORT: (foo()) [1] /export/iusers/rcox2/rgHF/v.c(2,12)

/export/iusers/rcox2/rgHF/v.c(2,12):remark #34051: REGISTER ALLOCATION : [foo] /export/iusers/rcox2/rgHF/v.c:2

Hardware registers
Reserved : 1[ esp]
Available : 23[ eax edx ecx ebx ebp esi edi mm0-mm7 zmm0-zmm7]
Callee-save : 4[ ebx ebp esi edi]
Assigned : 0[ reg_null]

Routine temporaries
Total : 4
Global : 0
Local : 4
Regenerable : 0
Spilled : 0

Routine stack
Variables : 0 bytes*
Reads : 0 [0.00e+00 ~ 0.0%]
Writes : 0 [0.00e+00 ~ 0.0%]
Spills : 0 bytes*
Reads : 0 [0.00e+00 ~ 0.0%]
Writes : 0 [0.00e+00 ~ 0.0%]

Notes

*Non-overlapping variables and spills may share stack space,
so the total stack size might be less than this.

3
4 void Test(int *res, int *c, int *d, int *p, int n) {
INLINE REPORT: (Test(int *, int *, int *, int *, int)) [2] /export/iusers/rcox2/rgHF/v.c(4,52)
-> INLINE: (16,3) foo()
-> INLINE: (18,3) foo()
-> INLINE: (18,17) foo()

/export/iusers/rcox2/rgHF/v.c(4,52):remark #34051: REGISTER ALLOCATION : [Test] /export/iusers/rcox2/rgHF/v.c:4

Hardware registers
Reserved : 1[ esp]
Available : 23[ eax edx ecx ebx ebp esi edi mm0-mm7 zmm0-zmm7]
Callee-save : 4[ ebx ebp esi edi]
Assigned : 15[ eax edx ecx ebx ebp esi edi zmm0-zmm7]

Routine temporaries
Total : 123
Global : 47
Local : 76
Regenerable : 5
Spilled : 6

Routine stack
Variables : 0 bytes*
Reads : 0 [0.00e+00 ~ 0.0%]
Writes : 0 [0.00e+00 ~ 0.0%]
Spills : 8 bytes*
Reads : 5 [1.41e+01 ~ 1.4%]
Writes : 3 [3.00e+00 ~ 0.3%]

Notes

*Non-overlapping variables and spills may share stack space,
so the total stack size might be less than this.

5 int i;
6
7 #pragma simd
8 for (i = 0; i < 1600; i++) {

LOOP BEGIN at /export/iusers/rcox2/rgHF/v.c(8,8)
<Peeled loop for vectorization>
LOOP END

LOOP BEGIN at /export/iusers/rcox2/rgHF/v.c(8,8)
remark #15301: SIMD LOOP WAS VECTORIZED
LOOP END

LOOP BEGIN at /export/iusers/rcox2/rgHF/v.c(8,8)
<Alternate Alignment Vectorized Loop>
LOOP END

LOOP BEGIN at /export/iusers/rcox2/rgHF/v.c(8,8)
<Remainder loop for vectorization>
remark #15335: remainder loop was not vectorized: vectorization possible but seems inefficient. Use vector always directive or -vec-threshold0 to override
LOOP END
9 res[i] = (p[i] == 0) ? res[i] : res[i] + d[i];
10 }
11
12 for (i = 0; i < 16; i++) {

LOOP BEGIN at /export/iusers/rcox2/rgHF/v.c(12,3)
remark #15344: loop was not vectorized: vector dependence prevents vectorization. First dependence is shown below. Use level 5 report for details
remark #15346: vector dependence: assumed FLOW dependence between res[i] (13:5) and d[i] (13:5)
remark #25436: completely unrolled by 16
//LOOP END
13 res[i] = (p[i] == 0) ? res[i] : res[i] + d[i];
14 }
15
16 foo();
17
18 foo(); bar(); foo();
19 }

essentially, various parts of the optimization report are inserted into a listing at the appropriate line numbers.

(Note that this is just the default level. More detail can be obtained with -qopt-report=X where X>1 (up to 5 is supported)).

I believe what Hal is proposing in this patch is a very useful light-weight annotation of the source with key information. But I also believe that there is value for a stand-alone opt report with the kind of detailed information I presented in D19397 and the two follow up patches.

To be clear, I agree. I'd like to have both.

In general, while this info can be interspersed in the source listing, I believe that for most purposes it is a bit too "busy" in text form. (The Intel compiler also supports annotated html and functionality that feeds into Visual Studio that has received great reviews.)

I think this piggybacks on Richard's suggestion regarding later integration with the static analyzer's output capabilities. We should definitely explore how this might be done.

anemet added inline comments.Apr 28 2016, 5:20 PM

lib/CodeGen/CodeGenAction.cpp
706–709	Should the abbreviation be somehow part of the optimization remark API and passed in just like the pass name? It would be nice if someone added optimization remark for a new opt, it would show up here automatically. I could see how that could make the output too busy but at least have the option?

In D19678#416039, @hfinkel wrote:
In D19678#415902, @rsmith wrote:
You give this example:
343     |     Loc = ConvertBackendLocation(D, Context->getSourceManager());
    I   |           ^
    I   |                                     ^
How does this look for a case like p->Foo()->Bar() (where one or both of the calls are inlined)? Can we get the source location to point at the function name instead of the start of the expression to reduce the scope for ambiguity?
That does not currently work very well (I assume this needs a backend fix, but I'll check).

Actually, it's a Clang problem. https://llvm.org/bugs/show_bug.cgi?id=27567

hfinkel added inline comments.Apr 28 2016, 10:47 PM

lib/CodeGen/CodeGenAction.cpp
706–709	So long as we're careful in the backend to respect the limited visual real estate and namespace in this kind of report, we could have the optimizations themselves provide a letter. I'm undecided.

Renamed the option from -flisting to -foptimization-report as suggested. Moved I/O-related and formatting-related code into Frontend.

In D19678#416127, @rcox2 wrote:
Actually, the Intel compiler distinguishes between an optimization report (-qopt-report) and an annotated listing (-qopt-report-annotate). The optimization report lists the info for optimizations in a hierarchical fashion. To use you example,
icc -c -O3 -qopt-report=1 -qopt-report-file=stderr v.c
yields:

Robert, John, (et al.), do you think I should change this to have an -foptimization-report-file=<filename> and -foptimization-report, instead of -foptimization-report=<filename>? In the future, when we have multiple kinds of reports (a detailed inlining report, for example), maybe we want to use -foptimization-report=inlining,somethingelse,andmore?

Of course, it would be my preference to mirror the functionality of what is available in the "new" hierarchical form of optimization report Intel compiler. So, I would like to distinguish between what Hal is proposing (which we call an "annotated listing") and what I am proposing, which we call an "optimization report".

If Hal wants to call what he is proposing the "optimization report", then we need to come up with another name for what I am proposing.

To summarize what the Intel compiler has

-qopt-report[=N]     where the default is 2 and the range is 1-5, with 1 having the least detail and 5 having the most detail 
-qopt-report-file=F   where F is a file name or stdout or stderr 
-qopt-report-phase=P where P is a sequence of phases (like ipo,cg, etc.) and only those phases are printed 
-qopt-report-filter=X  where X allows you to filter opt reports only for certain routines or parts of routines

Use of ANY of these implies the generation of an opt report, so you don't need to say:

-qopt-report -qopt-report-file=stderr

since

-qopt-report-file=stderr

is sufficient.

On a slightly different topic ....

One key question I have about Hal's proposal is whether there is any annotation associated with code that is inlined, beyond noting the call site that is inlined.

For example, if we have:

 int foo() { 
     ... 
     loop 
     ....
} 
int main() { 
    ...
    foo(); 
    ...
 }

and foo gets inlined, we have two loops of interest, the loop in foo() and the loop inlined into main(). Each of these could be vectorized, unrolled, etc. and it isn't always the case that both loops would have the same properties. So, does Hal's report indicate info only about the loop in foo(), or are the properties of the two loops ANDed or ORed together and reported next to the loop in foo, or something else?

In general, you want info about both loops, and you can that with a classic optimization report. But it's not clear how to effectively represent this on the lightweight annotated listing. And it is often the case that the inlined loop is actually the more executed one, and therefore more important.

In D19678#419361, @rcox2 wrote:

Of course, it would be my preference to mirror the functionality of what is available in the "new" hierarchical form of optimization report Intel compiler. So, I would like to distinguish between what Hal is proposing (which we call an "annotated listing") and what I am proposing, which we call an "optimization report".

If Hal wants to call what he is proposing the "optimization report", then we need to come up with another name for what I am proposing.

To be clear, I'm fine with calling this something else. How about "optimization summary" or "annotated optimization summary"? I could name the option -fannotated-optimization-summary, for example.

To summarize what the Intel compiler has
-qopt-report[=N]     where the default is 2 and the range is 1-5, with 1 having the least detail and 5 having the most detail 
-qopt-report-file=F   where F is a file name or stdout or stderr 
-qopt-report-phase=P where P is a sequence of phases (like ipo,cg, etc.) and only those phases are printed 
-qopt-report-filter=X  where X allows you to filter opt reports only for certain routines or parts of routines
Use of ANY of these implies the generation of an opt report, so you don't need to say:
-qopt-report -qopt-report-file=stderr
since
-qopt-report-file=stderr
is sufficient.

On a slightly different topic ....
One key question I have about Hal's proposal is whether there is any annotation associated with code that is inlined, beyond noting the call site that is inlined.

For example, if we have:
 int foo() { 
     ... 
     loop 
     ....
} 
int main() { 
    ...
    foo(); 
    ...
 }
and foo gets inlined, we have two loops of interest, the loop in foo() and the loop inlined into main(). Each of these could be vectorized, unrolled, etc. and it isn't always the case that both loops would have the same properties. So, does Hal's report indicate info only about the loop in foo(), or are the properties of the two loops ANDed or ORed together and reported next to the loop in foo, or something else?

In general, you want info about both loops, and you can that with a classic optimization report. But it's not clear how to effectively represent this on the lightweight annotated listing. And it is often the case that the inlined loop is actually the more executed one, and therefore more important.

Currently, the information is ORed together. This has exactly the problem that you indicate, although the problem can be even worse than that: Functions are often transitively inlined multiple times into the same function, and users often want to know if the loops in those functions (or the inlining decisions themselves) differed each time. In the future, I'd like to detect this situation and present this information to the user. The common case (same behavior everywhere) should carry a succinct annotation, but otherwise we might want to do something like this:

1234      |    for (...)
       V  |    ^
          |    * When inlined into foo (file.c:789), doit (d.c:45) -> bar (bar.c:234), not when inlined into work (work.c:345), doit (d.c:45) -> bar (bar.c:254)

Or we might just want to indicate that the annotation applies only to some places where the code was inlined and the user can generate a more-detailed optimization report for more information. I'm certainly open to suggestions, but I'd like to handle this in follow-up because it will likely require backend enhancements as well.

A related issue comes up with templated code; we might want to indicate the types when the optimizations end up being type dependent.

This discussion of the command line interface makes me think that we should be taking Richard's suggestion one step further. Why is Clang's involvement here more than just handing down specific requests for optimization data to LLVM and packaging that information back into some reasonable format? The actual presentation of that data seems like it belongs in a separate library / tool, which can have a rich set of visualization options. This is also a more rigorously testable design, since the tool has a well-defined input format that's not just an incidental by-product of the compiler.

If we did that, then Clang just needs (1) an output filename, (2) an optional list of passes to collect data from, and (3) maybe some stringly-typed configuration data for each.

In D19678#419445, @rjmccall wrote:

This discussion of the command line interface makes me think that we should be taking Richard's suggestion one step further. Why is Clang's involvement here more than just handing down specific requests for optimization data to LLVM and packaging that information back into some reasonable format?

The static analyzer supports outputting its data in plist format (with which I'm not familiar in detail, but it looks like a fairly-simple xml format). Is that close to what you had in mind? Maybe YAML would be better (since LLVM actually has a parser for that)?

The actual presentation of that data seems like it belongs in a separate library / tool, which can have a rich set of visualization options. This is also a more rigorously testable design, since the tool has a well-defined input format that's not just an incidental by-product of the compiler.

I think this makes sense.

If we did that, then Clang just needs (1) an output filename, (2) an optional list of passes to collect data from, and (3) maybe some stringly-typed configuration data for each.

Sure. Although I'd prefer to leave the filtering to the tool unless the I/O requirements become unmanageable. Users don't know, and shouldn't know, what passes do what.

In D19678#420358, @hfinkel wrote:

In D19678#419445, @rjmccall wrote:

This discussion of the command line interface makes me think that we should be taking Richard's suggestion one step further. Why is Clang's involvement here more than just handing down specific requests for optimization data to LLVM and packaging that information back into some reasonable format?

The static analyzer supports outputting its data in plist format (with which I'm not familiar in detail, but it looks like a fairly-simple xml format). Is that close to what you had in mind? Maybe YAML would be better (since LLVM actually has a parser for that)?

YAML makes a lot of sense to me.

If we did that, then Clang just needs (1) an output filename, (2) an optional list of passes to collect data from, and (3) maybe some stringly-typed configuration data for each.

Sure. Although I'd prefer to leave the filtering to the tool unless the I/O requirements become unmanageable. Users don't know, and shouldn't know, what passes do what.

Sure, seems reasonable. So we can start with just some way to turn the feature on and specify a filename.

c-rhodes added a subscriber: c-rhodes.May 19 2016, 1:53 AM

fhahn added a subscriber: fhahn.Sep 13 2016, 4:44 AM

Abandoned in favor of D25225/D25262.

hfinkel mentioned this in rL283398: Add an llvm-opt-report tool to generate basic source-annotated optimization….Oct 5 2016, 3:19 PM

Revision Contents

Path

Size

include/

clang/

Driver/

CC1Options.td

3 lines

Options.td

6 lines

Frontend/

7 lines

11 lines

2 lines

67 lines

lib/

CodeGen/

CodeGenAction.cpp

74 lines

Driver/

Tools.cpp

28 lines

Frontend/

CMakeLists.txt

1 line

CompilerInvocation.cpp

12 lines

OptReport.cpp

123 lines

FrontendTool/

ExecuteCompilerInvocation.cpp

4 lines

test/

CodeGen/

opt-report.c

31 lines

Driver/

opt-report.c

9 lines

Diff 55907

include/clang/Driver/CC1Options.td

Show First 20 Lines • Show All 477 Lines • ▼ Show 20 Lines	def mt_migrate_directory : Separate<["-"], "mt-migrate-directory">,
HelpText<"Directory for temporary files produced during ARC or ObjC migration">;		HelpText<"Directory for temporary files produced during ARC or ObjC migration">;
def arcmt_check : Flag<["-"], "arcmt-check">,		def arcmt_check : Flag<["-"], "arcmt-check">,
HelpText<"Check for ARC migration issues that need manual handling">;		HelpText<"Check for ARC migration issues that need manual handling">;
def arcmt_modify : Flag<["-"], "arcmt-modify">,		def arcmt_modify : Flag<["-"], "arcmt-modify">,
HelpText<"Apply modifications to files to conform to ARC">;		HelpText<"Apply modifications to files to conform to ARC">;
def arcmt_migrate : Flag<["-"], "arcmt-migrate">,		def arcmt_migrate : Flag<["-"], "arcmt-migrate">,
HelpText<"Apply modifications and produces temporary files that conform to ARC">;		HelpText<"Apply modifications and produces temporary files that conform to ARC">;

		def opt_report_file : Separate<["-"], "opt-report-file">,
		HelpText<"File name to use for optimization listing output">;

def print_stats : Flag<["-"], "print-stats">,		def print_stats : Flag<["-"], "print-stats">,
HelpText<"Print performance metrics and statistics">;		HelpText<"Print performance metrics and statistics">;
def fdump_record_layouts : Flag<["-"], "fdump-record-layouts">,		def fdump_record_layouts : Flag<["-"], "fdump-record-layouts">,
HelpText<"Dump record layout information">;		HelpText<"Dump record layout information">;
def fdump_record_layouts_simple : Flag<["-"], "fdump-record-layouts-simple">,		def fdump_record_layouts_simple : Flag<["-"], "fdump-record-layouts-simple">,
HelpText<"Dump record layout information in a simple form used for testing">;		HelpText<"Dump record layout information in a simple form used for testing">;
def fix_what_you_can : Flag<["-"], "fix-what-you-can">,		def fix_what_you_can : Flag<["-"], "fix-what-you-can">,
HelpText<"Apply fix-it advice even in the presence of unfixable errors">;		HelpText<"Apply fix-it advice even in the presence of unfixable errors">;
▲ Show 20 Lines • Show All 242 Lines • Show Last 20 Lines

include/clang/Driver/Options.td

Show First 20 Lines • Show All 1,072 Lines • ▼ Show 20 Lines	def fsyntax_only : Flag<["-"], "fsyntax-only">,
Flags<[DriverOption,CoreOption,CC1Option]>, Group<Action_Group>;		Flags<[DriverOption,CoreOption,CC1Option]>, Group<Action_Group>;
def ftabstop_EQ : Joined<["-"], "ftabstop=">, Group<f_Group>;		def ftabstop_EQ : Joined<["-"], "ftabstop=">, Group<f_Group>;
def ftemplate_depth_EQ : Joined<["-"], "ftemplate-depth=">, Group<f_Group>;		def ftemplate_depth_EQ : Joined<["-"], "ftemplate-depth=">, Group<f_Group>;
def ftemplate_depth_ : Joined<["-"], "ftemplate-depth-">, Group<f_Group>;		def ftemplate_depth_ : Joined<["-"], "ftemplate-depth-">, Group<f_Group>;
def ftemplate_backtrace_limit_EQ : Joined<["-"], "ftemplate-backtrace-limit=">,		def ftemplate_backtrace_limit_EQ : Joined<["-"], "ftemplate-backtrace-limit=">,
Group<f_Group>;		Group<f_Group>;
def foperator_arrow_depth_EQ : Joined<["-"], "foperator-arrow-depth=">,		def foperator_arrow_depth_EQ : Joined<["-"], "foperator-arrow-depth=">,
Group<f_Group>;		Group<f_Group>;
		def foptimization_report : Flag<["-"], "foptimization-report">, Group<f_Group>,
		HelpText<"Generate an optimization report file">;
		def fno_optimization_report : Flag<["-"], "fno-optimization-report">,
		Group<f_Group>, Flags<[NoArgumentUnused]>;
		def foptimization_report_EQ : Joined<["-"], "foptimization-report=">,Group<f_Group>,
		HelpText<"Generate an optimization report file with the specified name">;
def ftest_coverage : Flag<["-"], "ftest-coverage">, Group<f_Group>;		def ftest_coverage : Flag<["-"], "ftest-coverage">, Group<f_Group>;
def fvectorize : Flag<["-"], "fvectorize">, Group<f_Group>,		def fvectorize : Flag<["-"], "fvectorize">, Group<f_Group>,
HelpText<"Enable the loop vectorization passes">;		HelpText<"Enable the loop vectorization passes">;
def fno_vectorize : Flag<["-"], "fno-vectorize">, Group<f_Group>;		def fno_vectorize : Flag<["-"], "fno-vectorize">, Group<f_Group>;
def : Flag<["-"], "ftree-vectorize">, Alias<fvectorize>;		def : Flag<["-"], "ftree-vectorize">, Alias<fvectorize>;
def : Flag<["-"], "fno-tree-vectorize">, Alias<fno_vectorize>;		def : Flag<["-"], "fno-tree-vectorize">, Alias<fno_vectorize>;
def fslp_vectorize : Flag<["-"], "fslp-vectorize">, Group<f_Group>,		def fslp_vectorize : Flag<["-"], "fslp-vectorize">, Group<f_Group>,
HelpText<"Enable the superword-level parallelism vectorization passes">;		HelpText<"Enable the superword-level parallelism vectorization passes">;
▲ Show 20 Lines • Show All 1,101 Lines • Show Last 20 Lines

include/clang/Frontend/CompilerInstance.h

Show First 20 Lines • Show All 292 Lines • ▼ Show 20 Lines	public:

LangOptions &getLangOpts() {		LangOptions &getLangOpts() {
return *Invocation->getLangOpts();		return *Invocation->getLangOpts();
}		}
const LangOptions &getLangOpts() const {		const LangOptions &getLangOpts() const {
return *Invocation->getLangOpts();		return *Invocation->getLangOpts();
}		}

		OptReportInfo &getOptReportInfo() {
		return Invocation->getOptReportInfo();
		}
		const OptReportInfo &getOptReportInfo() const {
		return Invocation->getOptReportInfo();
		}

PreprocessorOptions &getPreprocessorOpts() {		PreprocessorOptions &getPreprocessorOpts() {
return Invocation->getPreprocessorOpts();		return Invocation->getPreprocessorOpts();
}		}
const PreprocessorOptions &getPreprocessorOpts() const {		const PreprocessorOptions &getPreprocessorOpts() const {
return Invocation->getPreprocessorOpts();		return Invocation->getPreprocessorOpts();
}		}

PreprocessorOutputOptions &getPreprocessorOutputOpts() {		PreprocessorOutputOptions &getPreprocessorOutputOpts() {
▲ Show 20 Lines • Show All 479 Lines • Show Last 20 Lines

include/clang/Frontend/CompilerInvocation.h

Show All 13 Lines
#include "clang/Basic/FileSystemOptions.h"		#include "clang/Basic/FileSystemOptions.h"
#include "clang/Basic/LangOptions.h"		#include "clang/Basic/LangOptions.h"
#include "clang/Basic/TargetOptions.h"		#include "clang/Basic/TargetOptions.h"
#include "clang/Frontend/CodeGenOptions.h"		#include "clang/Frontend/CodeGenOptions.h"
#include "clang/Frontend/DependencyOutputOptions.h"		#include "clang/Frontend/DependencyOutputOptions.h"
#include "clang/Frontend/FrontendOptions.h"		#include "clang/Frontend/FrontendOptions.h"
#include "clang/Frontend/LangStandard.h"		#include "clang/Frontend/LangStandard.h"
#include "clang/Frontend/MigratorOptions.h"		#include "clang/Frontend/MigratorOptions.h"
		#include "clang/Frontend/OptReport.h"
#include "clang/Frontend/PreprocessorOutputOptions.h"		#include "clang/Frontend/PreprocessorOutputOptions.h"
#include "clang/Lex/HeaderSearchOptions.h"		#include "clang/Lex/HeaderSearchOptions.h"
#include "clang/Lex/PreprocessorOptions.h"		#include "clang/Lex/PreprocessorOptions.h"
#include "clang/StaticAnalyzer/Core/AnalyzerOptions.h"		#include "clang/StaticAnalyzer/Core/AnalyzerOptions.h"
#include "llvm/ADT/IntrusiveRefCntPtr.h"		#include "llvm/ADT/IntrusiveRefCntPtr.h"
#include "llvm/ADT/StringMap.h"		#include "llvm/ADT/StringMap.h"
#include "llvm/ADT/StringRef.h"		#include "llvm/ADT/StringRef.h"
#include <string>		#include <string>
▲ Show 20 Lines • Show All 82 Lines • ▼ Show 20 Lines	class CompilerInvocation : public CompilerInvocationBase {
DependencyOutputOptions DependencyOutputOpts;		DependencyOutputOptions DependencyOutputOpts;

/// Options controlling file system operations.		/// Options controlling file system operations.
FileSystemOptions FileSystemOpts;		FileSystemOptions FileSystemOpts;

/// Options controlling the frontend itself.		/// Options controlling the frontend itself.
FrontendOptions FrontendOpts;		FrontendOptions FrontendOpts;

		/// Optimization-report options and state.
		OptReportInfo OptReport;

/// Options controlling preprocessed output.		/// Options controlling preprocessed output.
PreprocessorOutputOptions PreprocessorOutputOpts;		PreprocessorOutputOptions PreprocessorOutputOpts;

public:		public:
CompilerInvocation() : AnalyzerOpts(new AnalyzerOptions()) {}		CompilerInvocation() : AnalyzerOpts(new AnalyzerOptions()) {}

/// @name Utility Methods		/// @name Utility Methods
/// @{		/// @{
▲ Show 20 Lines • Show All 63 Lines • ▼ Show 20 Lines	const FileSystemOptions &getFileSystemOpts() const {
return FileSystemOpts;		return FileSystemOpts;
}		}

FrontendOptions &getFrontendOpts() { return FrontendOpts; }		FrontendOptions &getFrontendOpts() { return FrontendOpts; }
const FrontendOptions &getFrontendOpts() const {		const FrontendOptions &getFrontendOpts() const {
return FrontendOpts;		return FrontendOpts;
}		}

		OptReportInfo &getOptReportInfo() {
		return OptReport;
		}
		const OptReportInfo &getOptReportInfo() const {
		return OptReport;
		}

PreprocessorOutputOptions &getPreprocessorOutputOpts() {		PreprocessorOutputOptions &getPreprocessorOutputOpts() {
return PreprocessorOutputOpts;		return PreprocessorOutputOpts;
}		}
const PreprocessorOutputOptions &getPreprocessorOutputOpts() const {		const PreprocessorOutputOptions &getPreprocessorOutputOpts() const {
return PreprocessorOutputOpts;		return PreprocessorOutputOpts;
}		}

/// @}		/// @}
Show All 13 Lines

include/clang/Frontend/FrontendActions.h

	Show First 20 Lines • Show All 230 Lines • ▼ Show 20 Lines
	};			};

	class PrintPreprocessedAction : public PreprocessorFrontendAction {			class PrintPreprocessedAction : public PreprocessorFrontendAction {
	protected:			protected:
	void ExecuteAction() override;			void ExecuteAction() override;

	bool hasPCHSupport() const override { return true; }			bool hasPCHSupport() const override { return true; }
	};			};

	} // end namespace clang			} // end namespace clang

	#endif			#endif

include/clang/Frontend/OptReport.h

This file was added.

				//===---- OptReport.h - Clang Optimization-Report Generation ----- C++ --===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//

				#ifndef LLVM_CLANG_FRONTEND_OPTREPORT_H_
				#define LLVM_CLANG_FRONTEND_OPTREPORT_H_

				#include "clang/Basic/SourceLocation.h"
				#include "clang/Frontend/FrontendAction.h"
				#include <map>
				#include <string>

				namespace clang {
				// For each location in the source file, the common per-transformation state
				// collected.
				struct OptReportLocationItemInfo {
				bool Analyzed = false;
				bool Transformed = false;

				OptReportLocationItemInfo &operator \|= (
				const OptReportLocationItemInfo &RHS) {
				Analyzed \|= RHS.Analyzed;
				Transformed \|= RHS.Transformed;

				return *this;
				}
				};

				// The per-location information collected for producing an optimization report.
				struct OptReportLocationInfo {
				OptReportLocationItemInfo Inlined;
				OptReportLocationItemInfo Unrolled;
				OptReportLocationItemInfo Vectorized;

				OptReportLocationInfo &operator \|= (const OptReportLocationInfo &RHS) {
				Inlined \|= RHS.Inlined;
				Unrolled \|= RHS.Unrolled;
				Vectorized \|= RHS.Vectorized;

				return *this;
				}
				};

				// The parameters and accumulated state necessary to generate an optimization
				// report.
				struct OptReportInfo {
				std::string FileName;
				std::map<SourceLocation, OptReportLocationInfo> LocationInfo;
				};

				class OptReportAction : public WrapperFrontendAction {
				public:
				OptReportAction(std::unique_ptr<FrontendAction> WrappedAction)
				: WrapperFrontendAction(std::move(WrappedAction)) {}

				protected:
				void EndSourceFileAction() override;
				void GenerateReportFile();
				};
				} // end namespace clang

				#endif

lib/CodeGen/CodeGenAction.cpp

Show All 14 Lines
#include "clang/Basic/FileManager.h"		#include "clang/Basic/FileManager.h"
#include "clang/Basic/SourceManager.h"		#include "clang/Basic/SourceManager.h"
#include "clang/Basic/TargetInfo.h"		#include "clang/Basic/TargetInfo.h"
#include "clang/CodeGen/BackendUtil.h"		#include "clang/CodeGen/BackendUtil.h"
#include "clang/CodeGen/CodeGenAction.h"		#include "clang/CodeGen/CodeGenAction.h"
#include "clang/CodeGen/ModuleBuilder.h"		#include "clang/CodeGen/ModuleBuilder.h"
#include "clang/Frontend/CompilerInstance.h"		#include "clang/Frontend/CompilerInstance.h"
#include "clang/Frontend/FrontendDiagnostic.h"		#include "clang/Frontend/FrontendDiagnostic.h"
		#include "clang/Frontend/OptReport.h"
#include "clang/Lex/Preprocessor.h"		#include "clang/Lex/Preprocessor.h"
#include "llvm/ADT/SmallString.h"		#include "llvm/ADT/SmallString.h"
#include "llvm/Bitcode/ReaderWriter.h"		#include "llvm/Bitcode/ReaderWriter.h"
#include "llvm/IR/DebugInfo.h"		#include "llvm/IR/DebugInfo.h"
#include "llvm/IR/DiagnosticInfo.h"		#include "llvm/IR/DiagnosticInfo.h"
#include "llvm/IR/DiagnosticPrinter.h"		#include "llvm/IR/DiagnosticPrinter.h"
#include "llvm/IR/LLVMContext.h"		#include "llvm/IR/LLVMContext.h"
#include "llvm/IR/Module.h"		#include "llvm/IR/Module.h"
#include "llvm/IRReader/IRReader.h"		#include "llvm/IRReader/IRReader.h"
#include "llvm/Linker/Linker.h"		#include "llvm/Linker/Linker.h"
#include "llvm/Pass.h"		#include "llvm/Pass.h"
		#include "llvm/Support/Format.h"
#include "llvm/Support/MemoryBuffer.h"		#include "llvm/Support/MemoryBuffer.h"
#include "llvm/Support/SourceMgr.h"		#include "llvm/Support/SourceMgr.h"
#include "llvm/Support/Timer.h"		#include "llvm/Support/Timer.h"
#include <memory>		#include <memory>
using namespace clang;		using namespace clang;
using namespace llvm;		using namespace llvm;

namespace clang {		namespace clang {
class BackendConsumer : public ASTConsumer {		class BackendConsumer : public ASTConsumer {
virtual void anchor();		virtual void anchor();
DiagnosticsEngine &Diags;		DiagnosticsEngine &Diags;
BackendAction Action;		BackendAction Action;
const CodeGenOptions &CodeGenOpts;		const CodeGenOptions &CodeGenOpts;
const TargetOptions &TargetOpts;		const TargetOptions &TargetOpts;
const LangOptions &LangOpts;		const LangOptions &LangOpts;
		OptReportInfo &OptReport;
raw_pwrite_stream *AsmOutStream;		raw_pwrite_stream *AsmOutStream;
ASTContext *Context;		ASTContext *Context;

Timer LLVMIRGeneration;		Timer LLVMIRGeneration;

std::unique_ptr<CodeGenerator> Gen;		std::unique_ptr<CodeGenerator> Gen;

SmallVector<std::pair<unsigned, std::unique_ptr<llvm::Module>>, 4>		SmallVector<std::pair<unsigned, std::unique_ptr<llvm::Module>>, 4>
LinkModules;		LinkModules;

// This is here so that the diagnostic printer knows the module a diagnostic		// This is here so that the diagnostic printer knows the module a diagnostic
// refers to.		// refers to.
llvm::Module *CurLinkModule = nullptr;		llvm::Module *CurLinkModule = nullptr;

public:		public:
BackendConsumer(		BackendConsumer(
BackendAction Action, DiagnosticsEngine &Diags,		BackendAction Action, DiagnosticsEngine &Diags,
const HeaderSearchOptions &HeaderSearchOpts,		const HeaderSearchOptions &HeaderSearchOpts,
const PreprocessorOptions &PPOpts, const CodeGenOptions &CodeGenOpts,		const PreprocessorOptions &PPOpts, const CodeGenOptions &CodeGenOpts,
const TargetOptions &TargetOpts, const LangOptions &LangOpts,		const TargetOptions &TargetOpts, const LangOptions &LangOpts,
bool TimePasses, const std::string &InFile,		OptReportInfo &OptReport, bool TimePasses, const std::string &InFile,
const SmallVectorImpl<std::pair<unsigned, llvm::Module *>> &LinkModules,		const SmallVectorImpl<std::pair<unsigned, llvm::Module *>> &LinkModules,
raw_pwrite_stream *OS, LLVMContext &C,		raw_pwrite_stream *OS, LLVMContext &C,
CoverageSourceInfo *CoverageInfo = nullptr)		CoverageSourceInfo *CoverageInfo = nullptr)
: Diags(Diags), Action(Action), CodeGenOpts(CodeGenOpts),		: Diags(Diags), Action(Action), CodeGenOpts(CodeGenOpts),
TargetOpts(TargetOpts), LangOpts(LangOpts), AsmOutStream(OS),		TargetOpts(TargetOpts), LangOpts(LangOpts), OptReport(OptReport),
Context(nullptr), LLVMIRGeneration("LLVM IR Generation Time"),		AsmOutStream(OS), Context(nullptr),
		LLVMIRGeneration("LLVM IR Generation Time"),
Gen(CreateLLVMCodeGen(Diags, InFile, HeaderSearchOpts, PPOpts,		Gen(CreateLLVMCodeGen(Diags, InFile, HeaderSearchOpts, PPOpts,
CodeGenOpts, C, CoverageInfo)) {		CodeGenOpts, C, CoverageInfo)) {
llvm::TimePassesIsEnabled = TimePasses;		llvm::TimePassesIsEnabled = TimePasses;
for (auto &I : LinkModules)		for (auto &I : LinkModules)
this->LinkModules.push_back(		this->LinkModules.push_back(
std::make_pair(I.first, std::unique_ptr<llvm::Module>(I.second)));		std::make_pair(I.first, std::unique_ptr<llvm::Module>(I.second)));
}		}
llvm::Module *getModule() const { return Gen->GetModule(); }		llvm::Module *getModule() const { return Gen->GetModule(); }
▲ Show 20 Lines • Show All 165 Lines • ▼ Show 20 Lines	public:
void OptimizationRemarkHandler(		void OptimizationRemarkHandler(
const llvm::DiagnosticInfoOptimizationRemarkAnalysis &D);		const llvm::DiagnosticInfoOptimizationRemarkAnalysis &D);
void OptimizationRemarkHandler(		void OptimizationRemarkHandler(
const llvm::DiagnosticInfoOptimizationRemarkAnalysisFPCommute &D);		const llvm::DiagnosticInfoOptimizationRemarkAnalysisFPCommute &D);
void OptimizationRemarkHandler(		void OptimizationRemarkHandler(
const llvm::DiagnosticInfoOptimizationRemarkAnalysisAliasing &D);		const llvm::DiagnosticInfoOptimizationRemarkAnalysisAliasing &D);
void OptimizationFailureHandler(		void OptimizationFailureHandler(
const llvm::DiagnosticInfoOptimizationFailure &D);		const llvm::DiagnosticInfoOptimizationFailure &D);

		void OptimizationRemarkOptReportHandler(
		const llvm::DiagnosticInfoOptimizationBase &D, bool Transformed = false);
};		};

void BackendConsumer::anchor() {}		void BackendConsumer::anchor() {}
}		}

/// ConvertBackendLocation - Convert a location in a temporary llvm::SourceMgr		/// ConvertBackendLocation - Convert a location in a temporary llvm::SourceMgr
/// buffer to be a valid FullSourceLoc.		/// buffer to be a valid FullSourceLoc.
static FullSourceLoc ConvertBackendLocation(const llvm::SMDiagnostic &D,		static FullSourceLoc ConvertBackendLocation(const llvm::SMDiagnostic &D,
▲ Show 20 Lines • Show All 243 Lines • ▼ Show 20 Lines

void BackendConsumer::OptimizationRemarkHandler(		void BackendConsumer::OptimizationRemarkHandler(
const llvm::DiagnosticInfoOptimizationRemark &D) {		const llvm::DiagnosticInfoOptimizationRemark &D) {
// Optimization remarks are active only if the -Rpass flag has a regular		// Optimization remarks are active only if the -Rpass flag has a regular
// expression that matches the name of the pass name in \p D.		// expression that matches the name of the pass name in \p D.
if (CodeGenOpts.OptimizationRemarkPattern &&		if (CodeGenOpts.OptimizationRemarkPattern &&
CodeGenOpts.OptimizationRemarkPattern->match(D.getPassName()))		CodeGenOpts.OptimizationRemarkPattern->match(D.getPassName()))
EmitOptimizationMessage(D, diag::remark_fe_backend_optimization_remark);		EmitOptimizationMessage(D, diag::remark_fe_backend_optimization_remark);

		// Record optimization decisions for the listing file.
		OptimizationRemarkOptReportHandler(D, true);
}		}

void BackendConsumer::OptimizationRemarkHandler(		void BackendConsumer::OptimizationRemarkHandler(
const llvm::DiagnosticInfoOptimizationRemarkMissed &D) {		const llvm::DiagnosticInfoOptimizationRemarkMissed &D) {
// Missed optimization remarks are active only if the -Rpass-missed		// Missed optimization remarks are active only if the -Rpass-missed
// flag has a regular expression that matches the name of the pass		// flag has a regular expression that matches the name of the pass
// name in \p D.		// name in \p D.
if (CodeGenOpts.OptimizationRemarkMissedPattern &&		if (CodeGenOpts.OptimizationRemarkMissedPattern &&
CodeGenOpts.OptimizationRemarkMissedPattern->match(D.getPassName()))		CodeGenOpts.OptimizationRemarkMissedPattern->match(D.getPassName()))
EmitOptimizationMessage(D,		EmitOptimizationMessage(D,
diag::remark_fe_backend_optimization_remark_missed);		diag::remark_fe_backend_optimization_remark_missed);

		// Record optimization decisions for the listing file.
		OptimizationRemarkOptReportHandler(D);
}		}

void BackendConsumer::OptimizationRemarkHandler(		void BackendConsumer::OptimizationRemarkHandler(
const llvm::DiagnosticInfoOptimizationRemarkAnalysis &D) {		const llvm::DiagnosticInfoOptimizationRemarkAnalysis &D) {
// Optimization analysis remarks are active if the pass name is set to		// Optimization analysis remarks are active if the pass name is set to
// llvm::DiagnosticInfo::AlwasyPrint or if the -Rpass-analysis flag has a		// llvm::DiagnosticInfo::AlwasyPrint or if the -Rpass-analysis flag has a
// regular expression that matches the name of the pass name in \p D.		// regular expression that matches the name of the pass name in \p D.

if (D.getPassName() == llvm::DiagnosticInfo::AlwaysPrint \|\|		if (D.getPassName() == llvm::DiagnosticInfo::AlwaysPrint \|\|
(CodeGenOpts.OptimizationRemarkAnalysisPattern &&		(CodeGenOpts.OptimizationRemarkAnalysisPattern &&
CodeGenOpts.OptimizationRemarkAnalysisPattern->match(D.getPassName())))		CodeGenOpts.OptimizationRemarkAnalysisPattern->match(D.getPassName())))
EmitOptimizationMessage(		EmitOptimizationMessage(
D, diag::remark_fe_backend_optimization_remark_analysis);		D, diag::remark_fe_backend_optimization_remark_analysis);

		// Record optimization decisions for the listing file.
		OptimizationRemarkOptReportHandler(D);
}		}

void BackendConsumer::OptimizationRemarkHandler(		void BackendConsumer::OptimizationRemarkHandler(
const llvm::DiagnosticInfoOptimizationRemarkAnalysisFPCommute &D) {		const llvm::DiagnosticInfoOptimizationRemarkAnalysisFPCommute &D) {
// Optimization analysis remarks are active if the pass name is set to		// Optimization analysis remarks are active if the pass name is set to
// llvm::DiagnosticInfo::AlwasyPrint or if the -Rpass-analysis flag has a		// llvm::DiagnosticInfo::AlwasyPrint or if the -Rpass-analysis flag has a
// regular expression that matches the name of the pass name in \p D.		// regular expression that matches the name of the pass name in \p D.

if (D.getPassName() == llvm::DiagnosticInfo::AlwaysPrint \|\|		if (D.getPassName() == llvm::DiagnosticInfo::AlwaysPrint \|\|
(CodeGenOpts.OptimizationRemarkAnalysisPattern &&		(CodeGenOpts.OptimizationRemarkAnalysisPattern &&
CodeGenOpts.OptimizationRemarkAnalysisPattern->match(D.getPassName())))		CodeGenOpts.OptimizationRemarkAnalysisPattern->match(D.getPassName())))
EmitOptimizationMessage(		EmitOptimizationMessage(
D, diag::remark_fe_backend_optimization_remark_analysis_fpcommute);		D, diag::remark_fe_backend_optimization_remark_analysis_fpcommute);

		// Record optimization decisions for the listing file.
		OptimizationRemarkOptReportHandler(D);
}		}

void BackendConsumer::OptimizationRemarkHandler(		void BackendConsumer::OptimizationRemarkHandler(
const llvm::DiagnosticInfoOptimizationRemarkAnalysisAliasing &D) {		const llvm::DiagnosticInfoOptimizationRemarkAnalysisAliasing &D) {
// Optimization analysis remarks are active if the pass name is set to		// Optimization analysis remarks are active if the pass name is set to
// llvm::DiagnosticInfo::AlwasyPrint or if the -Rpass-analysis flag has a		// llvm::DiagnosticInfo::AlwasyPrint or if the -Rpass-analysis flag has a
// regular expression that matches the name of the pass name in \p D.		// regular expression that matches the name of the pass name in \p D.

if (D.getPassName() == llvm::DiagnosticInfo::AlwaysPrint \|\|		if (D.getPassName() == llvm::DiagnosticInfo::AlwaysPrint \|\|
(CodeGenOpts.OptimizationRemarkAnalysisPattern &&		(CodeGenOpts.OptimizationRemarkAnalysisPattern &&
CodeGenOpts.OptimizationRemarkAnalysisPattern->match(D.getPassName())))		CodeGenOpts.OptimizationRemarkAnalysisPattern->match(D.getPassName())))
EmitOptimizationMessage(		EmitOptimizationMessage(
D, diag::remark_fe_backend_optimization_remark_analysis_aliasing);		D, diag::remark_fe_backend_optimization_remark_analysis_aliasing);

		// Record optimization decisions for the listing file.
		OptimizationRemarkOptReportHandler(D);
}		}

void BackendConsumer::OptimizationFailureHandler(		void BackendConsumer::OptimizationFailureHandler(
const llvm::DiagnosticInfoOptimizationFailure &D) {		const llvm::DiagnosticInfoOptimizationFailure &D) {
EmitOptimizationMessage(D, diag::warn_fe_backend_optimization_failure);		EmitOptimizationMessage(D, diag::warn_fe_backend_optimization_failure);
}		}

		void BackendConsumer::OptimizationRemarkOptReportHandler(
		const llvm::DiagnosticInfoOptimizationBase &D, bool Transformed) {
		if (OptReport.FileName.empty() \|\| !D.isLocationAvailable())
		return;

		SourceManager &SourceMgr = Context->getSourceManager();
		FileManager &FileMgr = SourceMgr.getFileManager();

		StringRef Filename;
		unsigned Line, Column;
		D.getLocation(&Filename, &Line, &Column);
		const FileEntry *FE = FileMgr.getFile(Filename);
		if (!FE \|\| !Line)
		return;

		// If -gcolumn-info was not used, Column will be 0. This upsets the
		// source manager, so pass 1 if Column is not set.
		SourceLocation DILoc =
		SourceMgr.translateFileLineCol(FE, Line, Column ? Column : 1);
		if (DILoc.isInvalid())
		return;

		// We track information on both actual and potential transformations. This
		// way, if there are multiple possible things on a line that are, or could
		// have been transformed, we can indicate that explicitly in the output.
		auto UpdateLLII = [Transformed](OptReportLocationItemInfo &LLII) {
		LLII.Analyzed = true;
		if (Transformed)
		LLII.Transformed = true;
		};

		// FIXME: The backend should use proper diagnostic subclasses here,
		// and we should match those instead of looking at the pass name.
		StringRef PassName = D.getPassName();
		if (PassName == "inline")
		UpdateLLII(OptReport.LocationInfo[DILoc].Inlined);
		else if (PassName == "loop-unroll")
		UpdateLLII(OptReport.LocationInfo[DILoc].Unrolled);
		else if (PassName == "loop-vectorize")
		UpdateLLII(OptReport.LocationInfo[DILoc].Vectorized);
		}

/// \brief This function is invoked when the backend needs		/// \brief This function is invoked when the backend needs
/// to report something to the user.		/// to report something to the user.
void BackendConsumer::DiagnosticHandlerImpl(const DiagnosticInfo &DI) {		void BackendConsumer::DiagnosticHandlerImpl(const DiagnosticInfo &DI) {
unsigned DiagID = diag::err_fe_inline_asm;		unsigned DiagID = diag::err_fe_inline_asm;
llvm::DiagnosticSeverity Severity = DI.getSeverity();		llvm::DiagnosticSeverity Severity = DI.getSeverity();
// Get the diagnostic ID based.		// Get the diagnostic ID based.
switch (DI.getKind()) {		switch (DI.getKind()) {
case llvm::DK_InlineAsm:		case llvm::DK_InlineAsm:
if (InlineAsmDiagHandler(cast<DiagnosticInfoInlineAsm>(DI)))		if (InlineAsmDiagHandler(cast<DiagnosticInfoInlineAsm>(DI)))
return;		return;
ComputeDiagID(Severity, inline_asm, DiagID);		ComputeDiagID(Severity, inline_asm, DiagID);
break;		break;
case llvm::DK_StackSize:		case llvm::DK_StackSize:
if (StackSizeDiagHandler(cast<DiagnosticInfoStackSize>(DI)))		if (StackSizeDiagHandler(cast<DiagnosticInfoStackSize>(DI)))
return;		return;
ComputeDiagID(Severity, backend_frame_larger_than, DiagID);		ComputeDiagID(Severity, backend_frame_larger_than, DiagID);
break;		break;
case DK_Linker:		case DK_Linker:
assert(CurLinkModule);		assert(CurLinkModule);
// FIXME: stop eating the warnings and notes.		// FIXME: stop eating the warnings and notes.
if (Severity != DS_Error)		if (Severity != DS_Error)
return;		return;
DiagID = diag::err_fe_cannot_link_module;		DiagID = diag::err_fe_cannot_link_module;
break;		break;
case llvm::DK_OptimizationRemark:		case llvm::DK_OptimizationRemark:
// Optimization remarks are always handled completely by this		// Optimization remarks are always handled completely by this
// handler. There is no generic way of emitting them.		// handler. There is no generic way of emitting them.
OptimizationRemarkHandler(cast<DiagnosticInfoOptimizationRemark>(DI));		OptimizationRemarkHandler(cast<DiagnosticInfoOptimizationRemark>(DI));
return;		return;
case llvm::DK_OptimizationRemarkMissed:		case llvm::DK_OptimizationRemarkMissed:
// Optimization remarks are always handled completely by this		// Optimization remarks are always handled completely by this
// handler. There is no generic way of emitting them.		// handler. There is no generic way of emitting them.
OptimizationRemarkHandler(cast<DiagnosticInfoOptimizationRemarkMissed>(DI));		OptimizationRemarkHandler(cast<DiagnosticInfoOptimizationRemarkMissed>(DI));
return;		return;
case llvm::DK_OptimizationRemarkAnalysis:		case llvm::DK_OptimizationRemarkAnalysis:
// Optimization remarks are always handled completely by this		// Optimization remarks are always handled completely by this
// handler. There is no generic way of emitting them.		// handler. There is no generic way of emitting them.
OptimizationRemarkHandler(		OptimizationRemarkHandler(
cast<DiagnosticInfoOptimizationRemarkAnalysis>(DI));		cast<DiagnosticInfoOptimizationRemarkAnalysis>(DI));
return;		return;
case llvm::DK_OptimizationRemarkAnalysisFPCommute:		case llvm::DK_OptimizationRemarkAnalysisFPCommute:
// Optimization remarks are always handled completely by this		// Optimization remarks are always handled completely by this
// handler. There is no generic way of emitting them.		// handler. There is no generic way of emitting them.
OptimizationRemarkHandler(		OptimizationRemarkHandler(
cast<DiagnosticInfoOptimizationRemarkAnalysisFPCommute>(DI));		cast<DiagnosticInfoOptimizationRemarkAnalysisFPCommute>(DI));
return;		return;
case llvm::DK_OptimizationRemarkAnalysisAliasing:		case llvm::DK_OptimizationRemarkAnalysisAliasing:
// Optimization remarks are always handled completely by this		// Optimization remarks are always handled completely by this
// handler. There is no generic way of emitting them.		// handler. There is no generic way of emitting them.
OptimizationRemarkHandler(		OptimizationRemarkHandler(
cast<DiagnosticInfoOptimizationRemarkAnalysisAliasing>(DI));		cast<DiagnosticInfoOptimizationRemarkAnalysisAliasing>(DI));
return;		return;
case llvm::DK_OptimizationFailure:		case llvm::DK_OptimizationFailure:
// Optimization failures are always handled completely by this		// Optimization failures are always handled completely by this
// handler.		// handler.
OptimizationFailureHandler(cast<DiagnosticInfoOptimizationFailure>(DI));		OptimizationFailureHandler(cast<DiagnosticInfoOptimizationFailure>(DI));
return;		return;
case llvm::DK_Unsupported:		case llvm::DK_Unsupported:
UnsupportedDiagHandler(cast<DiagnosticInfoUnsupported>(DI));		UnsupportedDiagHandler(cast<DiagnosticInfoUnsupported>(DI));
return;		return;
default:		default:
// Plugin IDs are not bound to any value as they are set dynamically.		// Plugin IDs are not bound to any value as they are set dynamically.
ComputeDiagRemarkID(Severity, backend_plugin, DiagID);		ComputeDiagRemarkID(Severity, backend_plugin, DiagID);
break;		break;
}		}
std::string MsgStorage;		std::string MsgStorage;
{		{
raw_string_ostream Stream(MsgStorage);		raw_string_ostream Stream(MsgStorage);
DiagnosticPrinterRawOStream DP(Stream);		DiagnosticPrinterRawOStream DP(Stream);
DI.print(DP);		DI.print(DP);
}		}

if (DiagID == diag::err_fe_cannot_link_module) {		if (DiagID == diag::err_fe_cannot_link_module) {
		anemetUnsubmitted Not Done Reply Inline Actions Should the abbreviation be somehow part of the optimization remark API and passed in just like the pass name? It would be nice if someone added optimization remark for a new opt, it would show up here automatically. I could see how that could make the output too busy but at least have the option? anemet: Should the abbreviation be somehow part of the optimization remark API and passed in just like…
		hfinkelAuthorUnsubmitted Not Done Reply Inline Actions So long as we're careful in the backend to respect the limited visual real estate and namespace in this kind of report, we could have the optimizations themselves provide a letter. I'm undecided. hfinkel: So long as we're careful in the backend to respect the limited visual real estate and namespace…
Diags.Report(diag::err_fe_cannot_link_module)		Diags.Report(diag::err_fe_cannot_link_module)
<< CurLinkModule->getModuleIdentifier() << MsgStorage;		<< CurLinkModule->getModuleIdentifier() << MsgStorage;
return;		return;
}		}

// Report the backend message using the usual diagnostic mechanism.		// Report the backend message using the usual diagnostic mechanism.
FullSourceLoc Loc;		FullSourceLoc Loc;
Diags.Report(Loc, DiagID).AddString(MsgStorage);		Diags.Report(Loc, DiagID).AddString(MsgStorage);
}		}
#undef ComputeDiagID		#undef ComputeDiagID

CodeGenAction::CodeGenAction(unsigned _Act, LLVMContext *_VMContext)		CodeGenAction::CodeGenAction(unsigned _Act, LLVMContext *_VMContext)
: Act(_Act), VMContext(_VMContext ? _VMContext : new LLVMContext),		: Act(_Act), VMContext(_VMContext ? _VMContext : new LLVMContext),
OwnsVMContext(!_VMContext) {}		OwnsVMContext(!_VMContext) {}

CodeGenAction::~CodeGenAction() {		CodeGenAction::~CodeGenAction() {
TheModule.reset();		TheModule.reset();
if (OwnsVMContext)		if (OwnsVMContext)
delete VMContext;		delete VMContext;
}		}

bool CodeGenAction::hasIRSupport() const { return true; }		bool CodeGenAction::hasIRSupport() const { return true; }

void CodeGenAction::EndSourceFileAction() {		void CodeGenAction::EndSourceFileAction() {
		rsmithUnsubmitted Not Done Reply Inline Actions I'd like this to be factored out and moved somewhere more appropriate (such as Frontend). It seems appropriate for CodeGen to generate the data structure here, but it should not be deciding how to format the report nor doing file IO to put it somewhere. I would hope that we can combine this report information with the static analyzer's existing support for generating syntax-highlighted, annotated source code as HTML as a future extension. rsmith: I'd like this to be factored out and moved somewhere more appropriate (such as Frontend). It…
		hfinkelAuthorUnsubmitted Not Done Reply Inline Actions I'd like this to be factored out and moved somewhere more appropriate (such as Frontend). It seems appropriate for CodeGen to generate the data structure here, but it should not be deciding how to format the report nor doing file IO to put it somewhere. Makes sense. I would hope that we can combine this report information with the static analyzer's existing support for generating syntax-highlighted, annotated source code as HTML as a future extension. I like this idea. hfinkel: > I'd like this to be factored out and moved somewhere more appropriate (such as Frontend). It…
// If the consumer creation failed, do nothing.		// If the consumer creation failed, do nothing.
if (!getCompilerInstance().hasASTConsumer())		if (!getCompilerInstance().hasASTConsumer())
return;		return;

// Take back ownership of link modules we passed to consumer.		// Take back ownership of link modules we passed to consumer.
if (!LinkModules.empty())		if (!LinkModules.empty())
BEConsumer->releaseLinkModules();		BEConsumer->releaseLinkModules();

▲ Show 20 Lines • Show All 67 Lines • ▼ Show 20 Lines	if (CI.getCodeGenOpts().CoverageMapping) {
CoverageInfo = new CoverageSourceInfo;		CoverageInfo = new CoverageSourceInfo;
CI.getPreprocessor().addPPCallbacks(		CI.getPreprocessor().addPPCallbacks(
std::unique_ptr<PPCallbacks>(CoverageInfo));		std::unique_ptr<PPCallbacks>(CoverageInfo));
}		}

std::unique_ptr<BackendConsumer> Result(new BackendConsumer(		std::unique_ptr<BackendConsumer> Result(new BackendConsumer(
BA, CI.getDiagnostics(), CI.getHeaderSearchOpts(),		BA, CI.getDiagnostics(), CI.getHeaderSearchOpts(),
CI.getPreprocessorOpts(), CI.getCodeGenOpts(), CI.getTargetOpts(),		CI.getPreprocessorOpts(), CI.getCodeGenOpts(), CI.getTargetOpts(),
CI.getLangOpts(), CI.getFrontendOpts().ShowTimers, InFile, LinkModules,		CI.getLangOpts(), CI.getOptReportInfo(), CI.getFrontendOpts().ShowTimers,
OS, *VMContext, CoverageInfo));		InFile, LinkModules, OS, *VMContext, CoverageInfo));
BEConsumer = Result.get();		BEConsumer = Result.get();
return std::move(Result);		return std::move(Result);
}		}

static void BitcodeInlineAsmDiagHandler(const llvm::SMDiagnostic &SM,		static void BitcodeInlineAsmDiagHandler(const llvm::SMDiagnostic &SM,
void *Context,		void *Context,
unsigned LocCookie) {		unsigned LocCookie) {
SM.print(nullptr, llvm::errs());		SM.print(nullptr, llvm::errs());
▲ Show 20 Lines • Show All 109 Lines • Show Last 20 Lines

lib/Driver/Tools.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

	Show First 20 Lines • Show All 3,135 Lines • ▼ Show 20 Lines
	static void addDebugCompDirArg(const ArgList &Args, ArgStringList &CmdArgs) {			static void addDebugCompDirArg(const ArgList &Args, ArgStringList &CmdArgs) {
	SmallString<128> cwd;			SmallString<128> cwd;
	if (!llvm::sys::fs::current_path(cwd)) {			if (!llvm::sys::fs::current_path(cwd)) {
	CmdArgs.push_back("-fdebug-compilation-dir");			CmdArgs.push_back("-fdebug-compilation-dir");
	CmdArgs.push_back(Args.MakeArgString(cwd));			CmdArgs.push_back(Args.MakeArgString(cwd));
	}			}
	}			}

	static const char *SplitDebugName(const ArgList &Args, const InputInfo &Input) {			static const char *getAltExtOutputName(const ArgList &Args,
				const InputInfo &Input,
				const char *Ext) {
	Arg *FinalOutput = Args.getLastArg(options::OPT_o);			Arg *FinalOutput = Args.getLastArg(options::OPT_o);
	if (FinalOutput && Args.hasArg(options::OPT_c)) {			if (FinalOutput && Args.hasArg(options::OPT_c)) {
	SmallString<128> T(FinalOutput->getValue());			SmallString<128> T(FinalOutput->getValue());
	llvm::sys::path::replace_extension(T, "dwo");			llvm::sys::path::replace_extension(T, Ext);
	return Args.MakeArgString(T);			return Args.MakeArgString(T);
	} else {			} else {
	// Use the compilation dir.			// Use the compilation dir.
	SmallString<128> T(			SmallString<128> T(
	Args.getLastArgValue(options::OPT_fdebug_compilation_dir));			Args.getLastArgValue(options::OPT_fdebug_compilation_dir));
	SmallString<128> F(llvm::sys::path::stem(Input.getBaseInput()));			SmallString<128> F(llvm::sys::path::stem(Input.getBaseInput()));
	llvm::sys::path::replace_extension(F, "dwo");			llvm::sys::path::replace_extension(F, Ext);
	T += F;			T += F;
	return Args.MakeArgString(F);			return Args.MakeArgString(F);
	}			}
	}			}

				static const char *SplitDebugName(const ArgList &Args, const InputInfo &Input) {
				return getAltExtOutputName(Args, Input, "dwo");
				}

	static void SplitDebugInfo(const ToolChain &TC, Compilation &C, const Tool &T,			static void SplitDebugInfo(const ToolChain &TC, Compilation &C, const Tool &T,
	const JobAction &JA, const ArgList &Args,			const JobAction &JA, const ArgList &Args,
	const InputInfo &Output, const char *OutFile) {			const InputInfo &Output, const char *OutFile) {
	ArgStringList ExtractArgs;			ArgStringList ExtractArgs;
	ExtractArgs.push_back("--extract-dwo");			ExtractArgs.push_back("--extract-dwo");

	ArgStringList StripArgs;			ArgStringList StripArgs;
	StripArgs.push_back("--strip-dwo");			StripArgs.push_back("--strip-dwo");

	// Grabbing the output of the earlier compile step.			// Grabbing the output of the earlier compile step.
	StripArgs.push_back(Output.getFilename());			StripArgs.push_back(Output.getFilename());
	ExtractArgs.push_back(Output.getFilename());			ExtractArgs.push_back(Output.getFilename());
	ExtractArgs.push_back(OutFile);			ExtractArgs.push_back(OutFile);

	const char *Exec = Args.MakeArgString(TC.GetProgramPath("objcopy"));			const char *Exec = Args.MakeArgString(TC.GetProgramPath("objcopy"));
	InputInfo II(types::TY_Object, Output.getFilename(), Output.getFilename());			InputInfo II(types::TY_Object, Output.getFilename(), Output.getFilename());

	// First extract the dwo sections.			// First extract the dwo sections.
	C.addCommand(llvm::make_unique<Command>(JA, T, Exec, ExtractArgs, II));			C.addCommand(llvm::make_unique<Command>(JA, T, Exec, ExtractArgs, II));

	// Then remove them from the original .o file.			// Then remove them from the original .o file.
	C.addCommand(llvm::make_unique<Command>(JA, T, Exec, StripArgs, II));			C.addCommand(llvm::make_unique<Command>(JA, T, Exec, StripArgs, II));
	}			}

				static const char *getOptReportName(const ArgList &Args, const InputInfo &Input) {
				return getAltExtOutputName(Args, Input, "lst");
				}

	/// \brief Vectorize at all optimization levels greater than 1 except for -Oz.			/// \brief Vectorize at all optimization levels greater than 1 except for -Oz.
	/// For -Oz the loop vectorizer is disable, while the slp vectorizer is enabled.			/// For -Oz the loop vectorizer is disable, while the slp vectorizer is enabled.
	static bool shouldEnableVectorizerAtOLevel(const ArgList &Args, bool isSlpVec) {			static bool shouldEnableVectorizerAtOLevel(const ArgList &Args, bool isSlpVec) {
	if (Arg *A = Args.getLastArg(options::OPT_O_Group)) {			if (Arg *A = Args.getLastArg(options::OPT_O_Group)) {
	if (A->getOption().matches(options::OPT_O4) \|\|			if (A->getOption().matches(options::OPT_O4) \|\|
	A->getOption().matches(options::OPT_Ofast))			A->getOption().matches(options::OPT_Ofast))
	return true;			return true;

	▲ Show 20 Lines • Show All 2,452 Lines • ▼ Show 20 Lines

	// le32-specific flags:			// le32-specific flags:
	// -fno-math-builtin: clang should not convert math builtins to intrinsics			// -fno-math-builtin: clang should not convert math builtins to intrinsics
	// by default.			// by default.
	if (getToolChain().getArch() == llvm::Triple::le32) {			if (getToolChain().getArch() == llvm::Triple::le32) {
	CmdArgs.push_back("-fno-math-builtin");			CmdArgs.push_back("-fno-math-builtin");
	}			}

				if (Args.hasFlag(options::OPT_foptimization_report,
				options::OPT_foptimization_report_EQ,
				options::OPT_fno_optimization_report, false)) {
				CmdArgs.push_back("-opt-report-file");

				const Arg *A = Args.getLastArg(options::OPT_foptimization_report_EQ);
				if (A)
				CmdArgs.push_back(A->getValue());
				else
				CmdArgs.push_back(getOptReportName(Args, Input));
				}

	// Default to -fno-builtin-str{cat,cpy} on Darwin for ARM.			// Default to -fno-builtin-str{cat,cpy} on Darwin for ARM.
	//			//
	// FIXME: Now that PR4941 has been fixed this can be enabled.			// FIXME: Now that PR4941 has been fixed this can be enabled.
	#if 0			#if 0
	if (getToolChain().getTriple().isOSDarwin() &&			if (getToolChain().getTriple().isOSDarwin() &&
	(getToolChain().getArch() == llvm::Triple::arm \|\|			(getToolChain().getArch() == llvm::Triple::arm \|\|
	getToolChain().getArch() == llvm::Triple::thumb)) {			getToolChain().getArch() == llvm::Triple::thumb)) {
	if (!Args.hasArg(options::OPT_fbuiltin_strcat))			if (!Args.hasArg(options::OPT_fbuiltin_strcat))
	▲ Show 20 Lines • Show All 5,400 Lines • Show Last 20 Lines

lib/Frontend/CMakeLists.txt

Show All 26 Lines	add_clang_library(clangFrontend
HeaderIncludeGen.cpp		HeaderIncludeGen.cpp
InitHeaderSearch.cpp		InitHeaderSearch.cpp
InitPreprocessor.cpp		InitPreprocessor.cpp
LangStandards.cpp		LangStandards.cpp
LayoutOverrideSource.cpp		LayoutOverrideSource.cpp
LogDiagnosticPrinter.cpp		LogDiagnosticPrinter.cpp
ModuleDependencyCollector.cpp		ModuleDependencyCollector.cpp
MultiplexConsumer.cpp		MultiplexConsumer.cpp
		OptReport.cpp
PCHContainerOperations.cpp		PCHContainerOperations.cpp
PrintPreprocessedOutput.cpp		PrintPreprocessedOutput.cpp
SerializedDiagnosticPrinter.cpp		SerializedDiagnosticPrinter.cpp
SerializedDiagnosticReader.cpp		SerializedDiagnosticReader.cpp
TestModuleFileExtension.cpp		TestModuleFileExtension.cpp
TextDiagnostic.cpp		TextDiagnostic.cpp
TextDiagnosticBuffer.cpp		TextDiagnosticBuffer.cpp
TextDiagnosticPrinter.cpp		TextDiagnosticPrinter.cpp
Show All 16 Lines

lib/Frontend/CompilerInvocation.cpp

Show First 20 Lines • Show All 409 Lines • ▼ Show 20 Lines	static void setPGOUseInstrumentor(CodeGenOptions &Opts,
std::unique_ptr<llvm::IndexedInstrProfReader> PGOReader =		std::unique_ptr<llvm::IndexedInstrProfReader> PGOReader =
std::move(ReaderOrErr.get());		std::move(ReaderOrErr.get());
if (PGOReader->isIRLevelProfile())		if (PGOReader->isIRLevelProfile())
Opts.setProfileUse(CodeGenOptions::ProfileIRInstr);		Opts.setProfileUse(CodeGenOptions::ProfileIRInstr);
else		else
Opts.setProfileUse(CodeGenOptions::ProfileClangInstr);		Opts.setProfileUse(CodeGenOptions::ProfileClangInstr);
}		}

static bool ParseCodeGenArgs(CodeGenOptions &Opts, ArgList &Args, InputKind IK,		static bool ParseCodeGenArgs(CodeGenOptions &Opts, OptReportInfo &OptReport,
		ArgList &Args, InputKind IK,
DiagnosticsEngine &Diags,		DiagnosticsEngine &Diags,
const TargetOptions &TargetOpts) {		const TargetOptions &TargetOpts) {
using namespace options;		using namespace options;
bool Success = true;		bool Success = true;
llvm::Triple Triple = llvm::Triple(TargetOpts.Triple);		llvm::Triple Triple = llvm::Triple(TargetOpts.Triple);

unsigned OptimizationLevel = getOptimizationLevel(Args, IK, Diags);		unsigned OptimizationLevel = getOptimizationLevel(Args, IK, Diags);
// TODO: This could be done in Driver		// TODO: This could be done in Driver
▲ Show 20 Lines • Show All 56 Lines • ▼ Show 20 Lines	static bool ParseCodeGenArgs(CodeGenOptions &Opts, OptReportInfo &OptReport,
Opts.EmitCodeView = Args.hasArg(OPT_gcodeview);		Opts.EmitCodeView = Args.hasArg(OPT_gcodeview);
Opts.WholeProgramVTables = Args.hasArg(OPT_fwhole_program_vtables);		Opts.WholeProgramVTables = Args.hasArg(OPT_fwhole_program_vtables);
Opts.WholeProgramVTablesBlacklistFiles =		Opts.WholeProgramVTablesBlacklistFiles =
Args.getAllArgValues(OPT_fwhole_program_vtables_blacklist_EQ);		Args.getAllArgValues(OPT_fwhole_program_vtables_blacklist_EQ);
Opts.SplitDwarfFile = Args.getLastArgValue(OPT_split_dwarf_file);		Opts.SplitDwarfFile = Args.getLastArgValue(OPT_split_dwarf_file);
Opts.DebugTypeExtRefs = Args.hasArg(OPT_dwarf_ext_refs);		Opts.DebugTypeExtRefs = Args.hasArg(OPT_dwarf_ext_refs);
Opts.DebugExplicitImport = Triple.isPS4CPU();		Opts.DebugExplicitImport = Triple.isPS4CPU();

		OptReport.FileName = Args.getLastArgValue(OPT_opt_report_file);

for (const auto &Arg : Args.getAllArgValues(OPT_fdebug_prefix_map_EQ))		for (const auto &Arg : Args.getAllArgValues(OPT_fdebug_prefix_map_EQ))
Opts.DebugPrefixMap.insert(StringRef(Arg).split('='));		Opts.DebugPrefixMap.insert(StringRef(Arg).split('='));

if (const Arg *A =		if (const Arg *A =
Args.getLastArg(OPT_emit_llvm_uselists, OPT_no_emit_llvm_uselists))		Args.getLastArg(OPT_emit_llvm_uselists, OPT_no_emit_llvm_uselists))
Opts.EmitLLVMUseLists = A->getOption().getID() == OPT_emit_llvm_uselists;		Opts.EmitLLVMUseLists = A->getOption().getID() == OPT_emit_llvm_uselists;

Opts.DisableLLVMOpts = Args.hasArg(OPT_disable_llvm_optzns);		Opts.DisableLLVMOpts = Args.hasArg(OPT_disable_llvm_optzns);
▲ Show 20 Lines • Show All 260 Lines • ▼ Show 20 Lines	static bool ParseCodeGenArgs(CodeGenOptions &Opts, OptReportInfo &OptReport,
}		}

// If the user requested to use a sample profile for PGO, then the		// If the user requested to use a sample profile for PGO, then the
// backend will need to track source location information so the profile		// backend will need to track source location information so the profile
// can be incorporated into the IR.		// can be incorporated into the IR.
if (!Opts.SampleProfileFile.empty())		if (!Opts.SampleProfileFile.empty())
NeedLocTracking = true;		NeedLocTracking = true;

		// To generate an optimization report, source location information is needed.
		if (!OptReport.FileName.empty())
		NeedLocTracking = true;

// If the user requested a flag that requires source locations available in		// If the user requested a flag that requires source locations available in
// the backend, make sure that the backend tracks source location information.		// the backend, make sure that the backend tracks source location information.
if (NeedLocTracking && Opts.getDebugInfo() == codegenoptions::NoDebugInfo)		if (NeedLocTracking && Opts.getDebugInfo() == codegenoptions::NoDebugInfo)
Opts.setDebugInfo(codegenoptions::LocTrackingOnly);		Opts.setDebugInfo(codegenoptions::LocTrackingOnly);

Opts.RewriteMapFiles = Args.getAllArgValues(OPT_frewrite_map_file);		Opts.RewriteMapFiles = Args.getAllArgValues(OPT_frewrite_map_file);

// Parse -fsanitize-recover= arguments.		// Parse -fsanitize-recover= arguments.
▲ Show 20 Lines • Show All 1,350 Lines • ▼ Show 20 Lines	bool CompilerInvocation::CreateFromArgs(CompilerInvocation &Res,
Success &= ParseMigratorArgs(Res.getMigratorOpts(), Args);		Success &= ParseMigratorArgs(Res.getMigratorOpts(), Args);
ParseDependencyOutputArgs(Res.getDependencyOutputOpts(), Args);		ParseDependencyOutputArgs(Res.getDependencyOutputOpts(), Args);
Success &= ParseDiagnosticArgs(Res.getDiagnosticOpts(), Args, &Diags);		Success &= ParseDiagnosticArgs(Res.getDiagnosticOpts(), Args, &Diags);
ParseCommentArgs(LangOpts.CommentOpts, Args);		ParseCommentArgs(LangOpts.CommentOpts, Args);
ParseFileSystemArgs(Res.getFileSystemOpts(), Args);		ParseFileSystemArgs(Res.getFileSystemOpts(), Args);
// FIXME: We shouldn't have to pass the DashX option around here		// FIXME: We shouldn't have to pass the DashX option around here
InputKind DashX = ParseFrontendArgs(Res.getFrontendOpts(), Args, Diags);		InputKind DashX = ParseFrontendArgs(Res.getFrontendOpts(), Args, Diags);
ParseTargetArgs(Res.getTargetOpts(), Args, Diags);		ParseTargetArgs(Res.getTargetOpts(), Args, Diags);
Success &= ParseCodeGenArgs(Res.getCodeGenOpts(), Args, DashX, Diags,		Success &= ParseCodeGenArgs(Res.getCodeGenOpts(), Res.getOptReportInfo(),
		Args, DashX, Diags,
Res.getTargetOpts());		Res.getTargetOpts());
ParseHeaderSearchArgs(Res.getHeaderSearchOpts(), Args);		ParseHeaderSearchArgs(Res.getHeaderSearchOpts(), Args);
if (DashX == IK_AST \|\| DashX == IK_LLVM_IR) {		if (DashX == IK_AST \|\| DashX == IK_LLVM_IR) {
// ObjCAAutoRefCount and Sanitize LangOpts are used to setup the		// ObjCAAutoRefCount and Sanitize LangOpts are used to setup the
// PassManager in BackendUtil.cpp. They need to be initializd no matter		// PassManager in BackendUtil.cpp. They need to be initializd no matter
// what the input type is.		// what the input type is.
if (Args.hasArg(OPT_fobjc_arc))		if (Args.hasArg(OPT_fobjc_arc))
LangOpts.ObjCAutoRefCount = 1;		LangOpts.ObjCAutoRefCount = 1;
▲ Show 20 Lines • Show All 253 Lines • Show Last 20 Lines

lib/Frontend/OptReport.cpp

This file was added.

				//===------------------------ OptReport.cpp -------------------------------===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//

				#include "clang/Frontend/CompilerInstance.h"
				#include "clang/Frontend/FrontendDiagnostic.h"
				#include "clang/Frontend/OptReport.h"
				#include "llvm/ADT/StringExtras.h"
				#include "llvm/Support/Format.h"

				using namespace clang;

				void OptReportAction::EndSourceFileAction() {
				GenerateReportFile();
				WrapperFrontendAction::EndSourceFileAction();
				}

				void OptReportAction::GenerateReportFile() {
				CompilerInstance &CI = getCompilerInstance();
				DiagnosticsEngine &Diags = CI.getDiagnostics();
				OptReportInfo &OptReport = CI.getOptReportInfo();
				if (OptReport.FileName.empty())
				return;

				std::error_code EC;
				llvm::raw_fd_ostream OS(OptReport.FileName, EC,
				llvm::sys::fs::F_Text);
				if (EC) {
				Diags.Report(diag::err_fe_error_opening) << OptReport.FileName <<
				EC.message();
				return;
				}

				SourceManager &SourceMgr = CI.getSourceManager();
				std::set<FileID> FileIDs;
				for (auto &I : OptReport.LocationInfo)
				FileIDs.insert(SourceMgr.getFileID(I.first));

				for (auto &FID : FileIDs) {
				SourceLocation FirstLoc = SourceMgr.getLocForStartOfFile(FID);
				OS << "< " << SourceMgr.getFilename(FirstLoc) << "\n";

				auto I = OptReport.LocationInfo.lower_bound(FirstLoc);
				StringRef MB = SourceMgr.getBufferData(FID);
				const SrcMgr::ContentCache *
				Content = SourceMgr.getSLocEntry(FID).getFile().getContentCache();
				unsigned LNDigits = llvm::utostr(Content->NumLines).size();
				for (unsigned L = 0; L < Content->NumLines - 1; ++L) {
				unsigned LStartOff = Content->SourceLineCache[L];
				unsigned LEndOff = (L == Content->NumLines) ?
				Content->getSize() :
				Content->SourceLineCache[L + 1];

				std::map<unsigned, OptReportLocationInfo> ColsInfo;
				unsigned InlinedCols = 0, UnrolledCols = 0, VectorizedCols = 0;

				OptReportLocationInfo LLI;
				if (I != OptReport.LocationInfo.end()) {
				auto DI = SourceMgr.getDecomposedLoc(I->first);
				while (I != OptReport.LocationInfo.end() && DI.first == FID &&
				DI.second < LStartOff) {
				++I;
				if (I != OptReport.LocationInfo.end())
				DI = SourceMgr.getDecomposedLoc(I->first);
				}

				while (I != OptReport.LocationInfo.end() && DI.first == FID &&
				DI.second >= LStartOff && DI.second < LEndOff) {
				unsigned Col = SourceMgr.getColumnNumber(FID, DI.second);
				ColsInfo[Col] = I->second;
				InlinedCols += I->second.Inlined.Analyzed;
				UnrolledCols += I->second.Unrolled.Analyzed;
				VectorizedCols += I->second.Vectorized.Analyzed;
				LLI \|= I->second;

				++I;
				if (I != OptReport.LocationInfo.end())
				DI = SourceMgr.getDecomposedLoc(I->first);
				}
				}

				// We try to keep the output as concise as possible. If only one thing on
				// a given line could have been inlined, vectorized, etc. then we can put
				// the marker on the source line itself. If there are multiple options
				// then we want to distinguish them by placing the marker for each
				// transformation on a separate line following the source line. When we
				// do this, we use a '^' character to point to the appropriate column in
				// the source line.

				OS << llvm::format_decimal(L + 1, LNDigits) << " ";
				OS << (LLI.Inlined.Transformed && InlinedCols < 2 ? "I" : " ");
				OS << (LLI.Unrolled.Transformed && UnrolledCols < 2 ? "U" : " ");
				OS << (LLI.Vectorized.Transformed && VectorizedCols < 2 ? "V" : " ");

				OS << " \| " << MB.slice(LStartOff, LEndOff);

				for (auto &J : ColsInfo) {
				if ((J.second.Inlined.Transformed && InlinedCols > 1) \|\|
				(J.second.Unrolled.Transformed && UnrolledCols > 1) \|\|
				(J.second.Vectorized.Transformed && VectorizedCols > 1)) {
				OS << std::string(LNDigits + 1, ' ');
				OS << (J.second.Inlined.Transformed &&
				InlinedCols > 1 ? "I" : " ");
				OS << (J.second.Unrolled.Transformed &&
				UnrolledCols > 1 ? "U" : " ");
				OS << (J.second.Vectorized.Transformed &&
				VectorizedCols > 1 ? "V" : " ");

				OS << " \| " << std::string(J.first - 1, ' ') << "^\n";
				}
				}

				if (LEndOff == Content->getSize())
				OS << "\n";
				}
				}
				}

lib/FrontendTool/ExecuteCompilerInvocation.cpp

	Show First 20 Lines • Show All 159 Lines • ▼ Show 20 Lines
	#endif			#endif

	// If there are any AST files to merge, create a frontend action			// If there are any AST files to merge, create a frontend action
	// adaptor to perform the merge.			// adaptor to perform the merge.
	if (!FEOpts.ASTMergeFiles.empty())			if (!FEOpts.ASTMergeFiles.empty())
	Act = llvm::make_unique<ASTMergeAction>(std::move(Act),			Act = llvm::make_unique<ASTMergeAction>(std::move(Act),
	FEOpts.ASTMergeFiles);			FEOpts.ASTMergeFiles);

				// If an optimization report is requested, generate this after compilation.
				if (!CI.getOptReportInfo().FileName.empty())
				Act = llvm::make_unique<OptReportAction>(std::move(Act));

	return Act;			return Act;
	}			}

	bool clang::ExecuteCompilerInvocation(CompilerInstance *Clang) {			bool clang::ExecuteCompilerInvocation(CompilerInstance *Clang) {
	// Honor -help.			// Honor -help.
	if (Clang->getFrontendOpts().ShowHelp) {			if (Clang->getFrontendOpts().ShowHelp) {
	std::unique_ptr<OptTable> Opts(driver::createDriverOptTable());			std::unique_ptr<OptTable> Opts(driver::createDriverOptTable());
	Opts->PrintHelp(llvm::outs(), "clang -cc1",			Opts->PrintHelp(llvm::outs(), "clang -cc1",
	▲ Show 20 Lines • Show All 70 Lines • Show Last 20 Lines

test/CodeGen/opt-report.c

This file was added.

				// RUN: %clang_cc1 -O3 -triple x86_64-unknown-linux-gnu -target-cpu x86-64 %s -o %t -dwarf-column-info -opt-report-file %t.lst -emit-obj
				// RUN: cat %t.lst \| FileCheck %s
				// REQUIRES: x86-registered-target

				void bar();
				void foo() { bar(); }

				void Test(int res, int c, int d, int p, int n) {
				int i;

				#pragma clang loop vectorize(assume_safety)
				for (i = 0; i < 1600; i++) {
				res[i] = (p[i] == 0) ? res[i] : res[i] + d[i];
				}

				// CHECK: {{[0-9]+}} \| #pragma clang loop vectorize(assume_safety)
				// CHECK: {{[0-9]+}} V \| for (i = 0; i < 1600; i++) {

				for (i = 0; i < 16; i++) {
				res[i] = (p[i] == 0) ? res[i] : res[i] + d[i];
				}

				foo();
				// CHECK: {{[0-9]+}} I \| foo();

				foo(); bar(); foo();
				// CHECK: {{[0-9]+}} \| foo(); bar(); foo();
				// CHECK-NEXT: I \| ^
				// CHECK-NEXT: I \| ^
				}

test/Driver/opt-report.c

This file was added.

				// RUN: %clang -### -S -o FOO -foptimization-report %s 2>&1 \| FileCheck %s
				// RUN: %clang -### -S -o FOO -foptimization-report=BAR.txt %s 2>&1 \| FileCheck %s -check-prefix=CHECK-EQ

				// CHECK: "-cc1"
				// CHECK: "-opt-report-file" "opt-report.lst"

				// CHECK-EQ: "-cc1"
				// CHECK-EQ: "-opt-report-file" "BAR.txt"

This is an archive of the discontinued LLVM Phabricator instance.

Annotated-source optimization reports (a.k.a. "listing" files)AbandonedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 55907

include/clang/Driver/CC1Options.td

include/clang/Driver/Options.td

include/clang/Frontend/CompilerInstance.h

include/clang/Frontend/CompilerInvocation.h

include/clang/Frontend/FrontendActions.h

include/clang/Frontend/OptReport.h

lib/CodeGen/CodeGenAction.cpp

lib/Driver/Tools.cpp

lib/Frontend/CMakeLists.txt

lib/Frontend/CompilerInvocation.cpp

lib/Frontend/OptReport.cpp

lib/FrontendTool/ExecuteCompilerInvocation.cpp

test/CodeGen/opt-report.c

test/Driver/opt-report.c

Annotated-source optimization reports (a.k.a. "listing" files)
AbandonedPublic