This is an archive of the discontinued LLVM Phabricator instance.

[OpenCL] Add OpenCL builtin test generator
ClosedPublic

Authored by svenvh on Mar 3 2021, 9:26 AM.

Details

Summary

Add a new clang-tblgen flag -gen-clang-opencl-builtin-tests that
generates a .cl file containing calls to every builtin function
defined in the .td input.

Example output:

...

float test6092_cos(float arg1) {
  return cos(arg1);
}
double test6093_cos(double arg1) {
  return cos(arg1);
}
half test6094_cos(half arg1) {
  return cos(arg1);
}
float2 test6095_cos(float2 arg1) {
  return cos(arg1);
}
double2 test6096_cos(double2 arg1) {
  return cos(arg1);
}

...

On the current (31 March 2020) main branch, this emitter will generate about 62k lines of test code.

The test consists of parsing a 60k line generated input file, which
takes about 90s with a debug build. This doesn't seem excessive
compared to other tests in clang/test/Headers (one of the tests
takes 120s for example).

Diff Detail

Event Timeline

svenvh created this revision.Mar 3 2021, 9:26 AM
svenvh requested review of this revision.Mar 3 2021, 9:26 AM
svenvh edited the summary of this revision. (Show Details)Mar 3 2021, 9:28 AM

If possible to add a fragment of what is generated it would be great!

``

float test6092_cos(float arg1) {
  return cos(arg1);
}
double test6093_cos(double arg1) {
  return cos(arg1);
}
half test6094_cos(half arg1) {
  return cos(arg1);
}
float2 test6095_cos(float2 arg1) {
  return cos(arg1);
}
double2 test6096_cos(double2 arg1) {
  return cos(arg1);
}
``

I was just thinking if we could combine the calls into one function to minimize the number of lines to parse? Perhaps this will make the Tablegen generator too complex?

Also would it be possible to handle optional functionality - extensions and functions available in certain versions?

svenvh added a comment.Mar 4 2021, 4:11 AM

I was just thinking if we could combine the calls into one function to minimize the number of lines to parse? Perhaps this will make the Tablegen generator too complex?

That will increase the emitter's complexity. Especially when taking into account the next point about handling optional functionality. Hard to say in advance if paying for the added emitter complexity is worth the parsing time reduction, though the parsing time reduction is probably not going to be an order of magnitude difference.

Also would it be possible to handle optional functionality - extensions and functions available in certain versions?

Yes, I think this should be possible; I already have a TODO in the emitter. It will make the resulting file bigger of course, because parts will have to be guarded with #ifdefs. But I believe it would be a good way to test extension and version handling for builtins.

That's awesome!

I'm thinking of how can you track correctness of generated built-ins according to spec.  Perhaps you can compare the number of distinct declarations for each  built-in in the spec and in tablegen and diagnose if they are not equal. This is a very straightforward, but I can't think of on other solution yet (with the absence of CTS coverage)...

That's awesome!

I'm thinking of how can you track correctness of generated built-ins according to spec.  Perhaps you can compare the number of distinct declarations for each  built-in in the spec and in tablegen and diagnose if they are not equal. This is a very straightforward, but I can't think of on other solution yet (with the absence of CTS coverage)...

The advantage of adding standard clang testing through the whole frontend invocation is that we can test all of the frontend features together to make sure that they indeed work as expected. For example there are types or macros added by clang and declarations from the opencl-base-c.h that interact with the headers declaring builtin functions so I think it would be desirable to use the standard clang testing strategy. There is also another advantage - it is more intuitive for the community to understand and maintain further.

But of course simple diffing should be fast I assume. We could look into this if we get push back on the traditional testing or we could consider it as a hybrid approach too.

I was just thinking if we could combine the calls into one function to minimize the number of lines to parse? Perhaps this will make the Tablegen generator too complex?

That will increase the emitter's complexity. Especially when taking into account the next point about handling optional functionality. Hard to say in advance if paying for the added emitter complexity is worth the parsing time reduction, though the parsing time reduction is probably not going to be an order of magnitude difference.

We could also consider generating the file with a simple emitter and then partitioning it manually before adding into the repo. But I think you are not suggesting checking it in?

Could you explain a bit more about the overall approach?

svenvh added a comment.Mar 5 2021, 7:27 AM

To be honest, I don't have a concrete picture yet of how to deploy this emitter for testing. So here are a few thoughts about the various options that come to mind:

1. Regenerate the test file every time we run check-clang and use it to validate the -fdeclare-opencl-builtins option.

Advantage:

  • This gives confidence that each builtin described in OpenCLBuiltins.td is usable in OpenCL code.

Disadvantages:

  • This doesn't protect against unintended removals of builtins/overloads, as the same source of truth is used for implementation and testing.
  • It is quick to generate (<1 second), but it takes some time to parse the entire test with a debug build of Clang (35 seconds).
2. As 1, but also keep maintaining opencl-c.h and test with both opencl-c.h and -fdeclare-opencl-builtins.

This addresses the disadvantage of option 1 at the expense of maintaining opencl-c.h and increased testing time.

3. Generate the test file once, manually verify that it is correct, check it in to git, and maintain it from there.

Advantage:

  • Different sources of truth for implementation (OpenCLBuiltins.td) and testing (the generated and maintained test).

Disadvantages:

  • The output is big, currently already 56k lines, and I expect it will still get a bit bigger when we add conditionals. I don't feel comfortable about putting such big autogenerated files under git (regardless of whether we add it as a single file, or as a collection of smaller files). Ideally the repo should only contain the source from which we generate things.
  • Updates to OpenCLBuiltins.td will give large diffs when regenerating the tests (regardless of whether the file is split or not).
  • Manually verifying that the generated test is actually correct is not trivial.
4. Generate a condensed summary of all builtins, check that in to git.

Then regenerate the summary during testing and diff it against the checked in reference. This builds upon @azabaznov's suggestion. We could for example use a count for each builtin and maintain that. We could maintain multiple counts to cover different configurations (e.g. CL versions).
Advantages:

  • There are only about 1000 different builtin function names, so this reduces the size of the reference data considerably, making it easier to verify the initial checkin and making maintainance easier.
  • Generating the summary and diffing it against a reference is quick (<1 second).
  • It would catch the case when builtins are accidentally removed from a configuration: if the count in the summary is not updated, the test will fail. I believe this is an important scenario to test in light of the OpenCL 3 work?

Disadvantage:

  • Not giving perfect coverage, so subtle errors might slip through, such as for example changing the argument type of a builtin.

Other ideas / feedback welcome!

Thanks for the comprehensive summary. I don't have many other suggestions in mind but I would like to add a few thoughts.

We could always consider writing the tests manually from scratch too. But it doesn't feel like a good cost/benefit trade-off.

Regarding 2 and 4 I think we should drive towards deprecation of opencl-c.h as it is a maintenance overhead but we could convert it into a test instead?

Aside from option 3 all other options will require adding non-standard testing formats in Clang. This means that if there is any core refactoring changes even outside of OpenCL affecting the functionality in the headers, it would be required to learn how we do the testing in order to address any breakages. Perhaps this can be mitigated by the documentation, but overall it is an extra burden to the community. Speaking from my personal experience I was in the situations breaking functionality outside of OpenCL and having a unified testing format is a big advantage in understanding and addressing the issues in unfamiliar code.

The output is big, currently already 56k lines, and I expect it will still get a bit bigger when we add conditionals. I don't feel comfortable about putting such big autogenerated files under git (regardless of whether we add it as a single file, or as a collection of smaller files). Ideally, the repo should only contain the source from which we generate things.

What problems do you see with checking in autogenerated files? There are some existing tests that check similar functionality and they seem to contain repetitive patterns. Honestly, I wouldn't be able to say if they were written with copy/paste or auto-generated but I am not sure whether it is very important. Of course, it would have been better if we have built up the functionality incrementally, and it is still an option. But it hasn't been possible in the past and as a matter of fact the original header was committed as one large file (around 15K lines).

Updates to OpenCLBuiltins.td will give large diffs when regenerating the tests (regardless of whether the file is split or not).

Could you provide more details. I don't follow this.

Manually verifying that the generated test is actually correct is not trivial.

I think the way to approach this would be to just assume that the test is correct and then fix any issues we discover as we go along with the development. There is nothing wrong with this though as even small manually written tests have bugs.

Regarding 2 and 4 I think we should drive towards deprecation of opencl-c.h as it is a maintenance overhead but we could convert it into a test instead?

Perhaps you misunderstood option 4, as that does not require maintaining opencl-c.h. Only for option 2 we would have to maintain opencl-c.h. For option 4, the checked in files could look something like this:

clang/test/SemaOpenCL/builtins-opencl2.0.check
; RUN: clang-tblgen -emit-opencl-builtin-count -cl-std=cl2.0 OpenCLBuiltins.td | FileCheck '%s'

; This file lists the number of builtin function declarations per builtin function name
; available in the OpenCL 2.0 language mode.

CHECK: absdiff 48
CHECK: abs 48
CHECK: acos 18
CHECK: acosh 18
CHECK: acospi 18
...

To illustrate the use, suppose a developer writes a patch that accidentally makes the double variants of acos unavailable in OpenCL 2.0. The reported number of acos builtins would be different so FileCheck would fail on the line containing acos 18. If we use option 4 in combination with option 1, a Clang developer can regenerate the test and observe the differences for the acos tests: from there it becomes obvious that the double variants of acos disappeared.

Aside from option 3 all other options will require adding non-standard testing formats in Clang. This means that if there is any core refactoring changes even outside of OpenCL affecting the functionality in the headers, it would be required to learn how we do the testing in order to address any breakages. Perhaps this can be mitigated by the documentation, but overall it is an extra burden to the community. Speaking from my personal experience I was in the situations breaking functionality outside of OpenCL and having a unified testing format is a big advantage in understanding and addressing the issues in unfamiliar code.

Do you expect this to be a problem in practice for the other proposals, and if so could you elaborate in more detail about any particular concerns for the other proposals?

What problems do you see with checking in autogenerated files? There are some existing tests that check similar functionality and they seem to contain repetitive patterns. Honestly, I wouldn't be able to say if they were written with copy/paste or auto-generated but I am not sure whether it is very important. Of course, it would have been better if we have built up the functionality incrementally, and it is still an option. But it hasn't been possible in the past and as a matter of fact the original header was committed as one large file (around 15K lines).

Updates to OpenCLBuiltins.td will give large diffs when regenerating the tests (regardless of whether the file is split or not).

Could you provide more details. I don't follow this.

There are two problems here that combine to make the problem bigger: checking in autogenerated files, and the fact that those files are large. The problem I anticipate with regenerating the test files (at least with the current generator), is that inserting a new builtin in the .td file results in renumbering of the test functions, which results in a large diff.

Other problems I see with checking in large autogenerated files:

  • Merge conflicts in the generated files when reordering/rebasing/porting patches. A single conflict in the .td description could expand to many conflicts in the autogenerated file(s).
  • Diffs in the autogenerated files clutter the output of for example git log -p.
  • Code reviews of changes to those files are harder due to the size.
  • It bloats the repository.

Disclaimer: I may be biased by my previous experience on projects that had large autogenerated files checked in, that's why I feel that we should avoid doing so if we reasonably can. I have probably revealed now that option 3 is my least favorite option. :-)

Manually verifying that the generated test is actually correct is not trivial.

I think the way to approach this would be to just assume that the test is correct and then fix any issues we discover as we go along with the development. There is nothing wrong with this though as even small manually written tests have bugs.

Fair enough.

Regarding 2 and 4 I think we should drive towards deprecation of opencl-c.h as it is a maintenance overhead but we could convert it into a test instead?

Perhaps you misunderstood option 4, as that does not require maintaining opencl-c.h. Only for option 2 we would have to maintain opencl-c.h. For option 4, the checked in files could look something like this:

clang/test/SemaOpenCL/builtins-opencl2.0.check
; RUN: clang-tblgen -emit-opencl-builtin-count -cl-std=cl2.0 OpenCLBuiltins.td | FileCheck '%s'

; This file lists the number of builtin function declarations per builtin function name
; available in the OpenCL 2.0 language mode.

CHECK: absdiff 48
CHECK: abs 48
CHECK: acos 18
CHECK: acosh 18
CHECK: acospi 18
...

To illustrate the use, suppose a developer writes a patch that accidentally makes the double variants of acos unavailable in OpenCL 2.0. The reported number of acos builtins would be different so FileCheck would fail on the line containing acos 18. If we use option 4 in combination with option 1, a Clang developer can regenerate the test and observe the differences for the acos tests: from there it becomes obvious that the double variants of acos disappeared.

Ok, I see. It is more clear now. I like that it is easy enough to understand. Still it has the issue of non-standard testing but we have mentioned it already.

Aside from option 3 all other options will require adding non-standard testing formats in Clang. This means that if there is any core refactoring changes even outside of OpenCL affecting the functionality in the headers, it would be required to learn how we do the testing in order to address any breakages. Perhaps this can be mitigated by the documentation, but overall it is an extra burden to the community. Speaking from my personal experience I was in the situations breaking functionality outside of OpenCL and having a unified testing format is a big advantage in understanding and addressing the issues in unfamiliar code.

Do you expect this to be a problem in practice for the other proposals, and if so could you elaborate in more detail about any particular concerns for the other proposals?

Not sure I understand what you mean by "other proposals" though. I think this is only an issue for the proposals that don't use regular clang invocation for testing. It is probably not a widespread problem but it could bother someone someday potentially.

What problems do you see with checking in autogenerated files? There are some existing tests that check similar functionality and they seem to contain repetitive patterns. Honestly, I wouldn't be able to say if they were written with copy/paste or auto-generated but I am not sure whether it is very important. Of course, it would have been better if we have built up the functionality incrementally, and it is still an option. But it hasn't been possible in the past and as a matter of fact the original header was committed as one large file (around 15K lines).

Updates to OpenCLBuiltins.td will give large diffs when regenerating the tests (regardless of whether the file is split or not).

Could you provide more details. I don't follow this.

There are two problems here that combine to make the problem bigger: checking in autogenerated files, and the fact that those files are large. The problem I anticipate with regenerating the test files (at least with the current generator), is that inserting a new builtin in the .td file results in renumbering of the test functions, which results in a large diff.

Other problems I see with checking in large autogenerated files:

  • Merge conflicts in the generated files when reordering/rebasing/porting patches. A single conflict in the .td description could expand to many conflicts in the autogenerated file(s).
  • Diffs in the autogenerated files clutter the output of for example git log -p.
  • Code reviews of changes to those files are harder due to the size.
  • It bloats the repository.

Disclaimer: I may be biased by my previous experience on projects that had large autogenerated files checked in, that's why I feel that we should avoid doing so if we reasonably can. I have probably revealed now that option 3 is my least favorite option. :-)

Ok, fair point. :D But however your option 3 and my option 3 are apparently not the same. I was thinking we would only generate test once and then manually modify and extend it every time new functionality is added. We could see it as if the file was written with copy-paste or perfect formatting. So every time someone would add anything in the headers they would also need to modify the test and reflect the functionality being added. So we could see the test generator as a tool to help us get started with testing. The advantage of extending the test manually is that it makes more clear what functionality is being implemented because you can see OpenCL code lines and it also separates the implementation from testing to minimize the probability of mistakes. The disadvantage is that for every few lines in Tablegen definitions we might need hundreds or thousands lines in the test.

If we instead regenerate the test every time then I agree with you that we should just generate it when the tests are build and never check it into the repo.

I have one more though.

I like the idea of turning opencl-c.h into the test: as it is already in the repo and is already being used for quite a while we can assume it as a mainline for now. I think the first step should be to test that -fdeclare-oprencl-builtins generates the same built-ins which are declared in opencl-c.h. If we can't do this programmable way we can use AST pretty-printing for tablegen header and opencl-c.h and compare these with diff (we can also do a clean-up of AST files with rm while running that tests).

Once this is done we can incrementally modify either opencl-c.h header and tablegen header. Testing changes to either of them can combine sanity check as @svenvh suggested in builtins-opencl2.0.check and diff for AST pretty printing.

Advantages:

  • Time of testing. Locally I get nice results: 
$ time ./build_release/bin/clang-check -extra-arg=-cl-std=CL2.0 --ast-print  llvm-project/clang/lib/Headers/opencl-c.h &> header.h
real    0m0.182s
user    0m0.162s
sys     0m0.020s

  But not yet clear how much time such printing will take for tablegen header. I assume it won't take a lot longer.

  • This will keep changes to opencl-h header and tablegen header consistent (until opencl-c.h will be deprecated)

 
Disadvantages:

  • Still doesn't eliminate subtle errors, need to carefully collect information from the spec about amount of the built-ins

I have one more though.

I like the idea of turning opencl-c.h into the test: as it is already in the repo and is already being used for quite a while we can assume it as a mainline for now. I think the first step should be to test that -fdeclare-oprencl-builtins generates the same built-ins which are declared in opencl-c.h. If we can't do this programmable way we can use AST pretty-printing for tablegen header and opencl-c.h and compare these with diff (we can also do a clean-up of AST files with rm while running that tests).

Once this is done we can incrementally modify either opencl-c.h header and tablegen header. Testing changes to either of them can combine sanity check as @svenvh suggested in builtins-opencl2.0.check and diff for AST pretty printing.

Advantages:

  • Time of testing. Locally I get nice results: 
$ time ./build_release/bin/clang-check -extra-arg=-cl-std=CL2.0 --ast-print  llvm-project/clang/lib/Headers/opencl-c.h &> header.h
real    0m0.182s
user    0m0.162s
sys     0m0.020s

  But not yet clear how much time such printing will take for tablegen header. I assume it won't take a lot longer.

  • This will keep changes to opencl-h header and tablegen header consistent (until opencl-c.h will be deprecated)

 
Disadvantages:

  • Still doesn't eliminate subtle errors, need to carefully collect information from the spec about amount of the built-ins

Thanks for elaborating on this option. Yes, I think this could make the testing quick. The biggest concern with this I see is that it doesn't actually show that the header is working through the clang frontend. For example, we could easily change something very significant like remove the definition of macros guarding the function declarations and not notice that it means the declarations are no longer available to the users of clang. So I would say while it covers all the builtin overloads it doesn't give the confidence that them at all are exposed correctly to the users. We could of course work around this by providing complimentary partial testing using clang invocation but it means our testing will become harder to understand.

Overall I think we have made a good progress on brainstorming here so I suggest to give the ideas broader visibility via RFC to cfe-dev. We could include the following:

  • Start from standard clang testing idea i.e. adding a large test file(s) using clang invocation and calling the functions (either written manually or autogenerated).
  • Summarize the issues with the standard approach and highlight that similar tests with other clang features (i.e. target intrinsics) show time bottlenecks.
  • Enumerate various alternative options and hybrid options to avoid adding large test files with long execution time highlighting adv/disadv for each of them.

Does this sound reasonable? Is there anything else we should include in such RFC?

svenvh updated this revision to Diff 332675.Mar 23 2021, 7:45 AM

Emit #if guards for extensions and versions.

I have done some measurements using the test produced from this Tablegen emitter (59K lines).

I have used the test it in two files:

  1. SemaOpenCL/all-std-buitins.cl that has the following RUN line appended 6 times (for every supported OpenCL version v1.0, v1.1, v1.2, v2.0, v1.3, C++)
//RUN: %clang_cc1 %s -triple=spir -fsyntax-only -verify -cl-std=CL2.0 -finclude-default-header -fdeclare-opencl-builtins
  1. SemaOpenCL/all-std-buitins-slow-header.cl that has the following RUN line appended 6 times (for every supported OpenCL version v1.0, v1.1, v1.2, v2.0, v3.0, C++)
//RUN: %clang_cc1 %s -triple=spir -fsyntax-only -verify -cl-std=CL2.0 -finclude-default-header

So I am getting the following testing time breakdown then:

201.61s: Clang :: SemaOpenCL/all-std-buitins-slow-header.cl
199.70s: Clang :: SemaOpenCL/all-std-buitins.cl
85.14s: Clang :: Headers/arm-neon-header.c
68.06s: Clang :: OpenMP/nesting_of_regions.cpp
65.23s: Clang :: Driver/crash-report.c
60.26s: Clang :: Analysis/PR24184.cpp
57.80s: Clang :: CodeGen/X86/rot-intrinsics.c
57.58s: Clang :: CodeGen/X86/x86_64-xsave.c
56.34s: Clang :: Headers/opencl-c-header.cl
55.68s: Clang :: CodeGen/X86/x86_32-xsave.c
44.83s: Clang :: Driver/crash-report-with-asserts.c
40.38s: Clang :: Lexer/SourceLocationsOverflow.c
37.44s: Clang :: Headers/x86intrin-2.c
36.53s: Clang :: OpenMP/target_teams_distribute_parallel_for_simd_codegen_registration.cpp
34.09s: Clang :: CodeGen/X86/avx512f-builtins-constrained.c
33.41s: Clang :: CodeGen/X86/sse-builtins-constrained.c
32.82s: Clang :: Analysis/iterator-modeling.cpp
31.37s: Clang :: OpenMP/target_teams_distribute_simd_codegen_registration.cpp
31.10s: Clang :: OpenMP/target_parallel_for_simd_codegen_registration.cpp
30.78s: Clang :: Analysis/use-after-move.cpp

I am very confused though about why is the difference between Tablegen and opencl-c.h so insignificant? FYI, also for a single clang invocation with Tablegen and opencl-c.h the difference is very insignificant in parsing time of this test - 20.794s vs 21.401s. This is really interesting because with small files the difference is huge 0.043s vs 3.990s on test with empty kernel.


I also timed check-clang invocation on my 8 core machine:

  1. with both tests - 697.70s
  2. with all-std-buitins.cl only - 684.43s
  3. without any new tests - 673.00s

The change in total testing time appears to be insignificant. I guess this is due to parallel execution?
Btw one thing I have thought of since OpenCL v1.0-1.1 doesn't differ a lot for builtin functions and they are not modified much either, perhaps we only need to test v1.2? That would reduce number of clang invocations to 4 in each test. Then the measurements are as follows:

134.13s: Clang :: SemaOpenCL/all-std-buitins-slow-header.cl
131.52s: Clang :: SemaOpenCL/all-std-buitins.cl
85.81s: Clang :: Headers/arm-neon-header.c
69.14s: Clang :: OpenMP/nesting_of_regions.cpp
60.08s: Clang :: Driver/crash-report.c
59.67s: Clang :: Analysis/PR24184.cpp
57.27s: Clang :: CodeGen/X86/rot-intrinsics.c
56.93s: Clang :: CodeGen/X86/x86_32-xsave.c
56.59s: Clang :: CodeGen/X86/x86_64-xsave.c
55.68s: Clang :: Headers/opencl-c-header.cl
40.71s: Clang :: Driver/crash-report-with-asserts.c
39.44s: Clang :: Lexer/SourceLocationsOverflow.c
38.02s: Clang :: OpenMP/target_teams_distribute_parallel_for_simd_codegen_registration.cpp
37.07s: Clang :: Headers/x86intrin-2.c
32.61s: Clang :: CodeGen/X86/avx512f-builtins-constrained.c
32.58s: Clang :: CodeGen/X86/sse-builtins-constrained.c
32.19s: Clang :: Analysis/use-after-move.cpp
31.96s: Clang :: Analysis/iterator-modeling.cpp
31.02s: Clang :: OpenMP/target_teams_distribute_simd_codegen_registration.cpp
30.59s: Clang :: OpenMP/target_parallel_for_simd_codegen_registration.cpp

with a total testing time 688.61s

Conclusion:

  • if we test the whole functionality the test will be at least 2x slower than the slowest clang test so far but it hardly affect the full testing time of clang-check on modern HW due to the parallel execution. Also related to this partitioning of test files could help with the latency due to the parallel execution.
  • Testing of opencl-c.h only doubles the testing time.
svenvh edited the summary of this revision. (Show Details)Mar 31 2021, 9:29 AM
svenvh updated this revision to Diff 347428.May 24 2021, 9:46 AM
svenvh edited the summary of this revision. (Show Details)

Fix formatting issues, rebase patch, add test.

I think this is a very good starting point for the testing of built-in functions. Btw should we now remove '[Draft]' from the title?

clang/utils/TableGen/ClangOpenCLBuiltinEmitter.cpp
242

Should be emit to adhere to coding style?

924

I would add a documentation for this function, it is not very obvious.

959

Maybe this should be lifted as documentation of the header of this function?

988

I don't get this comment and its exact reference to the code.

svenvh updated this revision to Diff 350314.Jun 7 2021, 8:56 AM
svenvh marked an inline comment as done.
svenvh retitled this revision from [OpenCL][Draft] Add OpenCL builtin test generator to [OpenCL] Add OpenCL builtin test generator.

Improve comments.

clang/utils/TableGen/ClangOpenCLBuiltinEmitter.cpp
924

Not sure what documentation you are expecting? Are there any particular lines that are "not very obvious"?
Note that there is already a comment for this method up in the class declaration.

959

This comment only applies to the first loop-nest. It documents an implementation detail, so I don't think this comment should be lifted to the method's declaration comment.

988

I have expanded the comment, and also factored out an index into a variable. Hopefully that makes it clear?

Anastasia accepted this revision.Jun 9 2021, 3:56 AM

LGTM! Thanks!

This revision is now accepted and ready to land.Jun 9 2021, 3:56 AM

@svenvh, are you going to add header-like output emitting?

svenvh added a comment.Jun 9 2021, 4:27 AM

@svenvh, are you going to add header-like output emitting?

Yes, I can add that if there is interest. I already have it on a local branch, so if there is interest (and it seems there is? :-)), then I will rebase that, tidy it up, and create a separate review.

@svenvh, are you going to add header-like output emitting?

Yes, I can add that if there is interest. I already have it on a local branch, so if there is interest (and it seems there is? :-)), then I will rebase that, tidy it up, and create a separate review.

Sounds great :) Thanks! Yeah, it may be needed as alternative testing of changes of opencl-h and testing OpenCL TableGen header as well.

This revision was landed with ongoing or failed builds.Jun 9 2021, 6:08 AM
This revision was automatically updated to reflect the committed changes.
Herald added a project: Restricted Project. · View Herald TranscriptJun 9 2021, 6:08 AM
Herald added a subscriber: ldrumm. · View Herald Transcript