clang-format changes "<<<" to "<< <" which is problematic for CUDA code. Change the behavior of clang-format to operate correctly with CUDA kernel launches without changing language options.
- No space required between "<<" and "<" in "<<<".
- No space required between identifier and "<<<". This avoids changing "f<<<" to "f <<<"
- No space required between ">" and "<<<". Avoids changing "f<t><<<" to "f<t> <<<".
- No space required between ">>>" and "("
Such that "f<param><<<1, 1>>>();" remains unchanged instead of being changed to "f<param> << <1, 1>>> ();"
I really don't like such static variables. Also, now we have two such mechanisms (GreaterStashed and Count). Seems like the same mechanisms should be used.