This adds support for the unroll_and_jam pragma, to go with D41953. The name of
the pragma is copied from the Intel compiler, and most of the code works the same as
for unroll.
I have some doubts whether this will ever be used sensibly in the real world, but can
be useful for testing and was not difficult to put together.
[nit] unrelated whitespace change?