This is not supported to change anything but allow us to reuse the math
functions separately from the device functions, e.g., source them at
different times. This will be used by the OpenMP overlay.
This also adds two return keywords that were missing.
I accidentally moved the clock functions and added the openmp ifdef we'll need later. I'll move the clock functions back before I commit.