Implement MonotonicNanoTime using QueryPerformanceCounter.
This function is used by Scudo and the 64-bit Primary allocator. Implementing
it means that the release-to-OS mechanism of the Primary will now kick in (it
never did before because the function always returned 0), although
ReleaseMemoryPagesToOS is still not implemented for Windows.
Performance-wise, this adds a syscall and a 64-bit division per call to
MonotonicNanoTime, so the impact might not be negligible, but I don't think
there is a way around it.
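
For illustration, here is a minimal sketch of the kind of conversion involved.
It is not the code from this change: TicksToNanos and MonotonicNanoTimeSketch
are hypothetical names, and the sketch splits the tick count into whole seconds
and a remainder so that the multiplication by 1e9 does not overflow, which
costs an extra division compared with the single division mentioned above.

```cpp
#include <cstdint>
#ifdef _WIN32
#include <windows.h>
#endif

// Hypothetical helper (not part of the change): converts a performance
// counter tick count to nanoseconds. Whole seconds and the sub-second
// remainder are converted separately so the intermediate products stay
// within 64 bits for realistic counter frequencies.
inline uint64_t TicksToNanos(uint64_t ticks, uint64_t ticks_per_second) {
  const uint64_t kNanosPerSecond = 1000000000ULL;
  const uint64_t seconds = ticks / ticks_per_second;
  const uint64_t remainder = ticks % ticks_per_second;
  return seconds * kNanosPerSecond +
         remainder * kNanosPerSecond / ticks_per_second;
}

#ifdef _WIN32
// Hypothetical sketch of a monotonic clock built on QueryPerformanceCounter.
// The counter frequency is fixed at boot, so it is queried only once.
inline uint64_t MonotonicNanoTimeSketch() {
  static const uint64_t frequency = [] {
    LARGE_INTEGER f;
    QueryPerformanceFrequency(&f);
    return static_cast<uint64_t>(f.QuadPart);
  }();
  LARGE_INTEGER counter;
  QueryPerformanceCounter(&counter);
  return TicksToNanos(static_cast<uint64_t>(counter.QuadPart), frequency);
}
#endif
```

The frequency lookup happens once, so the per-call cost is the counter query
plus the integer arithmetic in TicksToNanos.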