MS ABI: Implement thread_local for global variables
This add support for the C++11 feature, thread_local global variables.
The ABI Clang implements is an improvement of the MSVC ABI. Sadly,
further improvements could be made but not without sacrificing ABI
The feature is implemented as follows:
- All thread_local initialization routines are pointed to from the .CRT$XDU section.
- All non-weak thread_local variables have their initialization routines call from a single function instead of getting their own .CRT$XDU section entry. This is done to open up optimization opportunities to the compiler.
- All weak thread_local variables have their own .CRT$XDU section entry. This entry is in a COMDAT with the global variable it is initializing; this ensures that we will initialize the global exactly once.
- Destructors are registered in the initialization function using __tlregdtor.
Differential Revision: http://reviews.llvm.org/D5597