Major changes:
- nonmonotonic modifier mapped on to static_steal schedule;
- static_steal schedule extended to x86 architecture, and loops with 8-byte induction variable, used critical section to modify pair of 8-byte values;
- victim choosing algorithm enhanced;
- threshold of not-done chunks victim has for stealing reduced from 4 to 2 chunks.