- Remove unnecessary data from list node structure
- Remove timerPair in favor of pushing/popping explicitTimers. This way, nested timers will work properly.
- Fix #pragma omp critical timers
- Add histogram capability
- Add KMP_STATS_FILE formatting capability
- Have time partitioned into serial & parallel by introducing partitionedTimers::exchange(). This also counts the number of serial regions in the executable.
- Fix up the timers around OMP loops so that scheduling overhead and work are both counted correctly.
- Fix up the iterations statistics so they count the number of iterations the thread receives at each loop scheduling event
- Change timers so there is only one RDTSC read per event change
- Fix up the outdated comments for the timers
Details
Details
Diff Detail
Diff Detail
- Repository
- rL LLVM