Introduce a conversion pass from SCF parallel loops to OpenMP dialect
constructs: a parallel region and a worksharing loop. Loops with reductions
are not supported because the OpenMP dialect cannot model them yet.
The conversion currently targets only one level of parallelism: a single
top-level omp.parallel operation is produced even if there are nested
scf.parallel operations that could be mapped to omp.wsloop. Support for
nested parallelism is left for future work.