Depends On D105037
Avoid creating too many tasks when the number of workers is large.
Differential D105126
[mlir:Async] Change async-parallel-for block size/count calculation ezhulenev on Jun 29 2021, 8:10 AM. Authored by
Details
Diff Detail
Event Timeline
Comment Actions After this change kMaxOvershading is unused, causing warnings in the clang build: /usr/bin/clang++ -DGTEST_HAS_RTTI=0 -DMLIR_CUDA_CONVERSIONS_ENABLED=0 -DMLIR_ROCM_CONVERSIONS_ENABLED=0 -D_DEBUG -D_GNU_SOURCE -D__STDC_CONSTANT_MACROS -D__STDC_FORMAT_MACROS -D__STDC_LIMIT_MACROS -Itools/mlir/lib/Dialect/Async/Transforms -I/mnt/vss/_work/2/s/mlir/lib/Dialect/Async/Transforms -Iinclude -I/mnt/vss/_work/2/s/llvm/include -I/mnt/vss/_work/2/s/mlir/include -Itools/mlir/include -fPIC -fvisibility-inlines-hidden -Werror -Werror=date-time -Werror=unguarded-availability-new -Wall -Wextra -Wno-unused-parameter -Wwrite-strings -Wcast-qual -Wmissing-field-initializers -pedantic -Wno-long-long -Wc++98-compat-extra-semi -Wimplicit-fallthrough -Wcovered-switch-default -Wno-noexcept-type -Wnon-virtual-dtor -Wdelete-non-virtual-dtor -Wsuggest-override -Wstring-conversion -Wmisleading-indentation -fdiagnostics-color -Werror=global-constructors -g -fno-exceptions -fno-rtti -std=c++14 -MD -MT tools/mlir/lib/Dialect/Async/Transforms/CMakeFiles/obj.MLIRAsyncTransforms.dir/AsyncParallelFor.cpp.o -MF tools/mlir/lib/Dialect/Async/Transforms/CMakeFiles/obj.MLIRAsyncTransforms.dir/AsyncParallelFor.cpp.o.d -o tools/mlir/lib/Dialect/Async/Transforms/CMakeFiles/obj.MLIRAsyncTransforms.dir/AsyncParallelFor.cpp.o -c /mnt/vss/_work/2/s/mlir/lib/Dialect/Async/Transforms/AsyncParallelFor.cpp /mnt/vss/_work/2/s/mlir/lib/Dialect/Async/Transforms/AsyncParallelFor.cpp:116:28: error: unused variable 'kMaxOversharding' [-Werror,-Wunused-const-variable] static constexpr int32_t kMaxOversharding = 4; If the variable is no longer needed, can we remove it? |
Does this mean that for large number of threads, we do not use all of them? That is because we assume we have too many threads for the available memory bandwidth?