This is an archive of the discontinued LLVM Phabricator instance.

[NVPTX] Added an option to run NVVMReflect pass.
Closed · Public

Authored by tra on Jul 30 2015, 2:48 PM.

Details

Reviewers

jholewinski
echristo

Commits

rG0127d80986a2: [NVPTX] Added run NVVMReflect pass to NVPTX back-end.
rL247072: [NVPTX] Added run NVVMReflect pass to NVPTX back-end.

Summary

The pass is needed to remove __nvvm_reflect calls after we link in libdevice bitcode that comes with CUDA.
http://llvm.org/docs/NVPTXUsage.html#linking-with-libdevice
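
To make the summary concrete, here is a minimal toy model of what "removing __nvvm_reflect calls" amounts to: each call is replaced by an integer constant looked up by its string key. This is a sketch under stated assumptions, not the pass's implementation. ToyInst and foldReflectCalls are hypothetical names that use no LLVM API, and folding unknown keys to 0 is an assumption of this sketch.

```cpp
#include <map>
#include <string>
#include <vector>

// Toy stand-in for an IR instruction (hypothetical, not an LLVM type):
// either a call to __nvvm_reflect with a string key, or a plain constant.
struct ToyInst {
  bool isReflectCall;
  std::string key;  // meaningful only while isReflectCall is true
  int constant;     // meaningful only once isReflectCall is false
};

// Replace every reflect call with the constant configured for its key.
// Keys missing from `settings` fold to 0 (an assumption of this sketch).
// Returns the number of calls that were folded.
inline int foldReflectCalls(std::vector<ToyInst> &insts,
                            const std::map<std::string, int> &settings) {
  int folded = 0;
  for (ToyInst &inst : insts) {
    if (!inst.isReflectCall)
      continue;  // ordinary instructions are left untouched
    auto it = settings.find(inst.key);
    inst.constant = (it != settings.end()) ? it->second : 0;
    inst.isReflectCall = false;
    inst.key.clear();
    ++folded;
  }
  return folded;
}
```

Once every call has become a constant, later optimizations can drop the branches that libdevice guards with those calls, which is why the folding has to happen after libdevice is linked in.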

Diff Detail

Repository: rL LLVM

Event Timeline

tra updated this revision to Diff 31076. Jul 30 2015, 2:48 PM

tra retitled this revision to [NVPTX] Added an option to run NVVMReflect pass.

tra updated this object.

tra added reviewers: echristo, jholewinski.

tra added a subscriber: llvm-commits.

Herald added a subscriber: jholewinski. Jul 30 2015, 2:48 PM

tra added a child revision: D11664: [CUDA] Implemented additional processing steps needed to link with CUDA libdevice bitcode. Jul 30 2015, 2:58 PM

Testcase to make sure it's running?

-eric

Added test case to verify that nvptx-enable-reflect option works.

I guess that's one way. I was hoping for a basic functionality test?

-eric

In D11663#231506, @echristo wrote:

I guess that's one way. I was hoping for a basic functionality test?

NVVMReflect pass functionality is already tested in test/CodeGen/NVPTX/nvvm-reflect.ll

In addition, tests in D11664 (clang-side counterpart of this patch) verify end-to-end functionality which ensures that __nvvm_reflect() is eliminated when -nvptx-enable-reflect option is passed.

Seems this could be unified with the nvvm-reflect-enable option in the pass itself? Actually, is there any reason this shouldn't be on by default for the backend? (i.e. am I missing something here?)

-eric

In D11663#241611, @echristo wrote:

Seems this could be unified with the nvvm-reflect-enable option in the pass itself? Actually, is there any reason this shouldn't be on by default for the backend? (i.e. am I missing something here?)

Your point makes sense as linking with libdevice will be the common case for compiling real CUDA apps.
Adding NVVMReflect pass unconditionally would break whoever may want to compile their own libdevice variant.
It can be easily fixed with -nvvm-reflect-enable=0, so it should not be too big of a deal.

On second thought, adding the NVVMReflect pass with *default* settings unconditionally may make it hard to replace it with the one that takes an optional StringMap. I don't think we need that now, but if/when that happens we can make the NVVMReflect pass conditional then.

Added NVVMReflect pass unconditionally.
Removed command-line option.
If NVVMReflect has to be disabled, it can be done with -nvvm-reflect-enable=0 option.
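
The design settled on here can be sketched as a toy pipeline in which the reflect pass is always scheduled and only the pass's own switch decides whether it does any work. This is an illustrative model, not the code in NVPTXTargetMachine.cpp. ToyReflectOptions, ToyPassEntry and buildToyPipeline are hypothetical names, and the `enabled` field merely stands in for the -nvvm-reflect-enable option.

```cpp
#include <string>
#include <vector>

// Hypothetical stand-in for the pass's own -nvvm-reflect-enable option.
struct ToyReflectOptions {
  bool enabled = true;  // on by default, as in the final patch
};

// Toy pipeline entry: a pass name plus whether it will act when run.
struct ToyPassEntry {
  std::string name;
  bool active;
};

// Build a toy pipeline in which the reflect pass is always present.
// The back-end has no option of its own; the pass's switch decides
// whether the pass does anything.
inline std::vector<ToyPassEntry>
buildToyPipeline(const ToyReflectOptions &opts) {
  std::vector<ToyPassEntry> pipeline;
  pipeline.push_back({"NVVMReflect", opts.enabled});
  pipeline.push_back({"NVPTXImageOptimizer", true});
  pipeline.push_back({"NVPTXAssignValidGlobalNames", true});
  pipeline.push_back({"GenericToNVVM", true});
  return pipeline;
}
```

Keeping a single switch inside the pass avoids having two options that must agree, which is the unification suggested earlier in the thread.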

LGTM. Thanks.

This revision is now accepted and ready to land. Sep 8 2015, 1:59 PM

Closed by commit rL247072: [NVPTX] Added run NVVMReflect pass to NVPTX back-end. (authored by tra). Sep 8 2015, 2:06 PM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path: llvm/trunk/lib/Target/NVPTX/NVPTXTargetMachine.cpp
Size: 1 line

Diff 34258

llvm/trunk/lib/Target/NVPTX/NVPTXTargetMachine.cpp

(170 earlier lines not shown; context: void NVPTXPassConfig::addIRPasses() {)
    // around after register allocation (which in our case, is all registers).
    // We explicitly disable them here. We do, however, need some functionality
    // of the PrologEpilogCodeInserter pass, so we emulate that behavior in the
    // NVPTXPrologEpilog pass (see NVPTXPrologEpilogPass.cpp).
    disablePass(&PrologEpilogCodeInserterID);
    disablePass(&MachineCopyPropagationID);
    disablePass(&TailDuplicateID);

+   addPass(createNVVMReflectPass());
    addPass(createNVPTXImageOptimizerPass());
    addPass(createNVPTXAssignValidGlobalNamesPass());
    addPass(createGenericToNVVMPass());

    // === Propagate special address spaces ===
    addPass(createNVPTXLowerKernelArgsPass(&getNVPTXTargetMachine()));
    // NVPTXLowerKernelArgs emits alloca for byval parameters which can often
    // be eliminated by SROA.
(135 later lines not shown)
