This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/tools/llvm-reduce/deltas/
-
tools/
-
llvm-reduce/
-
deltas/
-
ReduceArguments.cpp
-
ReduceBasicBlocks.cpp
-
ReduceGlobalVars.cpp
-
ReduceInstructions.cpp

Differential D112757

[llvm-reduce] optimize extractFromModule functions
ClosedPublic

Authored by dwightguth on Oct 28 2021, 2:10 PM.

Download Raw Diff

Details

Reviewers

aeubanks

Commits

rG2f1617362751: [llvm-reduce] optimize extractFromModule functions

Summary

The extractBasicBlocksFromModule, extractInstrFromModule, and other
similar functions previously performed very poorly when the number of
such elements in the program to reduce was very high. Previously, we
were creating the set which caches elements to keep by looping through
all elements in the module and adding them to the set. However, since
std::set is an ordered set, this introduces a massive amount of
rebalancing if the order of elements in the program and the order of
their pointers in memory are not the same.

The solution is straightforward: first put all the elements to be kept
in a vector, then use the constructor for std::set which takes a pair of
iterators over a collection. This constructor is optimized to avoid
doing unnecessary work when initializing large sets.

Also in this change, we pass BBsToKeep set to functions
replaceBranchTerminator and removeUninterestingBBsFromSwitch as a const
reference rather than passing it by value. This ought to prevent the
need to copy the collection each time these functions are called, which
is expensive if the collection is large.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

dwightguth requested review of this revision.Oct 28 2021, 2:10 PM

dwightguth created this revision.

Herald added a project: Restricted Project. · View Herald TranscriptOct 28 2021, 2:10 PM

Herald added a subscriber: llvm-commits. · View Herald Transcript

removed stray change made by mistake

does std::unordered_set help?

I tried changing it to an unordered set without any of the other changes originally, but it still had the same problem of a lot of time spent inserting into the hash table. I eventually abandoned that approach because my intuition was that a hash table where the keys are addresses in memory would probably end up poorly balanced a lot of the time.

If you want I can try out the combination of pushing first into a vector and also creating a hash set from the vector, but I didn't go that route because I had already largely addressed the performance bottleneck in this function with the diff as you see it now, and it didn't seem worth the risk that it might be badly balanced in some cases when the majority of the execution time was now taking place elsewhere.

Harbormaster completed remote builds in B131297: Diff 383165.Oct 28 2021, 3:37 PM

adding some comments saying that this is for performance reasons would be good

This revision is now accepted and ready to land.Oct 28 2021, 3:57 PM

add comments

I addressed your suggestion that we add comments. I do not have commit access yet; can you please commit it if you're happy with the diff?

This revision was landed with ongoing or failed builds.Oct 29 2021, 10:08 AM

Closed by commit rG2f1617362751: [llvm-reduce] optimize extractFromModule functions (authored by dwightguth, committed by aeubanks). · Explain Why

This revision was automatically updated to reflect the committed changes.

aeubanks added a commit: rG2f1617362751: [llvm-reduce] optimize extractFromModule functions.

Harbormaster completed remote builds in B131458: Diff 383400.Oct 29 2021, 10:35 AM

Revision Contents

Path

Size

llvm/

tools/

llvm-reduce/

deltas/

ReduceArguments.cpp

8 lines

ReduceBasicBlocks.cpp

13 lines

ReduceGlobalVars.cpp

7 lines

ReduceInstructions.cpp

9 lines

Diff 383163

llvm/tools/llvm-reduce/deltas/ReduceArguments.cpp

	//===- ReduceArguments.cpp - Specialized Delta Pass -----------------------===//			//===- ReduceArguments.cpp - Specialized Delta Pass -----------------------===//
				Lint: Lint Inline Actions clang-format not found in user’s local PATH; not linting file. Lint: Lint: clang-format not found in user’s local PATH; not linting file.
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//
	Show All 10 Lines
	#include <vector>			#include <vector>

	using namespace llvm;			using namespace llvm;

	/// Goes over OldF calls and replaces them with a call to NewF			/// Goes over OldF calls and replaces them with a call to NewF
	static void replaceFunctionCalls(Function &OldF, Function &NewF,			static void replaceFunctionCalls(Function &OldF, Function &NewF,
	const std::set<int> &ArgIndexesToKeep) {			const std::set<int> &ArgIndexesToKeep) {
	const auto &Users = OldF.users();			const auto &Users = OldF.users();
	for (auto I = Users.begin(), E = Users.end(); I != E; )			for (auto I = Users.begin(), E = Users.end(); I != E;)
	if (auto CI = dyn_cast<CallInst>(I++)) {			if (auto CI = dyn_cast<CallInst>(I++)) {
	// Skip uses in call instructions where OldF isn't the called function			// Skip uses in call instructions where OldF isn't the called function
	// (e.g. if OldF is an argument of the call).			// (e.g. if OldF is an argument of the call).
	if (CI->getCalledFunction() != &OldF)			if (CI->getCalledFunction() != &OldF)
	continue;			continue;
	SmallVector<Value *, 8> Args;			SmallVector<Value *, 8> Args;
	for (auto ArgI = CI->arg_begin(), E = CI->arg_end(); ArgI != E; ++ArgI)			for (auto ArgI = CI->arg_begin(), E = CI->arg_end(); ArgI != E; ++ArgI)
	if (ArgIndexesToKeep.count(ArgI - CI->arg_begin()))			if (ArgIndexesToKeep.count(ArgI - CI->arg_begin()))
	Show All 13 Lines
	/// fixed.			/// fixed.
	static bool shouldRemoveArguments(const Function &F) {			static bool shouldRemoveArguments(const Function &F) {
	return !F.arg_empty() && !F.isIntrinsic();			return !F.arg_empty() && !F.isIntrinsic();
	}			}

	/// Removes out-of-chunk arguments from functions, and modifies their calls			/// Removes out-of-chunk arguments from functions, and modifies their calls
	/// accordingly. It also removes allocations of out-of-chunk arguments.			/// accordingly. It also removes allocations of out-of-chunk arguments.
	static void extractArgumentsFromModule(Oracle &O, Module &Program) {			static void extractArgumentsFromModule(Oracle &O, Module &Program) {
	std::set<Argument *> ArgsToKeep;			std::vector<Argument *> InitArgsToKeep;
	std::vector<Function *> Funcs;			std::vector<Function *> Funcs;
	// Get inside-chunk arguments, as well as their parent function			// Get inside-chunk arguments, as well as their parent function
	for (auto &F : Program)			for (auto &F : Program)
	if (shouldRemoveArguments(F)) {			if (shouldRemoveArguments(F)) {
	Funcs.push_back(&F);			Funcs.push_back(&F);
	for (auto &A : F.args())			for (auto &A : F.args())
	if (O.shouldKeep())			if (O.shouldKeep())
	ArgsToKeep.insert(&A);			InitArgsToKeep.push_back(&A);
	}			}

				std::set<Argument *> ArgsToKeep(InitArgsToKeep.begin(), InitArgsToKeep.end());

	for (auto *F : Funcs) {			for (auto *F : Funcs) {
	ValueToValueMapTy VMap;			ValueToValueMapTy VMap;
	std::vector<WeakVH> InstToDelete;			std::vector<WeakVH> InstToDelete;
	for (auto &A : F->args())			for (auto &A : F->args())
	if (!ArgsToKeep.count(&A)) {			if (!ArgsToKeep.count(&A)) {
	// By adding undesired arguments to the VMap, CloneFunction will remove			// By adding undesired arguments to the VMap, CloneFunction will remove
	// them from the resulting Function			// them from the resulting Function
	VMap[&A] = UndefValue::get(A.getType());			VMap[&A] = UndefValue::get(A.getType());
	▲ Show 20 Lines • Show All 61 Lines • Show Last 20 Lines

llvm/tools/llvm-reduce/deltas/ReduceBasicBlocks.cpp

//===- ReduceArguments.cpp - Specialized Delta Pass -----------------------===//		//===- ReduceArguments.cpp - Specialized Delta Pass -----------------------===//
		Lint: Lint Inline Actions clang-format not found in user’s local PATH; not linting file. Lint: Lint: clang-format not found in user’s local PATH; not linting file.
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
Show All 11 Lines
#include "llvm/Support/Casting.h"		#include "llvm/Support/Casting.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"
#include <vector>		#include <vector>

using namespace llvm;		using namespace llvm;

/// Replaces BB Terminator with one that only contains Chunk BBs		/// Replaces BB Terminator with one that only contains Chunk BBs
static void replaceBranchTerminator(BasicBlock &BB,		static void replaceBranchTerminator(BasicBlock &BB,
std::set<BasicBlock *> BBsToKeep) {		const std::set<BasicBlock *> &BBsToKeep) {
auto *Term = BB.getTerminator();		auto *Term = BB.getTerminator();
std::vector<BasicBlock *> ChunkSucessors;		std::vector<BasicBlock *> ChunkSucessors;
for (auto *Succ : successors(&BB))		for (auto *Succ : successors(&BB))
if (BBsToKeep.count(Succ))		if (BBsToKeep.count(Succ))
ChunkSucessors.push_back(Succ);		ChunkSucessors.push_back(Succ);

// BB only references Chunk BBs		// BB only references Chunk BBs
if (ChunkSucessors.size() == Term->getNumSuccessors())		if (ChunkSucessors.size() == Term->getNumSuccessors())
Show All 24 Lines	if (Address) {
for (auto *Dest : ChunkSucessors)		for (auto *Dest : ChunkSucessors)
NewIndBI->addDestination(Dest);		NewIndBI->addDestination(Dest);
}		}
}		}

/// Removes uninteresting BBs from switch, if the default case ends up being		/// Removes uninteresting BBs from switch, if the default case ends up being
/// uninteresting, the switch is replaced with a void return (since it has to be		/// uninteresting, the switch is replaced with a void return (since it has to be
/// replace with something)		/// replace with something)
static void removeUninterestingBBsFromSwitch(SwitchInst &SwInst,		static void
std::set<BasicBlock *> BBsToKeep) {		removeUninterestingBBsFromSwitch(SwitchInst &SwInst,
		const std::set<BasicBlock *> &BBsToKeep) {
if (!BBsToKeep.count(SwInst.getDefaultDest())) {		if (!BBsToKeep.count(SwInst.getDefaultDest())) {
auto *FnRetTy = SwInst.getParent()->getParent()->getReturnType();		auto *FnRetTy = SwInst.getParent()->getParent()->getReturnType();
ReturnInst::Create(SwInst.getContext(),		ReturnInst::Create(SwInst.getContext(),
FnRetTy->isVoidTy() ? nullptr : UndefValue::get(FnRetTy),		FnRetTy->isVoidTy() ? nullptr : UndefValue::get(FnRetTy),
SwInst.getParent());		SwInst.getParent());
SwInst.eraseFromParent();		SwInst.eraseFromParent();
} else		} else
for (int I = 0, E = SwInst.getNumCases(); I != E; ++I) {		for (int I = 0, E = SwInst.getNumCases(); I != E; ++I) {
auto Case = SwInst.case_begin() + I;		auto Case = SwInst.case_begin() + I;
if (!BBsToKeep.count(Case->getCaseSuccessor())) {		if (!BBsToKeep.count(Case->getCaseSuccessor())) {
SwInst.removeCase(Case);		SwInst.removeCase(Case);
--I;		--I;
--E;		--E;
}		}
}		}
}		}

/// Removes out-of-chunk arguments from functions, and modifies their calls		/// Removes out-of-chunk arguments from functions, and modifies their calls
/// accordingly. It also removes allocations of out-of-chunk arguments.		/// accordingly. It also removes allocations of out-of-chunk arguments.
static void extractBasicBlocksFromModule(Oracle &O, Module &Program) {		static void extractBasicBlocksFromModule(Oracle &O, Module &Program) {
std::set<BasicBlock *> BBsToKeep;		std::vector<BasicBlock *> InitBBsToKeep;

for (auto &F : Program)		for (auto &F : Program)
for (auto &BB : F)		for (auto &BB : F)
if (O.shouldKeep())		if (O.shouldKeep())
BBsToKeep.insert(&BB);		InitBBsToKeep.push_back(&BB);

		std::set<BasicBlock *> BBsToKeep(InitBBsToKeep.begin(), InitBBsToKeep.end());

std::vector<BasicBlock *> BBsToDelete;		std::vector<BasicBlock *> BBsToDelete;
for (auto &F : Program)		for (auto &F : Program)
for (auto &BB : F) {		for (auto &BB : F) {
if (!BBsToKeep.count(&BB)) {		if (!BBsToKeep.count(&BB)) {
BBsToDelete.push_back(&BB);		BBsToDelete.push_back(&BB);
// Remove out-of-chunk BB from successor phi nodes		// Remove out-of-chunk BB from successor phi nodes
for (auto *Succ : successors(&BB))		for (auto *Succ : successors(&BB))
▲ Show 20 Lines • Show All 43 Lines • Show Last 20 Lines

llvm/tools/llvm-reduce/deltas/ReduceGlobalVars.cpp

	//===- ReduceGlobalVars.cpp - Specialized Delta Pass ----------------------===//			//===- ReduceGlobalVars.cpp - Specialized Delta Pass ----------------------===//
				Lint: Lint Inline Actions clang-format not found in user’s local PATH; not linting file. Lint: Lint: clang-format not found in user’s local PATH; not linting file.
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//
	// This file implements a function which calls the Generic Delta pass in order			// This file implements a function which calls the Generic Delta pass in order
	// to reduce Global Variables in the provided Module.			// to reduce Global Variables in the provided Module.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "ReduceGlobalVars.h"			#include "ReduceGlobalVars.h"
	#include "llvm/IR/Constants.h"			#include "llvm/IR/Constants.h"
	#include <set>			#include <set>

	using namespace llvm;			using namespace llvm;

	/// Removes all the GVs that aren't inside the desired Chunks.			/// Removes all the GVs that aren't inside the desired Chunks.
	static void extractGVsFromModule(Oracle &O, Module &Program) {			static void extractGVsFromModule(Oracle &O, Module &Program) {
	// Get GVs inside desired chunks			// Get GVs inside desired chunks
	std::set<GlobalVariable *> GVsToKeep;			std::vector<GlobalVariable *> InitGVsToKeep;
	for (auto &GV : Program.globals())			for (auto &GV : Program.globals())
	if (O.shouldKeep())			if (O.shouldKeep())
	GVsToKeep.insert(&GV);			InitGVsToKeep.push_back(&GV);

				std::set<GlobalVariable *> GVsToKeep(InitGVsToKeep.begin(),
				InitGVsToKeep.end());

	// Delete out-of-chunk GVs and their uses			// Delete out-of-chunk GVs and their uses
	std::vector<GlobalVariable *> ToRemove;			std::vector<GlobalVariable *> ToRemove;
	std::vector<WeakVH> InstToRemove;			std::vector<WeakVH> InstToRemove;
	for (auto &GV : Program.globals())			for (auto &GV : Program.globals())
	if (!GVsToKeep.count(&GV)) {			if (!GVsToKeep.count(&GV)) {
	for (auto *U : GV.users())			for (auto *U : GV.users())
	if (auto *Inst = dyn_cast<Instruction>(U))			if (auto *Inst = dyn_cast<Instruction>(U))
	Show All 37 Lines

llvm/tools/llvm-reduce/deltas/ReduceInstructions.cpp

	//===- ReduceArguments.cpp - Specialized Delta Pass -----------------------===//			//===- ReduceArguments.cpp - Specialized Delta Pass -----------------------===//
				Lint: Lint Inline Actions clang-format not found in user’s local PATH; not linting file. Lint: Lint: clang-format not found in user’s local PATH; not linting file.
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//
	// This file implements a function which calls the Generic Delta pass in order			// This file implements a function which calls the Generic Delta pass in order
	// to reduce uninteresting Arguments from defined functions.			// to reduce uninteresting Arguments from defined functions.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "ReduceInstructions.h"			#include "ReduceInstructions.h"

	using namespace llvm;			using namespace llvm;

	/// Removes out-of-chunk arguments from functions, and modifies their calls			/// Removes out-of-chunk arguments from functions, and modifies their calls
	/// accordingly. It also removes allocations of out-of-chunk arguments.			/// accordingly. It also removes allocations of out-of-chunk arguments.
	static void extractInstrFromModule(Oracle &O, Module &Program) {			static void extractInstrFromModule(Oracle &O, Module &Program) {
	std::set<Instruction *> InstToKeep;			std::vector<Instruction *> InitInstToKeep;

	for (auto &F : Program)			for (auto &F : Program)
	for (auto &BB : F) {			for (auto &BB : F) {
	// Removing the terminator would make the block invalid. Only iterate over			// Removing the terminator would make the block invalid. Only iterate over
	// instructions before the terminator.			// instructions before the terminator.
	InstToKeep.insert(BB.getTerminator());			InitInstToKeep.push_back(BB.getTerminator());
	for (auto &Inst : make_range(BB.begin(), std::prev(BB.end())))			for (auto &Inst : make_range(BB.begin(), std::prev(BB.end())))
	if (O.shouldKeep())			if (O.shouldKeep())
	InstToKeep.insert(&Inst);			InitInstToKeep.push_back(&Inst);
	}			}

				std::set<Instruction *> InstToKeep(InitInstToKeep.begin(),
				InitInstToKeep.end());

	std::vector<Instruction *> InstToDelete;			std::vector<Instruction *> InstToDelete;
	for (auto &F : Program)			for (auto &F : Program)
	for (auto &BB : F)			for (auto &BB : F)
	for (auto &Inst : BB)			for (auto &Inst : BB)
	if (!InstToKeep.count(&Inst)) {			if (!InstToKeep.count(&Inst)) {
	Inst.replaceAllUsesWith(UndefValue::get(Inst.getType()));			Inst.replaceAllUsesWith(UndefValue::get(Inst.getType()));
	InstToDelete.push_back(&Inst);			InstToDelete.push_back(&Inst);
	}			}
	Show All 24 Lines