This is an archive of the discontinued LLVM Phabricator instance.

[FoldingSet] Move compare to header (NFC).
Needs Review · Public

Authored by fhahn on Dec 7 2022, 1:00 PM.

Details

Reviewers

nikic
serge-sans-paille
yurai007

Summary

Moving the compare operator implementations to the header gives a slight
speedup. Not sure if the speedups are worthwhile, but I noticed this
when looking at folding set performance in general.

NewPM-O3: -0.04%
NewPM-ReleaseThinLTO: -0.03%
NewPM-ReleaseLTO-g: -0.01%

https://llvm-compile-time-tracker.com/compare.php?from=8fa81cfad47c827ea1b75afe1f32861d4a7bad37&to=3c5e5ebd834106e62407e79d80ced0a49eafc4b7&stat=instructions:u

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

fhahn created this revision. · Dec 7 2022, 1:00 PM

Herald added a project: Restricted Project. · Dec 7 2022, 1:00 PM

Herald added a subscriber: hiraditya.

fhahn requested review of this revision. · Dec 7 2022, 1:00 PM

Out of curiosity: do you have any speedup when building with LTO? Looks like something LTO should handle gracefully...

Harbormaster completed remote builds in B201786: Diff 481026. · Dec 7 2022, 10:47 PM

In D139573#3979467, @serge-sans-paille wrote:

Out of curiosity: do you have any speedup when building with LTO? Looks like something LTO should handle gracefully...

Sure, this is not needed when building with LTO, but many builds don't use LTO :)

looking at folding set performance in general

As a side note on possible future improvements: when I was working on related patches some months ago, I noted two extra ideas that could be worth exploring:

[1] Faster hashing for the FoldingSet* family of classes. Symbols like llvm::FoldingSetBase::FindNodeOrInsertPos and llvm::hashing::detail::hash_short often show up as hot in profiler output (or at least they did a couple of months ago). There is a long-standing task in my queue to try xxHash, or even CRC, just for FoldingSet and see if it helps.

[2] Somehow removing the indirection through FoldingSet::NodeEquals for the following callers: the Profile functions of FunctionProtoTypes, PointerTypes, ElaboratedTypes and ParenTypes. IIRC that indirection made inlining impossible, which mattered for projects whose compile time is heavily dominated by the frontend. Moving the compare operators in this change may help NodeEquals, but I believe there is still some room for improvement.

I'm not sure about [2], but I think [1] should bring some value even for builds with LTO=ON.
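
To make idea [1] a little more concrete, here is a rough sketch of where a replacement hash would plug in: a function that takes the same (pointer, word count) pair that FoldingSetNodeIDRef::ComputeHash passes to hash_combine_range. The helper name `hashWordsFNV1a` is hypothetical and not part of LLVM. FNV-1a is used only because it is short enough to write inline; it is neither xxHash nor CRC, and nothing here says whether it would be faster or slower than the current hashing.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical helper, not part of LLVM: hashes the profiled words of a node
// ID with 32-bit FNV-1a, consuming the underlying bytes one at a time.
inline std::uint32_t hashWordsFNV1a(const unsigned *Data, std::size_t Size) {
  assert((Data || Size == 0) && "null data with a non-zero size");
  std::uint32_t Hash = 2166136261u; // FNV-1a 32-bit offset basis
  const auto *Bytes = reinterpret_cast<const unsigned char *>(Data);
  for (std::size_t I = 0, E = Size * sizeof(*Data); I != E; ++I) {
    Hash ^= Bytes[I];
    Hash *= 16777619u; // FNV-1a 32-bit prime
  }
  return Hash;
}
```

Any candidate (xxHash, CRC, or something else) would need the same property the sketch has: two IDs with identical words must hash identically, since the hash only selects the bucket and the compare operators decide equality.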

Revision Contents

Path                                  Size
llvm/include/llvm/ADT/FoldingSet.h    33 lines
llvm/lib/Support/FoldingSet.cpp       38 lines

Diff 481026

llvm/include/llvm/ADT/FoldingSet.h

[17 lines not shown]

#include "llvm/ADT/Hashing.h"		#include "llvm/ADT/Hashing.h"
#include "llvm/ADT/SmallVector.h"		#include "llvm/ADT/SmallVector.h"
#include "llvm/ADT/iterator.h"		#include "llvm/ADT/iterator.h"
#include "llvm/Support/Allocator.h"		#include "llvm/Support/Allocator.h"
#include <cassert>		#include <cassert>
#include <cstddef>		#include <cstddef>
#include <cstdint>		#include <cstdint>
		#include <cstring>
#include <type_traits>		#include <type_traits>
#include <utility>		#include <utility>

namespace llvm {		namespace llvm {

/// This folding set used for two purposes:		/// This folding set used for two purposes:
/// 1. Given information about a node we want to create, look up the unique		/// 1. Given information about a node we want to create, look up the unique
/// instance of the node in the set. If the node already exists, return		/// instance of the node in the set. If the node already exists, return
[260 lines not shown]	public:
FoldingSetNodeIDRef(const unsigned *D, size_t S) : Data(D), Size(S) {}		FoldingSetNodeIDRef(const unsigned *D, size_t S) : Data(D), Size(S) {}

/// ComputeHash - Compute a strong hash value for this FoldingSetNodeIDRef,		/// ComputeHash - Compute a strong hash value for this FoldingSetNodeIDRef,
/// used to lookup the node in the FoldingSetBase.		/// used to lookup the node in the FoldingSetBase.
unsigned ComputeHash() const {		unsigned ComputeHash() const {
return static_cast<unsigned>(hash_combine_range(Data, Data + Size));		return static_cast<unsigned>(hash_combine_range(Data, Data + Size));
}		}

bool operator==(FoldingSetNodeIDRef) const;		bool operator==(const FoldingSetNodeIDRef &RHS) const {
		if (Size != RHS.Size)
		return false;
		return memcmp(Data, RHS.Data, Size * sizeof(*Data)) == 0;
		}

bool operator!=(FoldingSetNodeIDRef RHS) const { return !(*this == RHS); }		bool operator!=(const FoldingSetNodeIDRef &RHS) const {
		return !(*this == RHS);
		}

/// Used to compare the "ordering" of two nodes as defined by the		/// Used to compare the "ordering" of two nodes as defined by the
/// profiled bits and their ordering defined by memcmp().		/// profiled bits and their ordering defined by memcmp().
bool operator<(FoldingSetNodeIDRef) const;		bool operator<(const FoldingSetNodeIDRef &RHS) const {
		if (Size != RHS.Size)
		return Size < RHS.Size;
		return memcmp(Data, RHS.Data, Size * sizeof(*Data)) < 0;
		}

const unsigned *getData() const { return Data; }		const unsigned *getData() const { return Data; }
size_t getSize() const { return Size; }		size_t getSize() const { return Size; }
};		};

//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
/// FoldingSetNodeID - This class is used to gather all the unique data bits of		/// FoldingSetNodeID - This class is used to gather all the unique data bits of
/// a node. When all the bits are gathered this class is used to produce a		/// a node. When all the bits are gathered this class is used to produce a
[50 lines not shown]	public:

/// ComputeHash - Compute a strong hash value for this FoldingSetNodeID, used		/// ComputeHash - Compute a strong hash value for this FoldingSetNodeID, used
/// to lookup the node in the FoldingSetBase.		/// to lookup the node in the FoldingSetBase.
unsigned ComputeHash() const {		unsigned ComputeHash() const {
return FoldingSetNodeIDRef(Bits.data(), Bits.size()).ComputeHash();		return FoldingSetNodeIDRef(Bits.data(), Bits.size()).ComputeHash();
}		}

/// operator== - Used to compare two nodes to each other.		/// operator== - Used to compare two nodes to each other.
bool operator==(const FoldingSetNodeID &RHS) const;		bool operator==(const FoldingSetNodeID &RHS) const {
bool operator==(const FoldingSetNodeIDRef RHS) const;		return *this == FoldingSetNodeIDRef(RHS.Bits.data(), RHS.Bits.size());
		}
		bool operator==(const FoldingSetNodeIDRef RHS) const {
		return FoldingSetNodeIDRef(Bits.data(), Bits.size()) == RHS;
		}

bool operator!=(const FoldingSetNodeID &RHS) const { return !(*this == RHS); }		bool operator!=(const FoldingSetNodeID &RHS) const { return !(*this == RHS); }
bool operator!=(const FoldingSetNodeIDRef RHS) const { return !(*this ==RHS);}		bool operator!=(const FoldingSetNodeIDRef RHS) const { return !(*this ==RHS);}

/// Used to compare the "ordering" of two nodes as defined by the		/// Used to compare the "ordering" of two nodes as defined by the
/// profiled bits and their ordering defined by memcmp().		/// profiled bits and their ordering defined by memcmp().
bool operator<(const FoldingSetNodeID &RHS) const;		bool operator<(const FoldingSetNodeID &RHS) const {
bool operator<(const FoldingSetNodeIDRef RHS) const;		return *this < FoldingSetNodeIDRef(RHS.Bits.data(), RHS.Bits.size());
		}
		bool operator<(const FoldingSetNodeIDRef RHS) const {
		return FoldingSetNodeIDRef(Bits.data(), Bits.size()) < RHS;
		}

/// Intern - Copy this node's data to a memory region allocated from the		/// Intern - Copy this node's data to a memory region allocated from the
/// given allocator and return a FoldingSetNodeIDRef describing the		/// given allocator and return a FoldingSetNodeIDRef describing the
/// interned data.		/// interned data.
FoldingSetNodeIDRef Intern(BumpPtrAllocator &Allocator) const;		FoldingSetNodeIDRef Intern(BumpPtrAllocator &Allocator) const;
};		};

// Convenience type to hide the implementation of the folding set.		// Convenience type to hide the implementation of the folding set.
[449 lines not shown]
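
The semantics of the operators moved into the header above can be sketched in isolation. The following is a minimal, self-contained approximation; `NodeIDRef` is a hypothetical stand-in and not the LLVM class. It assumes the same layout as in the diff (a pointer to `unsigned` words plus a word count) and shows that equality checks the size first and then the bytes, while ordering places shorter IDs first before falling back to memcmp().

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>

// Hypothetical stand-in for FoldingSetNodeIDRef, for illustration only.
class NodeIDRef {
  const unsigned *Data = nullptr;
  std::size_t Size = 0; // number of unsigned words, not bytes

public:
  NodeIDRef(const unsigned *D, std::size_t S) : Data(D), Size(S) {
    assert((D || S == 0) && "null data with a non-zero size");
  }

  // Defined in the class body, so the functions are implicitly inline and
  // their bodies are visible in every translation unit including the header.
  bool operator==(const NodeIDRef &RHS) const {
    if (Size != RHS.Size)
      return false;
    return std::memcmp(Data, RHS.Data, Size * sizeof(*Data)) == 0;
  }

  bool operator!=(const NodeIDRef &RHS) const { return !(*this == RHS); }

  // IDs with fewer words order first; IDs of equal length are ordered by
  // memcmp() over the raw bytes.
  bool operator<(const NodeIDRef &RHS) const {
    if (Size != RHS.Size)
      return Size < RHS.Size;
    return std::memcmp(Data, RHS.Data, Size * sizeof(*Data)) < 0;
  }
};
```

With this sketch, an ID holding the single word 9 orders before one holding {1, 2, 3}, because the size comparison runs before any data is inspected. For IDs of equal length the result follows byte order in memory, so it is not a numeric comparison of the words.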

llvm/lib/Support/FoldingSet.cpp

[16 lines not shown]
#include "llvm/Support/ErrorHandling.h"		#include "llvm/Support/ErrorHandling.h"
#include "llvm/Support/MathExtras.h"		#include "llvm/Support/MathExtras.h"
#include "llvm/Support/SwapByteOrder.h"		#include "llvm/Support/SwapByteOrder.h"
#include <cassert>		#include <cassert>
#include <cstring>		#include <cstring>
using namespace llvm;		using namespace llvm;

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// FoldingSetNodeIDRef Implementation

bool FoldingSetNodeIDRef::operator==(FoldingSetNodeIDRef RHS) const {
if (Size != RHS.Size) return false;
return memcmp(Data, RHS.Data, Size*sizeof(*Data)) == 0;
}

/// Used to compare the "ordering" of two nodes as defined by the
/// profiled bits and their ordering defined by memcmp().
bool FoldingSetNodeIDRef::operator<(FoldingSetNodeIDRef RHS) const {
if (Size != RHS.Size)
return Size < RHS.Size;
return memcmp(Data, RHS.Data, Size*sizeof(*Data)) < 0;
}

//===----------------------------------------------------------------------===//
// FoldingSetNodeID Implementation		// FoldingSetNodeID Implementation

/// Add* - Add various data types to Bit data.		/// Add* - Add various data types to Bit data.
///		///
void FoldingSetNodeID::AddString(StringRef String) {		void FoldingSetNodeID::AddString(StringRef String) {
unsigned Size = String.size();		unsigned Size = String.size();

unsigned NumInserts = 1 + divideCeil(Size, 4);		unsigned NumInserts = 1 + divideCeil(Size, 4);
[49 lines not shown]	void FoldingSetNodeID::AddString(StringRef String) {
Bits.push_back(V);		Bits.push_back(V);
}		}

// AddNodeID - Adds the Bit data of another ID to *this.		// AddNodeID - Adds the Bit data of another ID to *this.
void FoldingSetNodeID::AddNodeID(const FoldingSetNodeID &ID) {		void FoldingSetNodeID::AddNodeID(const FoldingSetNodeID &ID) {
Bits.append(ID.Bits.begin(), ID.Bits.end());		Bits.append(ID.Bits.begin(), ID.Bits.end());
}		}

/// operator== - Used to compare two nodes to each other.
///
bool FoldingSetNodeID::operator==(const FoldingSetNodeID &RHS) const {
return *this == FoldingSetNodeIDRef(RHS.Bits.data(), RHS.Bits.size());
}

/// operator== - Used to compare two nodes to each other.
///
bool FoldingSetNodeID::operator==(FoldingSetNodeIDRef RHS) const {
return FoldingSetNodeIDRef(Bits.data(), Bits.size()) == RHS;
}

/// Used to compare the "ordering" of two nodes as defined by the
/// profiled bits and their ordering defined by memcmp().
bool FoldingSetNodeID::operator<(const FoldingSetNodeID &RHS) const {
return *this < FoldingSetNodeIDRef(RHS.Bits.data(), RHS.Bits.size());
}

bool FoldingSetNodeID::operator<(FoldingSetNodeIDRef RHS) const {
return FoldingSetNodeIDRef(Bits.data(), Bits.size()) < RHS;
}

/// Intern - Copy this node's data to a memory region allocated from the		/// Intern - Copy this node's data to a memory region allocated from the
/// given allocator and return a FoldingSetNodeIDRef describing the		/// given allocator and return a FoldingSetNodeIDRef describing the
/// interned data.		/// interned data.
FoldingSetNodeIDRef		FoldingSetNodeIDRef
FoldingSetNodeID::Intern(BumpPtrAllocator &Allocator) const {		FoldingSetNodeID::Intern(BumpPtrAllocator &Allocator) const {
unsigned *New = Allocator.Allocate<unsigned>(Bits.size());		unsigned *New = Allocator.Allocate<unsigned>(Bits.size());
std::uninitialized_copy(Bits.begin(), Bits.end(), New);		std::uninitialized_copy(Bits.begin(), Bits.end(), New);
return FoldingSetNodeIDRef(New, Bits.size());		return FoldingSetNodeIDRef(New, Bits.size());
[289 lines not shown]