This is an archive of the discontinued LLVM Phabricator instance.

llvm/include/llvm/Analysis/Utils/TFUtils.h
142–145	It seems over-complicated to pass the FinalReward flag to writeRawTensorsAsFeatureLists and treat that case separately. How about making RawLogData ready to print (the reward vector is <0, 0, ..., 0, reward>) so that we don't need to change writeRawTensorsAsFeatureLists? Basically, the user is supposed to make sure the data in RawLogData is ready to print, and writeRawTensorsAsFeatureLists only takes care of the printing format.

We can either:
- logReward(0) in each step
- have a function in Logger called logFinalReward(T value) or overwriteFinalReward(T value), which overwrites the value in RawLogData.back()->back()

or:
- don't logReward(0) in each step
- have a function in Logger called logFinalReward(T value) that fills RawLogData.back() with 0 and puts the reward in the last position; we can tell the length by looking at the feature length already logged in RawLogData
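For illustration only, the second option in the comment above (no per-step logReward(0), with logFinalReward backfilling zeros) might look roughly like the sketch below. ReadyToPrintRewards and noteStep are hypothetical names invented for this sketch, and the reward column is modeled as typed floats rather than the raw bytes that Logger actually stores.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for the reward column of RawLogData. The real Logger
// stores raw bytes per feature; typed floats keep the sketch short.
class ReadyToPrintRewards {
public:
  // Call once per step, mirroring one logged feature record.
  void noteStep() { ++Steps; }

  // Skip logReward(0) during the episode, then fill in zeros for every step
  // except the last, which receives the actual reward.
  void logFinalReward(float Value) {
    assert(Steps > 0 && "no steps were logged");
    Rewards.assign(Steps, 0.0f);
    Rewards.back() = Value;
  }

  const std::vector<float> &rewards() const { return Rewards; }

private:
  std::size_t Steps = 0;
  std::vector<float> Rewards;
};
```

With this shape the printing code needs no special case, at the cost of storing one reward entry per step.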

This revision now requires changes to proceed. (Oct 18 2020, 10:10 PM)

mtrofin marked an inline comment as done. (Oct 18 2020, 10:15 PM)

mtrofin added inline comments.

llvm/include/llvm/Analysis/Utils/TFUtils.h
142–145	That would mean keeping around a potentially large array containing 0 (except the last value). I would rather not add memory overhead if it can be avoided, and the extra consideration in code isn't that hard to grok.
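As a rough sketch of the trade-off described in this reply, the zeros can be generated while printing instead of being stored. writeRewards and renderRewards are hypothetical helpers written for this illustration, not the LLVM functions, and they print plain numbers rather than FeatureList protobufs.

```cpp
#include <cassert>
#include <cstddef>
#include <ostream>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical helper, not the LLVM function: prints one reward per record.
// Stored holds either one reward per record or a single final reward.
void writeRewards(std::ostream &OS, const std::vector<float> &Stored,
                  std::size_t NumberOfRecords) {
  if (NumberOfRecords == 0 || Stored.empty())
    return;
  const bool FinalReward = Stored.size() == 1;
  assert(FinalReward || Stored.size() == NumberOfRecords);

  std::size_t Next = 0;
  // Write all but the last value. A final reward is preceded by zeros that
  // are generated here rather than stored.
  for (std::size_t I = 0; I + 1 < NumberOfRecords; ++I) {
    if (FinalReward)
      OS << 0.0f << '\n';
    else
      OS << Stored[Next++] << '\n';
  }
  OS << Stored[Next] << '\n';
}

// Convenience wrapper so the result can be inspected as a string.
std::string renderRewards(const std::vector<float> &Stored,
                          std::size_t NumberOfRecords) {
  std::ostringstream OS;
  writeRewards(OS, Stored, NumberOfRecords);
  return OS.str();
}
```

Here the stored reward data stays at a single element regardless of episode length; the price is the extra branch in the printing loop.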

yundiqian accepted this revision. (Oct 18 2020, 10:41 PM)

yundiqian added inline comments.

llvm/include/llvm/Analysis/Utils/TFUtils.h
142–145	Since it's for development mode, I guess it is debatable whether it is worth adding code complexity in exchange for better efficiency. IMO it is not worth it, but I'm open to either option.

This revision is now accepted and ready to land. (Oct 18 2020, 10:41 PM)

This revision was landed with ongoing or failed builds. (Oct 19 2020, 8:49 AM)

Closed by commit rGd454328ea885: [ML] Add final reward logging facility. (authored by mtrofin).

This revision was automatically updated to reflect the committed changes.

mtrofin marked 2 inline comments as done.

mtrofin added a commit: rGd454328ea885: [ML] Add final reward logging facility.

Revision Contents

Path	Size
llvm/include/llvm/Analysis/Utils/TFUtils.h	5 lines
llvm/lib/Analysis/TFUtils.cpp	55 lines
llvm/unittests/Analysis/TFUtilsTest.cpp	45 lines

Diff 299071

llvm/include/llvm/Analysis/Utils/TFUtils.h

@@ ... @@ Logger(const std::vector<LoggedFeatureSpec> &FeatureSpecs,
         RawLogData(FeatureSpecs.size() + IncludeReward),
         IncludeReward(IncludeReward) {}

   template <typename T> void logReward(T Value) {
     assert(IncludeReward);
     logTensorValue(RawLogData.size() - 1, &Value);
   }

+  template <typename T> void logFinalReward(T Value) {
+    assert(RawLogData.back().empty());
+    logReward(Value);
+  }

   template <typename T>
   void logTensorValue(size_t FeatureID, const T *Value, size_t Size = 1) {
     const char *Start = reinterpret_cast<const char *>(Value);
     const char *End = Start + sizeof(T) * Size;
     RawLogData[FeatureID].insert(RawLogData[FeatureID].end(), Start, End);
   }

   void print(raw_ostream &OS);

llvm/lib/Analysis/TFUtils.cpp

@@ ... @@ void writeTensorValues(raw_ostream &OutFile, const char *TensorData,
   for (size_t I = 0; I < ElemCount; ++I) {
     if (I > 0)
       OutFile << ", ";
     OutFile << TypedData[I];
   }
   OutFile << "]";
 }

-/// Untyped implementation of the API above.
+/// Write a list of tensors as a sequence of TensorFlow FeatureList protobufs.
+/// The tensors are assumed to be stored contiguously, in row-major format,
+/// in the TensorData buffer. Each tensor has the shape given by Spec. The
+/// feature name in the output is either the provided LoggingName, if
+/// specified, otherwise it's the name of the tensor (as given by Spec).
 void writeRawTensorsAsFeatureLists(raw_ostream &OutFile,
                                    const Logger::LoggedFeatureSpec &LoggedSpec,
-                                   const char *TensorData, size_t TensorCount) {
+                                   const char *TensorData, size_t TensorCount,
+                                   bool FinalReward = false) {
   const char *FieldName = "<invalid>";
   std::function<void(const char *)> ValueWriter;
   const auto &Spec = LoggedSpec.Spec;
   // The 'Feature' protobuf only has 3 possible fields: float_list,
   // int64_list, or bytes_list, so we capture int32 values as int64. We don't
   // support any other types.
   if (Spec.isElementType<int64_t>()) {
     FieldName = "int64_list";
@@ ... (18 lines not shown) ... @@ void writeRawTensorsAsFeatureLists(raw_ostream &OutFile,

   OutFile << " feature_list: {\n";
   OutFile << " key: "
           << "\""
           << (LoggedSpec.LoggingName ? *LoggedSpec.LoggingName : Spec.name())
           << "\" ";
   OutFile << "value: {\n";
   size_t TensorByteSize = Spec.getElementCount() * Spec.getElementByteSize();
-  for (const char *P = TensorData,
-                  *E = TensorData + TensorByteSize * TensorCount;
-       P < E; P += TensorByteSize) {
+  auto WriteFeatureProto = [&](const char *P) {
     OutFile << " feature: { " << FieldName << ": { value: ";
     ValueWriter(P);
     OutFile << " } }\n";
+  };
+
+  const char *CurrentTensor = TensorData;
+  static int64_t Zero = 0;
+  // Write all but the last value. If this is the final reward, don't increment
+  // the CurrentTensor, and just write 0.
+  for (size_t I = 0; I < TensorCount - 1; ++I) {
+    if (FinalReward)
+      WriteFeatureProto(reinterpret_cast<const char *>(&Zero));
+    else {
+      WriteFeatureProto(CurrentTensor);
+      CurrentTensor += TensorByteSize;
+    }
   }
+
+  WriteFeatureProto(CurrentTensor);
+
   OutFile << " }\n";
   OutFile << " }\n";
 }
-
-/// Write a list of tensors as a sequence of TensorFlow FeatureList protobufs.
-/// The tensors are assumed to be stored contiguously, in row-major format,
-/// in the TensorData buffer. Each tensor has the shape given by Spec. The
-/// feature name in the output is either the provided LoggingName, if
-/// specified, otherwise it's the name of the tensor (as given by Spec).
-template <typename T>
-void writeTensorsAsFeatureLists(raw_ostream &OutFile,
-                                const Logger::LoggedFeatureSpec &Spec,
-                                const T *TensorData, size_t TensorCount) {
-  writeRawTensorsAsFeatureLists(
-      OutFile, Spec, reinterpret_cast<const char *>(TensorData), TensorCount);
-}
 } // namespace

 namespace llvm {
 class EvaluationResultImpl {
 public:
   EvaluationResultImpl(size_t OutputSize)
       : OutputSize(OutputSize), Output(OutputSize){};
@@ ... @@ if (RawLogData.empty())
     return;
   if (RawLogData[0].empty())
     return;
   size_t Tensor0Size = FeatureSpecs[0].Spec.getElementCount() *
                        FeatureSpecs[0].Spec.getElementByteSize();
   size_t NumberOfRecords = RawLogData[0].size() / Tensor0Size;
   if (NumberOfRecords == 0)
     return;
+  size_t RewardSize =
+      RewardSpec.getElementCount() * RewardSpec.getElementByteSize();
+  size_t NumberOfRewards = RawLogData.back().size() / RewardSize;

   OS << "feature_lists: {\n";
   for (size_t I = 0; I < FeatureSpecs.size(); ++I)
-    writeTensorsAsFeatureLists(OS, FeatureSpecs[I], RawLogData[I].data(),
-                               NumberOfRecords);
+    writeRawTensorsAsFeatureLists(OS, FeatureSpecs[I], RawLogData[I].data(),
+                                  NumberOfRecords);

   if (IncludeReward)
-    writeTensorsAsFeatureLists(OS, {RewardSpec, None}, RawLogData.back().data(),
-                               NumberOfRecords);
+    writeRawTensorsAsFeatureLists(OS, {RewardSpec, None},
+                                  RawLogData.back().data(), NumberOfRecords,
+                                  NumberOfRewards == 1);

   OS << "}\n";
 }
 #endif // defined(LLVM_HAVE_TF_API)

llvm/unittests/Analysis/TFUtilsTest.cpp

@@ ... @@ feature_list: {
  }
  }
 }
 )";
   std::string Result;
   raw_string_ostream OS(Result);
   L.print(OS);
   EXPECT_EQ(Result, Expected);
 }
\ No newline at end of file
+
+TEST(TFUtilsTest, LoggerFinalReward) {
+  std::vector<Logger::LoggedFeatureSpec> Features;
+  Features.push_back({TensorSpec::createSpec<float>("the_float", {1}), None});
+  Features.push_back({TensorSpec::createSpec<int64_t>("the_int", {1}), None});
+
+  auto Rewards = TensorSpec::createSpec<float>("reward", {1});
+  Logger L(Features, Rewards, true);
+  for (size_t I = 0; I < 3; ++I) {
+    float F = static_cast<float>(I);
+    L.logTensorValue(0, &F);
+    L.logTensorValue(1, &I);
+  }
+  L.logFinalReward<float>(3.14);
+  const auto *Expected = R"(feature_lists: {
+ feature_list: {
+ key: "the_float" value: {
+ feature: { float_list: { value: [0.000000e+00] } }
+ feature: { float_list: { value: [1.000000e+00] } }
+ feature: { float_list: { value: [2.000000e+00] } }
+ }
+ }
+ feature_list: {
+ key: "the_int" value: {
+ feature: { int64_list: { value: [0] } }
+ feature: { int64_list: { value: [1] } }
+ feature: { int64_list: { value: [2] } }
+ }
+ }
+ feature_list: {
+ key: "reward" value: {
+ feature: { float_list: { value: [0.000000e+00] } }
+ feature: { float_list: { value: [0.000000e+00] } }
+ feature: { float_list: { value: [3.140000e+00] } }
+ }
+ }
+}
+)";
+  std::string Result;
+  raw_string_ostream OS(Result);
+  L.print(OS);
+  EXPECT_EQ(Result, Expected);
+}

[ML] Add final reward logging facility. (Closed, Public)
