Download Raw Diff

Details

Reviewers

rnk
zturner
amccarth
k8stone
max-kudr

Commits

rGdab898f9ab62: [Windows] Fix limit on command line size
rGd4020ef7c474: [Windows] Fix limit on command line size

Summary

Documentation on CreateProcessW states that maximal size of command line
is 32767 characters including ternimation null character. In the
function llvm::sys::commandLineFitsWithinSystemLimits this limit was set
to 32768. As a result if command line was exactly 32768 characters long,
a response file was not created and CreateProcessW was called with
too long command line.

Diff Detail

Repository: rG LLVM Github Monorepo

Unit TestsFailed

	Time	Test
	500 ms	linux > MemorySanitizer-X86_64.MemorySanitizer-X86_64::Unknown Unit Message ("")
	230 ms	linux > MemorySanitizer-lld-X86_64.MemorySanitizer-lld-X86_64::Unknown Unit Message ("")
	530 ms	linux > SanitizerCommon-asan-x86_64-Linux.Linux::Unknown Unit Message ("")
	290 ms	linux > SanitizerCommon-lsan-x86_64-Linux.Linux::Unknown Unit Message ("")
	450 ms	linux > SanitizerCommon-msan-x86_64-Linux.Linux::Unknown Unit Message ("")
		View Full Test Results (10 Failed)

Event Timeline

sepavloff created this revision.Jul 14 2020, 7:22 AM

Herald added a project: Restricted Project. · View Herald TranscriptJul 14 2020, 7:22 AM

Herald added subscribers: llvm-commits, hiraditya. · View Herald Transcript

Harbormaster failed remote builds in B64149: Diff 277825!Jul 14 2020, 8:20 AM

I'm jumping in since rnk is on leave and (I believe) zturner is less focused on these issues than he used to be.

Thanks for tracking down the cause of this bug.

I have some concerns:

We're trying to cut it right up to the hard limit. That seems an unnecessary risk, as the off-by-one bug illustrates. Is there anything wrong with just rounding down to 32,000 and leaving ourselves some wiggle room? Will someone be upset that we used a response file for a _very_ long command that technically would have been a viable command line?

We're constructing a temporary copy of the command line to test if it's short enough and then constructing the actual command line elsewhere, which leaves us open to divergence between when we think the command line will be and what it actually ends up being. I'd rather measure the actual command line, especially if we're going to run right up to the limit.

We're checking to make sure that the number of UTF-8 code units is below the limit, but the limit is actually in terms of WCHARs. Fortunately, I think that discrepancy works in our favor, since the number of WCHARs will always be smaller than the number of UTF-8 code units. But it underscores the points I'm making above: the precise limit probably isn't an issue and we're measuring a proxy command line rather than the actual command line.

Please consider these suggestions:

Round down to 32,000 to leave us wiggle room.

Have flattenWindowsCommandLine return a wstring rather than a string. This will reduce the chance of the proxy string we measure differing from the actual command string we're issuing, and it's already a Windows-specific function. This will require a change in Execute (in the same source file), which is currently doing the conversion from UTF-8 to UTF-16.

Add a short command to the test for commandLineFitsWithinSystemLimits. Right now, the test only checks a humongous command line. (The test is in llvm\unittests\Support\CommandLineTest.cpp.)

amccarth added a reviewer: amccarth.Jul 14 2020, 1:39 PM

Addressed reviewer's notes

Thank you for your detailed feedback!

Round down to 32,000 to leave us wiggle room.

As there is no requirement to use response files as rarely as possible, the choice of the limit is an implementation detail. Having some wiggle room is a good idea in this case.

Have flattenWindowsCommandLine return a wstring rather than a string. This will reduce the chance of the proxy string we measure differing from the actual command string we're issuing, and it's already a Windows-specific function.

Agree, this is more robust solution.

Add a short command to the test for commandLineFitsWithinSystemLimits.

Now the patch is more than a change of a constant, so a test is necessary.

Harbormaster failed remote builds in B64321: Diff 278117!Jul 15 2020, 3:09 AM

Fixed typo

Harbormaster failed remote builds in B64326: Diff 278132!Jul 15 2020, 4:20 AM

Thanks! I know it was more work, but I think unifying the command line generation to work the same in both cases will avoid future surprises.

This revision is now accepted and ready to land.Jul 15 2020, 8:15 AM

Fixed unit test

Harbormaster completed remote builds in B64376: Diff 278221.Jul 15 2020, 10:09 AM

Closed by commit rGd4020ef7c474: [Windows] Fix limit on command line size (authored by sepavloff). · Explain WhyJul 21 2020, 3:34 AM

This revision was automatically updated to reflect the committed changes.

This commit is now failing LLDB Windows buildbot builds http://lab.llvm.org:8011/builders/lldb-x64-windows-ninja/builds/17702. Please fix or revert.

E:\build_slave\lldb-x64-windows-ninja\llvm-project\lldb\source\Host\windows\ProcessLauncherWindows.cpp(53): error C2679: binary '=': no operator found which takes a right-hand operand of type 'llvm::ErrorOr<std::wstring>' (or there is no acceptable conversion)

In D83772#2164685, @max-kudr wrote:
This commit is now failing LLDB Windows buildbot builds http://lab.llvm.org:8011/builders/lldb-x64-windows-ninja/builds/17702. Please fix or revert.
E:\build_slave\lldb-x64-windows-ninja\llvm-project\lldb\source\Host\windows\ProcessLauncherWindows.cpp(53): error C2679: binary '=': no operator found which takes a right-hand operand of type 'llvm::ErrorOr<std::wstring>' (or there is no acceptable conversion)

I'll fix that.

Thank you! @sepavloff reverted it in revision ac0edc55887b6961ad90fd51f349c9587b1a8a7a

max-kudr removed a subscriber: max-kudr.Jul 21 2020, 3:06 PM

sepavloff reopened this revision.Jul 21 2020, 11:27 PM

This revision is now accepted and ready to land.Jul 21 2020, 11:27 PM

Added changes for LLDB

Herald added a project: Restricted Project. · View Herald TranscriptJul 21 2020, 11:28 PM

Herald added a subscriber: lldb-commits. · View Herald Transcript

@k8stone @max-kudr Could you please review the changes for LLDB?

Harbormaster failed remote builds in B65187: Diff 279709!Jul 21 2020, 11:42 PM

LGTM

Closed by commit rGdab898f9ab62: [Windows] Fix limit on command line size (authored by sepavloff). · Explain WhyJul 22 2020, 10:26 PM

This revision was automatically updated to reflect the committed changes.

Diff 278132

llvm/include/llvm/Support/Program.h

Show First 20 Lines • Show All 212 Lines • ▼ Show 20 Lines	#endif
/// Print a command argument, and optionally quote it.		/// Print a command argument, and optionally quote it.
void printArg(llvm::raw_ostream &OS, StringRef Arg, bool Quote);		void printArg(llvm::raw_ostream &OS, StringRef Arg, bool Quote);

#if defined(_WIN32)		#if defined(_WIN32)
/// Given a list of command line arguments, quote and escape them as necessary		/// Given a list of command line arguments, quote and escape them as necessary
/// to build a single flat command line appropriate for calling CreateProcess		/// to build a single flat command line appropriate for calling CreateProcess
/// on		/// on
/// Windows.		/// Windows.
std::string flattenWindowsCommandLine(ArrayRef<StringRef> Args);		ErrorOr<std::wstring> flattenWindowsCommandLine(ArrayRef<StringRef> Args);
#endif		#endif
}		}
}		}

#endif		#endif

llvm/lib/Support/Windows/Program.inc

Show First 20 Lines • Show All 183 Lines • ▼ Show 20 Lines	static bool Execute(ProcessInfo &PI, StringRef Program,
// ".exe" ourselves.		// ".exe" ourselves.
SmallString<64> ProgramStorage;		SmallString<64> ProgramStorage;
if (!sys::fs::exists(Program))		if (!sys::fs::exists(Program))
Program = Twine(Program + ".exe").toStringRef(ProgramStorage);		Program = Twine(Program + ".exe").toStringRef(ProgramStorage);

// Windows wants a command line, not an array of args, to pass to the new		// Windows wants a command line, not an array of args, to pass to the new
// process. We have to concatenate them all, while quoting the args that		// process. We have to concatenate them all, while quoting the args that
// have embedded spaces (or are empty).		// have embedded spaces (or are empty).
std::string Command = flattenWindowsCommandLine(Args);		auto Result = flattenWindowsCommandLine(Args);
		if (std::error_code ec = Result.getError()) {
		SetLastError(ec.value());
		MakeErrMsg(ErrMsg, std::string("Unable to convert command-line to UTF-16"));
		return false;
		}
		std::wstring Command = *Result;

// The pointer to the environment block for the new process.		// The pointer to the environment block for the new process.
std::vector<wchar_t> EnvBlock;		std::vector<wchar_t> EnvBlock;

if (Env) {		if (Env) {
// An environment block consists of a null-terminated block of		// An environment block consists of a null-terminated block of
// null-terminated strings. Convert the array of environment variables to		// null-terminated strings. Convert the array of environment variables to
// an environment block by concatenating them.		// an environment block by concatenating them.
▲ Show 20 Lines • Show All 65 Lines • ▼ Show 20 Lines	static bool Execute(ProcessInfo &PI, StringRef Program,
SmallVector<wchar_t, MAX_PATH> ProgramUtf16;		SmallVector<wchar_t, MAX_PATH> ProgramUtf16;
if (std::error_code ec = sys::windows::widenPath(Program, ProgramUtf16)) {		if (std::error_code ec = sys::windows::widenPath(Program, ProgramUtf16)) {
SetLastError(ec.value());		SetLastError(ec.value());
MakeErrMsg(ErrMsg,		MakeErrMsg(ErrMsg,
std::string("Unable to convert application name to UTF-16"));		std::string("Unable to convert application name to UTF-16"));
return false;		return false;
}		}

SmallVector<wchar_t, MAX_PATH> CommandUtf16;		std::vector<wchar_t> CommandUtf16(Command.size() + 1, 0);
if (std::error_code ec = windows::UTF8ToUTF16(Command, CommandUtf16)) {		std::copy(Command.begin(), Command.end(), CommandUtf16.begin());
SetLastError(ec.value());
MakeErrMsg(ErrMsg,
std::string("Unable to convert command-line to UTF-16"));
return false;
}

BOOL rc = CreateProcessW(ProgramUtf16.data(), CommandUtf16.data(), 0, 0,		BOOL rc = CreateProcessW(ProgramUtf16.data(), CommandUtf16.data(), 0, 0,
TRUE, CREATE_UNICODE_ENVIRONMENT,		TRUE, CREATE_UNICODE_ENVIRONMENT,
EnvBlock.empty() ? 0 : EnvBlock.data(), 0, &si,		EnvBlock.empty() ? 0 : EnvBlock.data(), 0, &si,
&pi);		&pi);
DWORD err = GetLastError();		DWORD err = GetLastError();

// Regardless of whether the process got created or not, we are done with		// Regardless of whether the process got created or not, we are done with
// the handles we created for it to inherit.		// the handles we created for it to inherit.
▲ Show 20 Lines • Show All 81 Lines • ▼ Show 20 Lines	while (!Arg.empty()) {
Arg = Arg.drop_front(FirstNonBackslash + 1);		Arg = Arg.drop_front(FirstNonBackslash + 1);
}		}

Result.push_back('"');		Result.push_back('"');
return Result;		return Result;
}		}

namespace llvm {		namespace llvm {
std::string sys::flattenWindowsCommandLine(ArrayRef<StringRef> Args) {		ErrorOr<std::wstring> sys::flattenWindowsCommandLine(ArrayRef<StringRef> Args) {
std::string Command;		std::string Command;
for (StringRef Arg : Args) {		for (StringRef Arg : Args) {
if (argNeedsQuotes(Arg))		if (argNeedsQuotes(Arg))
Command += quoteSingleArg(Arg);		Command += quoteSingleArg(Arg);
else		else
Command += Arg;		Command += Arg;

Command.push_back(' ');		Command.push_back(' ');
}		}

return Command;		SmallVector<wchar_t, MAX_PATH> CommandUtf16;
		if (std::error_code ec = windows::UTF8ToUTF16(Command, CommandUtf16))
		return ec;

		return std::wstring(CommandUtf16.begin(), CommandUtf16.end());
}		}

ProcessInfo sys::Wait(const ProcessInfo &PI, unsigned SecondsToWait,		ProcessInfo sys::Wait(const ProcessInfo &PI, unsigned SecondsToWait,
bool WaitUntilChildTerminates, std::string *ErrMsg,		bool WaitUntilChildTerminates, std::string *ErrMsg,
Optional<ProcessStatistics> *ProcStat) {		Optional<ProcessStatistics> *ProcStat) {
assert(PI.Pid && "invalid pid to wait on, process not started?");		assert(PI.Pid && "invalid pid to wait on, process not started?");
assert((PI.Process && PI.Process != INVALID_HANDLE_VALUE) &&		assert((PI.Process && PI.Process != INVALID_HANDLE_VALUE) &&
"invalid process handle to wait on, process not started?");		"invalid process handle to wait on, process not started?");
▲ Show 20 Lines • Show All 128 Lines • ▼ Show 20 Lines	llvm::sys::writeFileWithEncoding(StringRef FileName, StringRef Contents,
if (OS.has_error())		if (OS.has_error())
return make_error_code(errc::io_error);		return make_error_code(errc::io_error);

return EC;		return EC;
}		}

bool llvm::sys::commandLineFitsWithinSystemLimits(StringRef Program,		bool llvm::sys::commandLineFitsWithinSystemLimits(StringRef Program,
ArrayRef<StringRef> Args) {		ArrayRef<StringRef> Args) {
// The documented max length of the command line passed to CreateProcess.		// The documentation on CreateProcessW states that the size of the argument
static const size_t MaxCommandStringLength = 32768;		// lpCommandLine must not be greater than 32767 characters, including the
		// Unicode terminating null character. We use smaller value to reduce risk
		// of getting invalid command line due to unaccounted factors.
		static const size_t MaxCommandStringLength = 32000;
SmallVector<StringRef, 8> FullArgs;		SmallVector<StringRef, 8> FullArgs;
FullArgs.push_back(Program);		FullArgs.push_back(Program);
FullArgs.append(Args.begin(), Args.end());		FullArgs.append(Args.begin(), Args.end());
std::string Result = flattenWindowsCommandLine(FullArgs);		auto Result = flattenWindowsCommandLine(FullArgs);
return (Result.size() + 1) <= MaxCommandStringLength;		assert(!Result.getError());
		return (Result->size() + 1) <= MaxCommandStringLength;
}		}
}		}

llvm/unittests/Support/CommandLineTest.cpp

Show First 20 Lines • Show All 757 Lines • ▼ Show 20 Lines	for (auto *S : cl::getRegisteredSubcommands()) {
}		}
}		}
cl::ResetCommandLineParser();		cl::ResetCommandLineParser();
}		}

TEST(CommandLineTest, ArgumentLimit) {		TEST(CommandLineTest, ArgumentLimit) {
std::string args(32 * 4096, 'a');		std::string args(32 * 4096, 'a');
EXPECT_FALSE(llvm::sys::commandLineFitsWithinSystemLimits("cl", args.data()));		EXPECT_FALSE(llvm::sys::commandLineFitsWithinSystemLimits("cl", args.data()));
		std::string args2(256, 'a');
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for variable 'args2' [readability-identifier-naming] not useful Lint: Pre-merge checks: clang-tidy: warning: invalid case style for variable 'args2' [readability-identifier-naming]…
		EXPECT_TRUE(llvm::sys::commandLineFitsWithinSystemLimits("cl", args2.data()));
		if (Triple(sys::getProcessTriple()).isOSWindows()) {
		// We use 32000 as a limit for command line length.
		std::string long_arg(32000, 'b');
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for variable 'long_arg' [readability-identifier-naming] not useful Lint: Pre-merge checks: clang-tidy: warning: invalid case style for variable 'long_arg' [readability-identifier-naming]…
		EXPECT_TRUE(
		llvm::sys::commandLineFitsWithinSystemLimits("cl", long_arg.data()));
		long_arg += 'b';
		EXPECT_FALSE(
		llvm::sys::commandLineFitsWithinSystemLimits("cl", long_arg.data()));
		}
}		}

TEST(CommandLineTest, ResponseFileWindows) {		TEST(CommandLineTest, ResponseFileWindows) {
if (!Triple(sys::getProcessTriple()).isOSWindows())		if (!Triple(sys::getProcessTriple()).isOSWindows())
return;		return;

StackOption<std::string, cl::list<std::string>> InputFilenames(		StackOption<std::string, cl::list<std::string>> InputFilenames(
cl::Positional, cl::desc("<input files>"), cl::ZeroOrMore);		cl::Positional, cl::desc("<input files>"), cl::ZeroOrMore);
▲ Show 20 Lines • Show All 1,102 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[Windows] Fix limit on command line size
ClosedPublic

Details

Diff Detail

Unit TestsFailed

Event Timeline

Revision Contents

Diff 278132

llvm/include/llvm/Support/Program.h

llvm/lib/Support/Windows/Program.inc

llvm/unittests/Support/CommandLineTest.cpp

This is an archive of the discontinued LLVM Phabricator instance.

[Windows] Fix limit on command line sizeClosedPublic

Details

Diff Detail

Unit TestsFailed

Event Timeline

Revision Contents

Diff 278132

llvm/include/llvm/Support/Program.h

llvm/lib/Support/Windows/Program.inc

llvm/unittests/Support/CommandLineTest.cpp

[Windows] Fix limit on command line size
ClosedPublic