This is an archive of the discontinued LLVM Phabricator instance.

[lld] Teach LLD how to parse complete linker scripts
AbandonedPublic

Authored by rafaelauler on Oct 17 2014, 5:02 PM.

Download Raw Diff

Details

Reviewers

ruiu
Bigcheese
shankarke
• shankar.easwaran

Summary

This patch does *not* implement any semantic actions, but it is a first step to
teach LLD how to read complete linker scripts. The additional linker scripts
statements whose parsing is now supported are:

SEARCH_DIR directive
SECTIONS directive
Symbol definitions inside SECTIONS including PROVIDE and PROVIDE_HIDDEN
C-like expressions used in many places in linker scripts
Input to output sections mapping
The goal of this patch was guided towards completely parsing a default GNU ld
linker script and the linker script used to link the FreeBSD kernel. Thus, it
also adds a test case based on the default linker script used in GNU ld for
x86_64 ELF targets.

Note: Sorry for the size of this patch. I started with the goal of incrementally
developing this parser but ultimately failed. On the other side, the logic of the
parser is kind of repetitive, as one would expect, so this patch does not feature
lots of new things.

Diff Detail

Event Timeline

rafaelauler updated this revision to Diff 15107.Oct 17 2014, 5:02 PM

rafaelauler retitled this revision from to [lld] Teach LLD how to parse complete linker scripts.

rafaelauler updated this object.

rafaelauler edited the test plan for this revision. (Show Details)

rafaelauler added reviewers: Bigcheese, shankarke.

rafaelauler added subscribers: ruiu, • rafael, Unknown Object (MLST).

I'll address a comment from Rui:

"ruiu added a comment.Via Web · Fri, Oct 17, 9:28 PM
Because this is a large patch, it'll take some time to review.
Let me send out my first comment in the meantime.

Inline Comments
include/lld/ReaderWriter/LinkerScript.h
402
Is there any reason to have these member functions to mutate object's fields? It seems to me that we can make the constructor to take all these values. If we do so, we can handle all BinOps as const, which helps readers to understand the code.

The same comments are applied to all other classes representing ast nodes."

Sure, I agree. I'll try to eliminate these setters as much as possible and upload a new patch.

Could you add test cases that test for invalid linker scripts as well ?

lib/ReaderWriter/LinkerScript.cpp
973–975	can you convert the assert messages to appropriate error messages that can be displayed to the user ?

This revision now requires changes to proceed.Oct 17 2014, 7:29 PM

Hi Shankar,

Thanks for your input. I agree and will update the patch according your suggestions.

I changed the AST nodes creation according to Rui's suggestion.
I converted some asserts to error messages, according to Shankar's suggestion. There are still some asserts left that are only to check code sanity and are not appropriate for error messages.
I added test cases that exercises parsing errors (incomplete linker scripts) according to Shankar's suggestion.
I also changed the expression parsing functions to eliminate duplicated code and make it simpler.

LGTM

emaste added a subscriber: emaste.Oct 22 2014, 5:49 AM

I'll review this patch today. Please hold on for a while. Thanks.

No problem, Rui, I was waiting for your feedback too. Thanks for putting in time to see this.

ruiu added inline comments.Oct 22 2014, 4:46 PM

include/lld/ReaderWriter/LinkerScript.h
322	Remove three blank lines above.
333	Is constant always unsigned?
392	sort
467	Can you make this enum class?
478	Add an example to the comment. My understanding of this class is it is representing ".text" or "SORT(.text)" in SECTIONS { .x: { .text } .y { SORT(.text) } }
535	example here.
567	example here
605	Or, instead of adding example here, a link to the documentation is better? For this, https://sourceware.org/binutils/docs/ld/Overlay-Description.html#Overlay-Description
615	Remove
675	In this patch you pass only 1 to this function. Do you really need to look ahead arbitrary number of tokens? The buffer management looks a bit too complicated to me.
756	You want to mention that identifier and "(" were already read.
793	I think we don't usually use \p notation.
lib/ReaderWriter/LinkerScript.cpp
85	sort
96	I think there's an implicit conversion from std::error_code to llvm::ErrorOr. So you can just write return std::errc::io_error
99	return res;
134	You can write else if (c >= 'A' && c <= 'F') res += c - 'A' + 10; else return llvm::ErrorOr<uint64_t>(std::make_error_code(std::errc::io_error));
147	This needs to be case insensitive.
156	Instead of using two boolean values, you can define variable multiplier and set 1024 for K or 1024*1024 for M.
171	Huh, I didn't know that the linker script supports this way of writing a number. Good to know.
176	startswith_lower
190	Use early return here. if (res.getError()) return res;
194	and multiply by the multiplier
211	return '0' <= c && c <= '9'
253	You want to write multiple cases in one line. Or if expression may be shorter.
325	I think we need a function something like drop(StringRef &s, int n) that (destructively) drops n characters from s and returns them.
456	This line should be before default and case '"': case '\'':
469	This expression (_buffer.startswith("0x") && _buffer.size() > 2 && canContinueNumber(_buffer[2]) seems redundant, because if a string starts with "0x", it will always satisfy canStartNumber(_buffer[0])
552	Sort
748	return 1;
758	Did you forget to add the trailing "("?
762	Ditto
860	remove space before \n
1002	This can be if (peek(1)._kind == Token::l_paren) return parseFunctionCall(); Symbol *sym = ... ... return sym;
1123	s/LHS/lhs/
1141	s/RHS/rhs/
1149	lhs
1334	Remove this if
1361	Move consumeToken()s after this switch statement.
1589	remove

ruiu added inline comments.Oct 22 2014, 4:46 PM

include/lld/ReaderWriter/LinkerScript.h
70	Sort in asciii-betical order.
120	It should be done in a different patch, but these can* member functions have nothing to do with the Lexer class (they don't use member variables of the class etc). These should be non-member function.
157	Sort.

Hi Rui,

I appreciate the thorough review, thanks! I implemented your suggestions and will upload a new patch now.

include/lld/ReaderWriter/LinkerScript.h
70	Done
120	I agree, I will submit another patch soon.
157	Done
322	Done.
333	Added the following comment to explain this: / A constant value is stored as unsigned because it represents absolute / values. We represent negative numbers by composing the unary '-' operator /// with a constant. I also added the UnaryMinus class to represent negative numbers and properly updated the parser, thanks for pointing out.
392	Done
467	Done
478	Done
535	Done
567	Done
605	Done
615	Done
675	I agree. I changed this algorithm to only peek at the next token. I originally wrote this function to perform arbitrary look ahead because I was not sure if the linker script grammar would need it. Currently, I believe we won't need this in the future, so I modified this function to always peek the next token, which is a much simpler problem. If we ever come across the need to look ahead more than 1 token, I'll resubmit this complete algorithm again.
756	Sorry, I'm not sure if I understood your observation here, you see, parseFunctionCall() is responsible for consuming this entire expression, including identifier and left paren.
793	No problem, removed.
lib/ReaderWriter/LinkerScript.cpp
85	Done
96	Thanks for the tip!
99	Fixed
134	Indeed, thanks, updated it.
147	OK. I don't think that StringSwitch has something like StartsWithLower, so I just added another case.
156	Fair, changed it.
176	Fixed
190	Done
194	OK
211	Fixed
253	In fact I did with multiple cases, but clang-format undid it :-) will fix it.
325	Good idea. Done.
456	Done
469	Fixed.
552	Done
748	Done
758	Right, thanks
762	Fixed
860	Fixed
1002	Fixed
1123	Fixed
1141	Fixed
1149	Fixed
1334	Done
1361	Done
1589	Removed

Implemented Rui's suggestions.

Note: Sorry for the size of this patch. I started with the goal of incrementally
developing this parser but ultimately failed. On the other side, the logic of the
parser is kind of repetitive, as one would expect, so this patch does not feature
lots of new things.

It's totally OK (and I think normal) to go back and split the history into incremental pieces after the fact if you are unhappy with the patch size. That way, you can still get the same high-quality code review that you would have gotten by incrementally developing it, which is most of the benefit of the incremental approach.

As a sanity check, I would also recommend running this over all the linker scripts in the linux kernel and the linker scripts from a typical embedded application. If you haven't already, definitely take a read through http://lists.cs.uiuc.edu/pipermail/llvmdev/2012-December/057421.html

Also, to test your "dump" functionality, you should verify that the various projects still work if you replace the real scripts with "dump"ed scripts.

In test/LinkerScript/sections.test, it looks like you directly used the tool output as the expected output for the test. This doesn't give any confidence in the correctness by itself; if you haven't already done so, definitely try to build some code using the dumped script in place of the real script as a sanity check.

Also, please add links (or at least mention) any resources you used when developing this. What did you use as your "langref"? In the future, somebody is going to have to debug this code and they need to be able to determine if it is doing the right thing (or what the right thing is supposed to be).

lib/ReaderWriter/LinkerScript.cpp
479–489	Don't use doxygen comments inside the function. Here and elsewhere.
test/LinkerScript/expr-precedence.test
28–33	Having the token dump together with the AST dump is really gross. Could you commit a separate patch that enhances linker-script-test to have two different modes and then clean up these test cases?
test/LinkerScript/missing-operand.test
20	FileCheck has some special functionality for checking diagnostics that avoids the need to hard-code absolute line numbers. You should use it: http://llvm.org/docs/CommandGuide/FileCheck.html#filecheck-expressions Also, it should be pretty easy to do caret diagnostics.

Hi Sean, thanks for your input, I didn't know your write-up about linker scripts, it's great!

I'll check the dumped scripts for correct behavior. I put in a comment the reference I used to implement the parser (see LinkerScript.h:692); it's the GNU manual, e.g. https://sourceware.org/binutils/docs/ld/SECTIONS.html#SECTIONS.

I'll work on your suggestions.

Addressed Sean's concerns and tested the dump'ed scripts. This required adjusting some cosmetic issues to make dump'ed scripts 100% parseable by GNU ld. Also added other missing features to parse other kernel linker scripts: unary negation, fill expressions that can be indefinitely large ( =0x909090909090909090909090) and parsing symbol assignments outside SECTIONS.

Sanity checks performed: I tested SPEC userland programs linked by GNU ld, using the linker script dump'ed by this parser, and everything went fine.
I then tested linking the FreeBSD kernel with a dump'ed linker script, installed the new kernel and booted it, everything went fine.

Successfully parsed all linker scripts from the FreeBSD kernel, except ldscript.mips.cfe and ldscript.mips.octeon, which requires the PHDRS command (not implemented yet).

Things not tested: embedded scenarios, since this parser currently does not support the MEMORY command, common in these scenarios.

shankarke requested changes to this revision.Oct 25 2014, 7:45 PM

shankarke edited edge metadata.

shankarke added inline comments.

include/lld/ReaderWriter/LinkerScript.h
44–70	All of the above need to have a prefix.Also you may need to sort all the enumerated constants like others to be consistent. http://llvm.org/docs/CodingStandards.html#name-types-functions-variables-and-enumerators-properly
275	const StringRef => StringRef everywhere.
294–337	remove const StringRef, StringRef is const already
334	explicit Constant ? There are other classes that would need explicit as well.
398–410	There are enums which are not classes, and some enums which are plain enums. We would need to be consistent. Rename the enumeration names as documented in the convention. http://llvm.org/docs/CodingStandards.html#name-types-functions-variables-and-enumerators-properly
430–435	Can you add comment on whats the depth of the ternary condition thats supported as well as an example like how you document other commands.
432	explicit again.
462–463	look at previous comments on enums.
655–665	From your tests, do you see any that uses Overlay command ?
lib/ReaderWriter/LinkerScript.cpp
91–138	Could you move all the above functions to lld/Support ? These functions may be useful for even command line parsing for example to support --section-start as well as for other flavors.
307–311	How will this perform ? For large linker scripts drop_front is going to be repeatedly called for every token.
600–605	move this inside the class itself.
777–780	move this inside the class itself.

This revision now requires changes to proceed.Oct 25 2014, 7:45 PM

Hi Shankar,
Glad you made a second round of suggestions, thanks. I answered some of your concerns below, but will also work on your suggestions and update the patch soon.

Best regards,
Rafael Auler

include/lld/ReaderWriter/LinkerScript.h
44–70	I just kept the coding style I found in this file, whose original author is not me. For me, it's no problem to convert this to the official LLVM naming convention. This enum is already sorted according to Rui's suggestion, by using the ASCII table order. Correct me if I'm wrong, but I think we can drop the prefix here, according to official LLVM naming rules. If the enum is inside a class, the prefix is not necessary.
275	You're right! Thanks for spotting this!
334	I agree, will do, thanks.
398–410	I guess the criteria was to use enum classes when the enum is defined outside a class, like the one Rui pointed out. Correct me if I'm wrong, but I think this enum already follows the LLVM standard -- when the enum is inside a class, we may drop the prefix, i.e.: And instead of O_And.
430–435	Will do.
655–665	None. Do you think it's better to remove this, for now?
lib/ReaderWriter/LinkerScript.cpp
307–311	This should be inlined, in fact, it was just a refactoring suggested by Rui that would enable us to write simpler code in the next lines. You're right though, it will be called a lot. On the other hand, I don't think that this code is bad-performing: substr() will just update the size of in the StringRef object and drop_front() will directly compute a new StringRef pointer and return it (pointer arithmetic) and update size as well. But I can profile this and discover if this is a limitation.
600–605	Should we leave these as virtual anchors in the C++ code or is this not a concern for the LLD project?

shankarke added inline comments.Oct 25 2014, 8:50 PM

include/lld/ReaderWriter/LinkerScript.h
44–70	We are following a mixed pattern here, for some we use a kw_ to start the enumeration with, there are _(underscore patterns) and non underscore patterns, I am not sure on what we want to rename but I think it would be nice if we are consistent in naming the enumerations ?
655–665	leave it for now, at the time of Layout we could say this command is not supported :)
lib/ReaderWriter/LinkerScript.cpp
600–605	Leave them as virtual anchors still.

Seems this patch is good enough to land. We may be able to improve it even more but that can be done in a different patch.

As you mentioned, this patch was too large. It could have been split into small incremental patches, each adding syntactic element one by one. This patch was fortunately not risky, but if it was and we couldn't agree that this should land, you might have wasted your time. So please start with a small patch next time. Thanks!

include/lld/ReaderWriter/LinkerScript.h
44–70	Seems only keyword (e.g. "align" or "as_needed") starts with kw_ prefix. That looks fine and this code follows the coding style. I don't see non-keyword (e.g. = or +) needs prefix.
432	This constructor has three parameters so explicit keyword is not needed.

A couple nits, but I agree with Rui that this is good to land.

lib/ReaderWriter/LinkerScript.cpp
609–615	fwiw, my favorite pattern for doing this "intersperse with commas" operation is: for (int i = 0, e = _args.size(); i != e; ++i) { if (i) os << ", "; _args[i]->dump(os); } It's a bit cleaner than the pattern you're currently using.
1573–1577	You use this simpler pattern elsewhere: for (int i = 0; i != numParen; ++i) if (!expectAndConsume(Token::r_paren, "expected )")) return nullptr; also, just above you use this more complicated if-while pattern.

Thanks Sean and Rui for your code review.

lib/ReaderWriter/LinkerScript.cpp
609–615	Nice one, will use it.
1573–1577	Makes sense, I will change this code to use the simpler pattern

LGTM

Committed revision 221126.

rafaelauler abandoned this revision.Nov 2 2014, 8:21 PM

Revision Contents

Path

Size

include/

lld/

ReaderWriter/

LinkerScript.h

715 lines

lib/

ReaderWriter/

LinkerScript.cpp

1627 lines

test/

LinkerScript/

expr-precedence.test

29 lines

incomplete-ternary.test

21 lines

missing-entry-symbol.test

17 lines

missing-input-file-name.test

21 lines

missing-input-sections.test

23 lines

missing-operand.test

20 lines

missing-output-section-name.test

21 lines

missing-symbol.test

20 lines

sections.test

618 lines

Diff 15233

include/lld/ReaderWriter/LinkerScript.h

Show All 28 Lines
namespace lld {		namespace lld {
namespace script {		namespace script {
class Token {		class Token {
public:		public:
enum Kind {		enum Kind {
unknown,		unknown,
eof,		eof,
identifier,		identifier,
		number,
libname,		libname,
comma,		comma,
		colon,
		semicolon,
l_paren,		l_paren,
r_paren,		r_paren,
		l_brace,
		r_brace,
		question,
		exclaim,
		exclaimequal,
		equal,
		equalequal,
		plus,
		plusequal,
		minus,
		minusequal,
		star,
		starequal,
		slash,
		slashequal,
		amp,
		ampequal,
		pipe,
		pipeequal,
		less,
		lessless,
		lesslessequal,
		greater,
		greatergreater,
		greatergreaterequal,
		lessequal,
		greaterequal,
		ruiuUnsubmitted Not Done Reply Inline Actions Sort in asciii-betical order. ruiu: Sort in asciii-betical order.
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Done rafaelauler: Done
		shankarkeUnsubmitted Not Done Reply Inline Actions All of the above need to have a prefix.Also you may need to sort all the enumerated constants like others to be consistent. http://llvm.org/docs/CodingStandards.html#name-types-functions-variables-and-enumerators-properly shankarke: All of the above need to have a prefix.Also you may need to sort all the enumerated constants…
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions I just kept the coding style I found in this file, whose original author is not me. For me, it's no problem to convert this to the official LLVM naming convention. This enum is already sorted according to Rui's suggestion, by using the ASCII table order. Correct me if I'm wrong, but I think we can drop the prefix here, according to official LLVM naming rules. If the enum is inside a class, the prefix is not necessary. rafaelauler: I just kept the coding style I found in this file, whose original author is not me. For me…
		shankarkeUnsubmitted Not Done Reply Inline Actions We are following a mixed pattern here, for some we use a kw_ to start the enumeration with, there are _(underscore patterns) and non underscore patterns, I am not sure on what we want to rename but I think it would be nice if we are consistent in naming the enumerations ? shankarke: We are following a mixed pattern here, for some we use a kw_ to start the enumeration with…
		ruiuUnsubmitted Not Done Reply Inline Actions Seems only keyword (e.g. "align" or "as_needed") starts with kw_ prefix. That looks fine and this code follows the coding style. I don't see non-keyword (e.g. = or +) needs prefix. ruiu: Seems only keyword (e.g. "align" or "as_needed") starts with kw_ prefix. That looks fine and…
kw_entry,		kw_entry,
kw_group,		kw_group,
kw_output_format,		kw_output_format,
kw_output_arch,		kw_output_arch,
kw_as_needed		kw_as_needed,
		kw_search_dir,
		kw_sections,
		kw_hidden,
		kw_provide,
		kw_provide_hidden,
		kw_overlay,
		kw_discard,
		kw_at,
		kw_align,
		kw_align_with_input,
		kw_subalign,
		kw_exclude_file,
		kw_sort_by_name,
		kw_sort_by_alignment,
		kw_sort_by_init_priority,
		kw_sort_none,
		kw_keep,
		kw_only_if_ro,
		kw_only_if_rw
};		};

Token() : _kind(unknown) {}		Token() : _kind(unknown) {}
Token(StringRef range, Kind kind) : _range(range), _kind(kind) {}		Token(StringRef range, Kind kind) : _range(range), _kind(kind) {}

void dump(raw_ostream &os) const;		void dump(raw_ostream &os) const;

StringRef _range;		StringRef _range;
Kind _kind;		Kind _kind;
};		};

class Lexer {		class Lexer {
public:		public:
explicit Lexer(std::unique_ptr<MemoryBuffer> mb)		explicit Lexer(std::unique_ptr<MemoryBuffer> mb) : _buffer(mb->getBuffer()) {
: _buffer(mb->getBuffer()) {
_sourceManager.AddNewSourceBuffer(std::move(mb), llvm::SMLoc());		_sourceManager.AddNewSourceBuffer(std::move(mb), llvm::SMLoc());
}		}

void lex(Token &tok);		void lex(Token &tok);

const llvm::SourceMgr &getSourceMgr() const { return _sourceManager; }		const llvm::SourceMgr &getSourceMgr() const { return _sourceManager; }

private:		private:
		bool canStartNumber(char c) const;
		bool canContinueNumber(char c) const;
bool canStartName(char c) const;		bool canStartName(char c) const;
bool canContinueName(char c) const;		bool canContinueName(char c) const;
		ruiuUnsubmitted Not Done Reply Inline Actions It should be done in a different patch, but these can* member functions have nothing to do with the Lexer class (they don't use member variables of the class etc). These should be non-member function. ruiu: It should be done in a different patch, but these can* member functions have nothing to do with…
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions I agree, I will submit another patch soon. rafaelauler: I agree, I will submit another patch soon.
void skipWhitespace();		void skipWhitespace();

Token _current;		Token _current;
/// \brief The current buffer state.		/// \brief The current buffer state.
StringRef _buffer;		StringRef _buffer;
// Lexer owns the input files.		// Lexer owns the input files.
llvm::SourceMgr _sourceManager;		llvm::SourceMgr _sourceManager;
};		};

		/// All linker scripts commands derive from this class. High-level, sections and
		/// output section commands are all subclasses of this class.
		/// Examples:
		///
		/// OUTPUT_FORMAT("elf64-x86-64") /* A linker script command */
		/// OUTPUT_ARCH(i386:x86-64) /* Another command */
		/// ENTRY(_start) /* Another command */
		///
		/// SECTIONS /* Another command */
		/// {
		/// .interp : { /* A sections-command */
		/// (.interp) / An output-section-command */
		/// }
		/// }
		///
class Command {		class Command {
public:		public:
enum class Kind { Entry, OutputFormat, OutputArch, Group, };		enum class Kind {
		Entry,
		OutputFormat,
		OutputArch,
		Group,
		SearchDir,
		Sections,
		SymbolAssignment,
		OutputSectionDescription,
		InputSectionFile,
		Overlay,
		ruiuUnsubmitted Not Done Reply Inline Actions Sort. ruiu: Sort.
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Done rafaelauler: Done
		};

Kind getKind() const { return _kind; }		Kind getKind() const { return _kind; }

virtual void dump(raw_ostream &os) const = 0;		virtual void dump(raw_ostream &os) const = 0;

virtual ~Command() {}		virtual ~Command() {}

protected:		protected:
▲ Show 20 Lines • Show All 59 Lines • ▼ Show 20 Lines	struct Path {

Path() : _asNeeded(false), _isDashlPrefix(false) {}		Path() : _asNeeded(false), _isDashlPrefix(false) {}
explicit Path(StringRef path, bool asNeeded = false, bool isLib = false)		explicit Path(StringRef path, bool asNeeded = false, bool isLib = false)
: _path(path), _asNeeded(asNeeded), _isDashlPrefix(isLib) {}		: _path(path), _asNeeded(asNeeded), _isDashlPrefix(isLib) {}
};		};

class Group : public Command {		class Group : public Command {
public:		public:
template <class RangeT>		template <class RangeT> explicit Group(RangeT range) : Command(Kind::Group) {
explicit Group(RangeT range) : Command(Kind::Group) {
std::copy(std::begin(range), std::end(range), std::back_inserter(_paths));		std::copy(std::begin(range), std::end(range), std::back_inserter(_paths));
}		}

static bool classof(const Command *c) { return c->getKind() == Kind::Group; }		static bool classof(const Command *c) { return c->getKind() == Kind::Group; }

void dump(raw_ostream &os) const override {		void dump(raw_ostream &os) const override {
os << "GROUP(";		os << "GROUP(";
bool first = true;		bool first = true;
Show All 15 Lines	public:
const std::vector<Path> &getPaths() const { return _paths; }		const std::vector<Path> &getPaths() const { return _paths; }

private:		private:
std::vector<Path> _paths;		std::vector<Path> _paths;
};		};

class Entry : public Command {		class Entry : public Command {
public:		public:
explicit Entry(StringRef entryName) :		explicit Entry(StringRef entryName)
Command(Kind::Entry), _entryName(entryName) { }		: Command(Kind::Entry), _entryName(entryName) {}

		static bool classof(const Command *c) { return c->getKind() == Kind::Entry; }

		void dump(raw_ostream &os) const override {
		os << "ENTRY(" << _entryName << ")\n";
		}

		const StringRef getEntryName() const { return _entryName; }
		shankarkeUnsubmitted Not Done Reply Inline Actions const StringRef => StringRef everywhere. shankarke: const StringRef => StringRef everywhere.
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions You're right! Thanks for spotting this! rafaelauler: You're right! Thanks for spotting this!

		private:
		StringRef _entryName;
		};

		class SearchDir : public Command {
		public:
		explicit SearchDir(StringRef searchPath)
		: Command(Kind::SearchDir), _searchPath(searchPath) {}

static bool classof(const Command *c) {		static bool classof(const Command *c) {
return c->getKind() == Kind::Entry;		return c->getKind() == Kind::SearchDir;
}		}

void dump(raw_ostream &os) const override {		void dump(raw_ostream &os) const override {
os << "ENTRY(" << _entryName << ")\n";		os << "SEARCH_DIR(" << _searchPath << ")\n";
}		}

const StringRef getEntryName() const {		const StringRef getSearchPath() const { return _searchPath; }
return _entryName;
		private:
		StringRef _searchPath;
		};

		/// Superclass for expression nodes. Linker scripts accept C-like expressions in
		/// many places, such as when defining the value of a symbol or the address of
		/// an output section.
		/// Example:
		///
		/// SECTIONS {
		/// my_symbol = 1 + 1 * 2;
		/// \| \| ^~~~> Constant : Expression
		/// \| \| ^~~~> Constant : Expression
		/// \| \| ^~~~> BinOp : Expression
		/// ^~~~> Constant : Expression
		/// ^~~~> BinOp : Expression (the top-level Expression node)
		/// }
		///
		class Expression {
		public:
		enum class Kind { Constant, Symbol, FunctionCall, BinOp, TernaryConditional };

		Kind getKind() const { return _kind; }

		virtual void dump(raw_ostream &os) const = 0;

		virtual ~Expression() {}
		ruiuUnsubmitted Not Done Reply Inline Actions Remove three blank lines above. ruiu: Remove three blank lines above.
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Done. rafaelauler: Done.

		protected:
		explicit Expression(Kind k) : _kind(k) {}

		private:
		Kind _kind;
		};

		class Constant : public Expression {
		public:
		Constant(uint64_t num) : Expression(Kind::Constant), _num(num) {}
		ruiuUnsubmitted Not Done Reply Inline Actions Is constant always unsigned? ruiu: Is constant always unsigned?
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Added the following comment to explain this: / A constant value is stored as unsigned because it represents absolute / values. We represent negative numbers by composing the unary '-' operator /// with a constant. I also added the UnaryMinus class to represent negative numbers and properly updated the parser, thanks for pointing out. rafaelauler: Added the following comment to explain this: /// A constant value is stored as unsigned because…
		void dump(raw_ostream &os) const override;
		shankarkeUnsubmitted Not Done Reply Inline Actions explicit Constant ? There are other classes that would need explicit as well. shankarke: explicit Constant ? There are other classes that would need explicit as well.
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions I agree, will do, thanks. rafaelauler: I agree, will do, thanks.

		static bool classof(const Expression *c) {
		return c->getKind() == Kind::Constant;
		shankarkeUnsubmitted Not Done Reply Inline Actions remove const StringRef, StringRef is const already shankarke: remove const StringRef, StringRef is const already
}		}

private:		private:
StringRef _entryName;		uint64_t _num;
};		};

		class Symbol : public Expression {
		public:
		Symbol(StringRef name) : Expression(Kind::Symbol), _name(name) {}
		void dump(raw_ostream &os) const override;

		static bool classof(const Expression *c) {
		return c->getKind() == Kind::Symbol;
		}

		private:
		StringRef _name;
		};

		class FunctionCall : public Expression {
		public:
		template <class RangeT>
		FunctionCall(StringRef name, RangeT range)
		: Expression(Kind::FunctionCall), _name(name) {
		std::copy(std::begin(range), std::end(range), std::back_inserter(_args));
		}

		void dump(raw_ostream &os) const override;

		static bool classof(const Expression *c) {
		return c->getKind() == Kind::FunctionCall;
		}

		private:
		StringRef _name;
		std::vector<const Expression *> _args;
		};

		class BinOp : public Expression {
		public:
		enum Operation {
		Sum,
		Sub,
		Mul,
		Div,
		Shl,
		Shr,
		And,
		Or,
		CompareLess,
		CompareGreater,
		CompareLessEqual,
		CompareGreaterEqual,
		CompareEqual,
		CompareDifferent
		ruiuUnsubmitted Not Done Reply Inline Actions sort ruiu: sort
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Done rafaelauler: Done
		};

		BinOp(const Expression LHS, Operation op, const Expression RHS)
		: Expression(Kind::BinOp), _op(op), _LHS(LHS), _RHS(RHS) {}

		void dump(raw_ostream &os) const override;

		static bool classof(const Expression *c) {
		return c->getKind() == Kind::BinOp;
		}

		private:
		Operation _op;
		const Expression *_LHS;
		const Expression *_RHS;
		};

		class TernaryConditional : public Expression {
		shankarkeUnsubmitted Not Done Reply Inline Actions There are enums which are not classes, and some enums which are plain enums. We would need to be consistent. Rename the enumeration names as documented in the convention. http://llvm.org/docs/CodingStandards.html#name-types-functions-variables-and-enumerators-properly shankarke: There are enums which are not classes, and some enums which are plain enums. We would need to…
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions I guess the criteria was to use enum classes when the enum is defined outside a class, like the one Rui pointed out. Correct me if I'm wrong, but I think this enum already follows the LLVM standard -- when the enum is inside a class, we may drop the prefix, i.e.: And instead of O_And. rafaelauler: I guess the criteria was to use enum classes when the enum is defined outside a class, like the…
		public:
		TernaryConditional(const Expression conditional, const Expression trueExpr,
		const Expression *falseExpr)
		: Expression(Kind::TernaryConditional), _conditional(conditional),
		_trueExpr(trueExpr), _falseExpr(falseExpr) {}

		void dump(raw_ostream &os) const override;

		static bool classof(const Expression *c) {
		return c->getKind() == Kind::TernaryConditional;
		}

		private:
		const Expression *_conditional;
		const Expression *_trueExpr;
		const Expression *_falseExpr;
		};

		/// Symbol assignments of the form "symbolname = <expression>" may occur either
		/// as sections-commands or as output-section-commands.
		/// Example:
		///
		shankarkeUnsubmitted Not Done Reply Inline Actions explicit again. shankarke: explicit again.
		ruiuUnsubmitted Not Done Reply Inline Actions This constructor has three parameters so explicit keyword is not needed. ruiu: This constructor has three parameters so explicit keyword is not needed.
		/// SECTIONS {
		/// mysymbol = . /* SymbolAssignment as a sections-command */
		/// .data : {
		shankarkeUnsubmitted Not Done Reply Inline Actions Can you add comment on whats the depth of the ternary condition thats supported as well as an example like how you document other commands. shankarke: Can you add comment on whats the depth of the ternary condition thats supported as well as an…
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Will do. rafaelauler: Will do.
		/// othersymbol = . /* SymbolAssignment as an output-section-command */
		/// }
		///}
		///
		class SymbolAssignment : public Command {
		public:
		enum AssignmentKind { Simple, Sum, Sub, Mul, Div, Shl, Shr, And, Or };
		enum AssignmentVisibility { Normal, Hidden, Provide, ProvideHidden };

		SymbolAssignment(StringRef name, const Expression *expr, AssignmentKind kind,
		AssignmentVisibility visibility)
		: Command(Kind::SymbolAssignment), _expression(expr), _symbol(name),
		_assignmentKind(Simple), _assignmentVisibility(visibility) {}

		static bool classof(const Command *c) {
		return c->getKind() == Kind::SymbolAssignment;
		}

		void dump(raw_ostream &os) const override;

		private:
		const Expression *_expression;
		StringRef _symbol;
		AssignmentKind _assignmentKind;
		AssignmentVisibility _assignmentVisibility;
		};

		/// Encodes how to sort file names or section names that are expanded from
		shankarkeUnsubmitted Not Done Reply Inline Actions look at previous comments on enums. shankarke: look at previous comments on enums.
		/// wildcard operators. This typically occurs in constructs such as
		/// SECTIONS { .data : SORT_BY_NAME()() }}, where the order of the expanded
		/// names is important to determine which sections go first.
		enum WildcardSortMode {
		ruiuUnsubmitted Not Done Reply Inline Actions Can you make this enum class? ruiu: Can you make this enum class?
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Done rafaelauler: Done
		WSM_NA,
		WSM_ByName,
		WSM_ByAlignment,
		WSM_ByNameAndAlignment,
		WSM_ByAlignmentAndName,
		WSM_ByInitPriority,
		WSM_None
		};

		/// Represents either a single input section name or a group of sorted input
		/// section names. They specify which sections to map to a given output section.
		ruiuUnsubmitted Not Done Reply Inline Actions Add an example to the comment. My understanding of this class is it is representing ".text" or "SORT(.text)" in SECTIONS { .x: { .text } .y { SORT(.text) } } ruiu: Add an example to the comment. My understanding of this class is it is representing ".text" or…
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Done rafaelauler: Done
		class InputSection {
		public:
		enum class Kind { InputSectionName, SortedGroup };

		Kind getKind() const { return _kind; }

		virtual void dump(raw_ostream &os) const = 0;

		virtual ~InputSection() {}

		protected:
		explicit InputSection(Kind k) : _kind(k) {}

		private:
		Kind _kind;
		};

		class InputSectionName : public InputSection {
		public:
		InputSectionName(StringRef name, bool excludeFile)
		: InputSection(Kind::InputSectionName), _name(name),
		_excludeFile(excludeFile) {}

		void dump(raw_ostream &os) const override;

		static bool classof(const InputSection *c) {
		return c->getKind() == Kind::InputSectionName;
		}

		private:
		StringRef _name;
		bool _excludeFile;
		};

		class InputSectionSortedGroup : public InputSection {
		public:
		template <class RangeT>
		InputSectionSortedGroup(WildcardSortMode sort, RangeT range)
		: InputSection(Kind::SortedGroup), _sortMode(sort) {
		std::copy(std::begin(range), std::end(range),
		std::back_inserter(_sections));
		}

		void dump(raw_ostream &os) const override;
		WildcardSortMode getSortMode() const { return _sortMode; }

		static bool classof(const InputSection *c) {
		return c->getKind() == Kind::SortedGroup;
		}

		private:
		WildcardSortMode _sortMode;
		std::vector<const InputSection *> _sections;
		};

		/// An output-section-command that maps a series of sections inside a given
		/// file to an output section.
		ruiuUnsubmitted Not Done Reply Inline Actions example here. ruiu: example here.
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Done rafaelauler: Done
		class InputSectionFile : public Command {
		public:
		typedef std::vector<const InputSection *> VectorTy;

		template <class RangeT>
		InputSectionFile(StringRef fileName, StringRef archiveName, bool keep,
		WildcardSortMode fileSortMode,
		WildcardSortMode archiveSortMode, RangeT range)
		: Command(Kind::InputSectionFile), _fileName(fileName),
		_archiveName(archiveName), _keep(keep), _fileSortMode(fileSortMode),
		_archiveSortMode(archiveSortMode) {
		std::copy(std::begin(range), std::end(range),
		std::back_inserter(_sections));
		}

		void dump(raw_ostream &os) const override;

		static bool classof(const Command *c) {
		return c->getKind() == Kind::InputSectionFile;
		}

		private:
		StringRef _fileName;
		StringRef _archiveName;
		bool _keep;
		WildcardSortMode _fileSortMode;
		WildcardSortMode _archiveSortMode;
		VectorTy _sections;
		};

		/// A sections-command to specify which input sections compose a given output
		/// section.
		ruiuUnsubmitted Not Done Reply Inline Actions example here ruiu: example here
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Done rafaelauler: Done
		class OutputSectionDescription : public Command {
		public:
		enum Constraint { C_None, C_OnlyIfRO, C_OnlyIfRW };

		template <class RangeT>
		OutputSectionDescription(StringRef sectionName, const Expression *address,
		const Expression align, const Expression subAlign,
		const Expression at, const Expression fillExpr,
		bool alignWithInput, bool discard,
		Constraint constraint, RangeT range)
		: Command(Kind::OutputSectionDescription), _sectionName(sectionName),
		_address(address), _align(align), _subAlign(subAlign), _at(at),
		_fillExpr(fillExpr), _alignWithInput(alignWithInput), _discard(discard),
		_constraint(constraint) {
		std::copy(std::begin(range), std::end(range),
		std::back_inserter(_outputSectionCommands));
		}

		static bool classof(const Command *c) {
		return c->getKind() == Kind::OutputSectionDescription;
		}

		void dump(raw_ostream &os) const override;

		private:
		StringRef _sectionName;
		const Expression *_address;
		const Expression *_align;
		const Expression *_subAlign;
		const Expression *_at;
		const Expression *_fillExpr;
		bool _alignWithInput;
		bool _discard;
		Constraint _constraint;
		std::vector<const Command *> _outputSectionCommands;
		};

		class Overlay : public Command {
		ruiuUnsubmitted Not Done Reply Inline Actions Or, instead of adding example here, a link to the documentation is better? For this, https://sourceware.org/binutils/docs/ld/Overlay-Description.html#Overlay-Description ruiu: Or, instead of adding example here, a link to the documentation is better? For this, https…
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Done rafaelauler: Done
		public:
		Overlay() : Command(Kind::Overlay) {}

		static bool classof(const Command *c) {
		return c->getKind() == Kind::Overlay;
		}

		void dump(raw_ostream &os) const override { os << "Overlay description\n"; }

		private:
		ruiuUnsubmitted Not Done Reply Inline Actions Remove ruiu: Remove
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Done rafaelauler: Done
		};

		/// Represents all the contents of the SECTIONS {} construct.
		class Sections : public Command {
		public:
		template <class RangeT> Sections(RangeT range) : Command(Kind::Sections) {
		std::copy(std::begin(range), std::end(range),
		std::back_inserter(_sectionsCommands));
		}

		static bool classof(const Command *c) {
		return c->getKind() == Kind::Sections;
		}

		void dump(raw_ostream &os) const override;

		private:
		std::vector<const Command *> _sectionsCommands;
		};

		/// Stores the parse tree of a linker script.
class LinkerScript {		class LinkerScript {
public:		public:
void dump(raw_ostream &os) const {		void dump(raw_ostream &os) const {
for (const Command *c : _commands)		for (const Command *c : _commands)
c->dump(os);		c->dump(os);
}		}

std::vector<Command *> _commands;		std::vector<Command *> _commands;
};		};

		/// Recognizes syntactic constructs of a linker script using a predictive
		/// parser/recursive descent implementation.
		///
		/// Based on the linker script documentation available at
		/// https://sourceware.org/binutils/docs/ld/Scripts.html
class Parser {		class Parser {
public:		public:
explicit Parser(Lexer &lex) : _lex(lex) {}		explicit Parser(Lexer &lex) : _lex(lex), _tokIndex(0), _lookAheadIndex(0) {}

LinkerScript *parse();		LinkerScript *parse();

private:		private:
void consumeToken() { _lex.lex(_tok); }		/// Advances to the next token, either asking the Lexer to lex the next token
		/// or obtaining it from the look ahead buffer.
		void consumeToken() {
		// First check if the look ahead buffer cached the next token
		if (_tokIndex + 1 >= _lookAheadIndex &&
		_lookAheadBuf.size() >= (_tokIndex - _lookAheadIndex + 2)) {
		_tok = _lookAheadBuf[_tokIndex + 1 - _lookAheadIndex];
		shankarkeUnsubmitted Not Done Reply Inline Actions From your tests, do you see any that uses Overlay command ? shankarke: From your tests, do you see any that uses Overlay command ?
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions None. Do you think it's better to remove this, for now? rafaelauler: None. Do you think it's better to remove this, for now?
		shankarkeUnsubmitted Not Done Reply Inline Actions leave it for now, at the time of Layout we could say this command is not supported :) shankarke: leave it for now, at the time of Layout we could say this command is not supported :)
		++_tokIndex;
		return;
		}
		_lex.lex(_tok);
		++_tokIndex;
		}

		/// Returns the nth token that succeeds the current one. If this operation
		/// requires lexing additional tokens, store them in a private buffer.
		const Token &peek(unsigned n) {
		ruiuUnsubmitted Not Done Reply Inline Actions In this patch you pass only 1 to this function. Do you really need to look ahead arbitrary number of tokens? The buffer management looks a bit too complicated to me. ruiu: In this patch you pass only 1 to this function. Do you really need to look ahead arbitrary…
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions I agree. I changed this algorithm to only peek at the next token. I originally wrote this function to perform arbitrary look ahead because I was not sure if the linker script grammar would need it. Currently, I believe we won't need this in the future, so I modified this function to always peek the next token, which is a much simpler problem. If we ever come across the need to look ahead more than 1 token, I'll resubmit this complete algorithm again. rafaelauler: I agree. I changed this algorithm to only peek at the next token. I originally wrote this…
		// Covers the case where the look ahead buffer contains the requested token
		if (_tokIndex + n >= _lookAheadIndex &&
		_lookAheadBuf.size() >= (_tokIndex + n - _lookAheadIndex + 1))
		return _lookAheadBuf[_tokIndex + n - _lookAheadIndex];

		// In this case, the look ahead buffer is filled with old tokens that we are
		// not going to access anymore. Flush it and fill it with new tokens,
		// starting with the current token until the requested token.
		if (_tokIndex - _lookAheadIndex >= _lookAheadBuf.size()) {
		_lookAheadBuf.clear();
		_lookAheadBuf.push_back(_tok);
		for (unsigned i = 1; i <= n; ++i) {
		_lookAheadBuf.push_back(Token());
		_lex.lex(_lookAheadBuf.back());
		}
		_lookAheadIndex = _tokIndex;
		return _lookAheadBuf[n];
		}

		// In this case, the look ahead buffer starts with the current token, but is
		// not large enough to hold the requested future token. We simply expand it.
		if (_tokIndex == _lookAheadIndex) {
		for (unsigned i = 0, e = n - _lookAheadBuf.size() + 1; i != e; ++i) {
		_lookAheadBuf.push_back(Token());
		_lex.lex(_lookAheadBuf.back());
		}
		return _lookAheadBuf[n];
		}

		// This last case covers the corner case where some of the tokens in the
		// buffer are new, but others are old. We discard the old ones, keep the
		// new ones while expanding it to hold the requested token.
		SmallVector<Token, 4> temp(&_lookAheadBuf[_tokIndex - _lookAheadIndex],
		_lookAheadBuf.end());
		_lookAheadBuf.clear();
		_lookAheadBuf.insert(_lookAheadBuf.begin(), temp.begin(), temp.end());
		for (unsigned i = 0, e = n - temp.size() + 1; i != e; ++i) {
		_lookAheadBuf.push_back(Token());
		_lex.lex(_lookAheadBuf.back());
		}
		return _lookAheadBuf[n];
		}

void error(const Token &tok, Twine msg) {		void error(const Token &tok, Twine msg) {
_lex.getSourceMgr()		_lex.getSourceMgr().PrintMessage(
.PrintMessage(llvm::SMLoc::getFromPointer(tok._range.data()),		llvm::SMLoc::getFromPointer(tok._range.data()),
llvm::SourceMgr::DK_Error, msg);		llvm::SourceMgr::DK_Error, msg);
}		}

bool expectAndConsume(Token::Kind kind, Twine msg) {		bool expectAndConsume(Token::Kind kind, Twine msg) {
if (_tok._kind != kind) {		if (_tok._kind != kind) {
error(_tok, msg);		error(_tok, msg);
return false;		return false;
}		}
consumeToken();		consumeToken();
return true;		return true;
}		}

bool isNextToken(Token::Kind kind) { return (_tok._kind == kind); }		bool isNextToken(Token::Kind kind) { return (_tok._kind == kind); }

		// Recursive descent parsing member functions
		// All of these functions consumes tokens and return an AST object,
		// represented by the Command superclass. However, note that not all AST
		// objects derive from Command. For nodes of C-like expressions, used in
		// linker scripts, the superclass is Expression. For nodes that represent
		// input sections that map to an output section, the superclass is
		// InputSection.
		//
		// Example mapping common constructs to AST nodes:
		//
		// SECTIONS { /* Parsed to Sections class */
		// my_symbol = 1 + 1; /* Parsed to SymbolAssignment class */
		// /* ^~~> Parsed to Expression class */
		// .data : { (.data) } / Parsed to OutputSectionDescription class */
		// /* ^~~> Parsed to InputSectionName class */
		// /* ^~~~~> Parsed to InputSectionFile class */
		// }

		// ==== Expression parsing member functions ====

		/// Parse "identifier(param [, param]...)"
		ruiuUnsubmitted Not Done Reply Inline Actions You want to mention that identifier and "(" were already read. ruiu: You want to mention that identifier and "(" were already read.
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Sorry, I'm not sure if I understood your observation here, you see, parseFunctionCall() is responsible for consuming this entire expression, including identifier and left paren. rafaelauler: Sorry, I'm not sure if I understood your observation here, you see, parseFunctionCall() is…
		///
		/// Example:
		///
		/// SECTIONS {
		/// my_symbol = 0x1000 \| ALIGN(other_symbol);
		/// /* ^~~~> parseFunctionCall()
		/// }
		const Expression *parseFunctionCall();

		/// Ensures that the current token is an expression terminal. If it is not,
		/// issues an error to the user and returns false.
		bool expectExprTerminal();

		/// Parse operands of an expression, such as function calls, identifiers or
		/// literal numbers.
		///
		/// Example:
		///
		/// SECTIONS {
		/// my_symbol = 0x1000 \| ALIGN(other_symbol);
		/// ^~~~> parseExprTerminal()
		/// }
		const Expression *parseExprTerminal();

		// As a reference to the precedence of C operators, consult
		// http://en.cppreference.com/w/c/language/operator_precedence

		/// Parse either a single expression operand and returns or parse an entire
		/// expression if its top-level node has a lower or equal precedence than the
		/// indicated.
		const Expression *parseExpression(unsigned precedence = 13);

		/// Parse an operator and its RHS operand, assuming that the LHS was already
		/// consumed. Keep parsing subsequent operator-operand pairs that do not
		/// exceed \p highestPrecedence.
		/// \p LHS points to the left-hand-side operand of this operator
		/// \p maxPrecedence has the maximum operator precedence level that this parse
		ruiuUnsubmitted Not Done Reply Inline Actions I think we don't usually use \p notation. ruiu: I think we don't usually use \p notation.
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions No problem, removed. rafaelauler: No problem, removed.
		/// function is allowed to consume.
		const Expression parseOperatorOperandLoop(const Expression LHS,
		unsigned maxPrecedence);

		/// Parse ternary conditionals such as "(condition)? true: false;". This
		/// operator has precedence level 13 and associates right-to-left.
		const Expression parseTernaryCondOp(const Expression LHS);

		// ==== High-level commands parsing ====

		/// Parse the OUTPUT_FORMAT linker script command.
		/// Example:
		///
		/// OUTPUT_FORMAT(elf64-x86-64,elf64-x86-64,elf64-x86-64)
		/// ^~~~> parseOutputFormat()
		///
OutputFormat *parseOutputFormat();		OutputFormat *parseOutputFormat();

		/// Parse the OUTPUT_ARCH linker script command.
		/// Example:
		///
		/// OUTPUT_ARCH(i386:x86-64)
		/// ^~~~> parseOutputArch()
		///
OutputArch *parseOutputArch();		OutputArch *parseOutputArch();

		/// Parse the GROUP linker script command.
		/// Example:
		///
		/// GROUP ( /lib/x86_64-linux-gnu/libc.so.6
		/// /usr/lib/x86_64-linux-gnu/libc_nonshared.a
		/// AS_NEEDED ( /lib/x86_64-linux-gnu/ld-linux-x86-64.so.2 )
		/// -lm -l:libgcc.a )
		///
Group *parseGroup();		Group *parseGroup();
bool parseAsNeeded(std::vector<Path> &paths);		bool parseAsNeeded(std::vector<Path> &paths);

		/// Parse the ENTRY linker script command.
		/// Example:
		///
		/// ENTRY(init)
		/// ^~~~> parseEntry()
		///
Entry *parseEntry();		Entry *parseEntry();

		/// Parse the SEARCH_DIR linker script command.
		/// Example:
		///
		/// SEARCH_DIR("/usr/x86_64-linux-gnu/lib64");
		/// ^~~~> parseSearchDir()
		///
		SearchDir *parseSearchDir();

		/// Parse "symbol = expression" commands that live inside the
		/// SECTIONS directive.
		/// Example:
		///
		/// SECTIONS {
		/// my_symbol = 1 + 1;
		/// ^~~~> parseExpression()
		/// ^~~~ parseSymbolAssignment()
		/// }
		///
		const SymbolAssignment *parseSymbolAssignment();

		/// Parse "EXCLUDE_FILE" used inside the listing of input section names.
		/// Example:
		///
		/// SECTIONS {
		/// .data : { (EXCLUDE_FILE (crtend.o *otherfile.o) .ctors) }
		/// ^~~~> parseExcludeFile()
		/// }
		///
		ErrorOr<InputSectionFile::VectorTy> parseExcludeFile();

		/// Helper to parse SORT_BY_NAME(, SORT_BY_ALIGNMENT( and SORT_NONE(,
		/// possibly nested. Returns the number of Token::r_paren tokens that need
		/// to be consumed, while \p sortMode is updated with the parsed sort
		/// criteria.
		/// Example:
		///
		/// SORT_BY_NAME(SORT_BY_ALIGNMENT(*))
		/// ^~~~ parseSortDirectives() ~~^
		/// Returns 2, finishes with sortMode = WSM_ByNameAndAlignment
		///
		int parseSortDirectives(WildcardSortMode &sortMode);

		/// Parse a group of input section names that are sorted via SORT* directives.
		/// Example:
		/// SORT_BY_NAME(SORT_BY_ALIGNMENT(data bss))
		const InputSection *parseSortedInputSections();

		/// Parse input section description statements.
		/// Example:
		///
		/// SECTIONS {
		/// .mysection : crt.o(.data* .bss SORT_BY_NAME(name*))
		/// ^~~~ parseInputSectionFile()
		/// }
		const InputSectionFile *parseInputSectionFile();

		/// Parse output section description statements.
		/// Example:
		///
		/// SECTIONS {
		/// .data : { crt.o(.data* .bss SORT_BY_NAME(name*)) }
		/// ^~~~ parseOutputSectionDescription()
		/// }
		const OutputSectionDescription *parseOutputSectionDescription();

		/// Stub for parsing overlay commands. Currently unimplemented.
		const Overlay *parseOverlay();

		/// Parse the SECTIONS linker script command.
		/// Example:
		///
		/// SECTIONS {
		/// ^~~~ parseSections()
		/// . = 0x100000;
		/// .data : { *(.data) }
		/// }
		///
		Sections *parseSections();

private:		private:
		// Owns the entire linker script AST nodes
llvm::BumpPtrAllocator _alloc;		llvm::BumpPtrAllocator _alloc;

		// The top-level/entry-point linker script AST node
LinkerScript _script;		LinkerScript _script;

Lexer &_lex;		Lexer &_lex;

		// Current token being analyzed
Token _tok;		Token _tok;

		// Keep track of the current token index and the index of the first cached
		// token, allowing us to manage a buffer of future tokens and implement
		// lookahead.
		unsigned _tokIndex;
		unsigned _lookAheadIndex;
		llvm::SmallVector<Token, 1> _lookAheadBuf;
};		};
} // end namespace script		} // end namespace script
} // end namespace lld		} // end namespace lld

#endif		#endif

lib/ReaderWriter/LinkerScript.cpp

Show All 12 Lines
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "lld/ReaderWriter/LinkerScript.h"		#include "lld/ReaderWriter/LinkerScript.h"

namespace lld {		namespace lld {
namespace script {		namespace script {
void Token::dump(raw_ostream &os) const {		void Token::dump(raw_ostream &os) const {
switch (_kind) {		switch (_kind) {
#define CASE(name) \		#define CASE(name) \
case Token::name: \		case Token::name: \
os << #name ": "; \		os << #name ": "; \
break;		break;
CASE(eof)		CASE(eof)
CASE(identifier)		CASE(identifier)
CASE(libname)		CASE(libname)
CASE(kw_as_needed)		CASE(kw_as_needed)
		CASE(kw_search_dir)
		CASE(kw_sections)
		CASE(kw_hidden)
		CASE(kw_provide)
		CASE(kw_provide_hidden)
		CASE(kw_overlay)
		CASE(kw_discard)
		CASE(kw_at)
		CASE(kw_align)
		CASE(kw_align_with_input)
		CASE(kw_subalign)
		CASE(kw_exclude_file)
		CASE(kw_sort_by_name)
		CASE(kw_sort_by_alignment)
		CASE(kw_sort_by_init_priority)
		CASE(kw_sort_none)
		CASE(kw_keep)
		CASE(kw_only_if_ro)
		CASE(kw_only_if_rw)
CASE(kw_entry)		CASE(kw_entry)
CASE(kw_group)		CASE(kw_group)
CASE(kw_output_format)		CASE(kw_output_format)
CASE(kw_output_arch)		CASE(kw_output_arch)
CASE(comma)		CASE(comma)
		CASE(colon)
		CASE(semicolon)
		CASE(number)
CASE(l_paren)		CASE(l_paren)
CASE(r_paren)		CASE(r_paren)
		CASE(l_brace)
		CASE(r_brace)
		CASE(question)
		CASE(exclaim)
		CASE(exclaimequal)
		CASE(equal)
		CASE(plus)
		CASE(plusequal)
		CASE(minus)
		CASE(minusequal)
		CASE(star)
		CASE(starequal)
		CASE(slash)
		CASE(slashequal)
		CASE(amp)
		CASE(ampequal)
		CASE(pipe)
		CASE(pipeequal)
		CASE(less)
		CASE(lessless)
		CASE(lesslessequal)
		CASE(greater)
		CASE(greatergreater)
		CASE(greatergreaterequal)
		CASE(equalequal)
		CASE(lessequal)
		CASE(greaterequal)
CASE(unknown)		CASE(unknown)
		ruiuUnsubmitted Not Done Reply Inline Actions sort ruiu: sort
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Done rafaelauler: Done
#undef CASE		#undef CASE
}		}
os << _range << "\n";		os << _range << "\n";
}		}

		static llvm::ErrorOr<uint64_t> parseDecimal(StringRef str) {
		uint64_t res = 0;
		for (auto &c : str) {
		res *= 10;
		if (c < '0' \|\| c > '9')
		return llvm::ErrorOr<uint64_t>(std::make_error_code(std::errc::io_error));
		ruiuUnsubmitted Not Done Reply Inline Actions I think there's an implicit conversion from std::error_code to llvm::ErrorOr. So you can just write return std::errc::io_error ruiu: I think there's an implicit conversion from std::error_code to llvm::ErrorOr. So you can just…
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Thanks for the tip! rafaelauler: Thanks for the tip!
		res += c - '0';
		}
		return llvm::ErrorOr<uint64_t>(res);
		ruiuUnsubmitted Not Done Reply Inline Actions return res; ruiu: return res;
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Fixed rafaelauler: Fixed
		}

		static llvm::ErrorOr<uint64_t> parseOctal(StringRef str) {
		uint64_t res = 0;
		for (auto &c : str) {
		res <<= 3;
		if (c < '0' \|\| c > '7')
		return llvm::ErrorOr<uint64_t>(std::make_error_code(std::errc::io_error));
		res += c - '0';
		}
		return llvm::ErrorOr<uint64_t>(res);
		}

		static llvm::ErrorOr<uint64_t> parseBinary(StringRef str) {
		uint64_t res = 0;
		for (auto &c : str) {
		res <<= 1;
		if (c != '0' && c != '1')
		return llvm::ErrorOr<uint64_t>(std::make_error_code(std::errc::io_error));
		res += c - '0';
		}
		return llvm::ErrorOr<uint64_t>(res);
		}

		static llvm::ErrorOr<uint64_t> parseHex(StringRef str) {
		uint64_t res = 0;
		for (auto &c : str) {
		res <<= 4;
		if (((c < '0' \|\| c > '9') && (c < 'a' \|\| c > 'f') && (c < 'A' \|\| c > 'F')))
		return llvm::ErrorOr<uint64_t>(std::make_error_code(std::errc::io_error));
		if (c >= '0' && c <= '9')
		res += c - '0';
		else if (c >= 'a' && c <= 'f')
		res += c - 'a' + 10;
		else
		ruiuUnsubmitted Not Done Reply Inline Actions You can write else if (c >= 'A' && c <= 'F') res += c - 'A' + 10; else return llvm::ErrorOr<uint64_t>(std::make_error_code(std::errc::io_error)); ruiu: You can write else if (c >= 'A' && c <= 'F') res += c - 'A' + 10; else return llvm…
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Indeed, thanks, updated it. rafaelauler: Indeed, thanks, updated it.
		res += c - 'A' + 10;
		}
		return llvm::ErrorOr<uint64_t>(res);
		}
		shankarkeUnsubmitted Not Done Reply Inline Actions Could you move all the above functions to lld/Support ? These functions may be useful for even command line parsing for example to support --section-start as well as for other flavors. shankarke: Could you move all the above functions to lld/Support ? These functions may be useful for even…

		static llvm::ErrorOr<uint64_t> parseNum(StringRef str) {
		bool suffixK = false;
		bool suffixM = false;
		enum NumKind { decimal, hex, octal, binary };
		NumKind kind = llvm::StringSwitch<NumKind>(str)
		.StartsWith("0x", hex)
		.StartsWith("0", octal)
		.Default(decimal);
		ruiuUnsubmitted Not Done Reply Inline Actions This needs to be case insensitive. ruiu: This needs to be case insensitive.
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions OK. I don't think that StringSwitch has something like StartsWithLower, so I just added another case. rafaelauler: OK. I don't think that StringSwitch has something like StartsWithLower, so I just added another…

		// Parse scale
		if (str.endswith("K")) {
		suffixK = true;
		str = str.drop_back();
		} else if (str.endswith("M")) {
		suffixM = true;
		str = str.drop_back();
		}
		ruiuUnsubmitted Not Done Reply Inline Actions Instead of using two boolean values, you can define variable multiplier and set 1024 for K or 10241024 for M. ruiu:* Instead of using two boolean values, you can define variable multiplier and set 1024 for K or…
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Fair, changed it. rafaelauler: Fair, changed it.

		// Parse type
		if (str.endswith_lower("o")) {
		kind = octal;
		str = str.drop_back();
		} else if (str.endswith_lower("h")) {
		kind = hex;
		str = str.drop_back();
		} else if (str.endswith_lower("d")) {
		kind = decimal;
		str = str.drop_back();
		} else if (str.endswith_lower("b")) {
		kind = binary;
		str = str.drop_back();
		}
		ruiuUnsubmitted Not Done Reply Inline Actions Huh, I didn't know that the linker script supports this way of writing a number. Good to know. ruiu: Huh, I didn't know that the linker script supports this way of writing a number. Good to know.

		llvm::ErrorOr<uint64_t> res(0);
		switch (kind) {
		case hex:
		if (str.startswith("0x"))
		ruiuUnsubmitted Not Done Reply Inline Actions startswith_lower ruiu: startswith_lower
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Fixed rafaelauler: Fixed
		str = str.drop_front(2);
		res = parseHex(str);
		break;
		case octal:
		res = parseOctal(str);
		break;
		case decimal:
		res = parseDecimal(str);
		break;
		case binary:
		res = parseBinary(str);
		break;
		}
		if (!res.getError()) {
		ruiuUnsubmitted Not Done Reply Inline Actions Use early return here. if (res.getError()) return res; ruiu: Use early return here. if (res.getError()) return res;
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Done rafaelauler: Done
		if (suffixK)
		res = res << 10;
		else if (suffixM)
		res = res << 20;
		ruiuUnsubmitted Not Done Reply Inline Actions and multiply by the multiplier ruiu: and multiply by the multiplier
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions OK rafaelauler: OK
		}
		return res;
		}

		bool Lexer::canStartNumber(char c) const {
		switch (c) {
		// Digits
		case '0':
		case '1':
		case '2':
		case '3':
		case '4':
		case '5':
		case '6':
		case '7':
		case '8':
		case '9':
		ruiuUnsubmitted Not Done Reply Inline Actions return '0' <= c && c <= '9' ruiu: return '0' <= c && c <= '9'
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Fixed rafaelauler: Fixed
		return true;
		default:
		return false;
		}
		}

		bool Lexer::canContinueNumber(char c) const {
		switch (c) {
		// Digits
		case '0':
		case '1':
		case '2':
		case '3':
		case '4':
		case '5':
		case '6':
		case '7':
		case '8':
		case '9':
		case 'A':
		case 'B':
		case 'C':
		case 'D':
		case 'E':
		case 'F':
		case 'a':
		case 'b':
		case 'c':
		case 'd':
		case 'e':
		case 'f':
		// Hex marker
		case 'x':
		case 'X':
		// Type suffix
		case 'h':
		case 'H':
		case 'o':
		case 'O':
		// Scale suffix
		case 'M':
		case 'K':
		ruiuUnsubmitted Not Done Reply Inline Actions You want to write multiple cases in one line. Or if expression may be shorter. ruiu: You want to write multiple cases in one line. Or if expression may be shorter.
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions In fact I did with multiple cases, but clang-format undid it :-) will fix it. rafaelauler: In fact I did with multiple cases, but clang-format undid it :-) will fix it.
		return true;
		default:
		return false;
		}
		}

bool Lexer::canStartName(char c) const {		bool Lexer::canStartName(char c) const {
switch (c) {		switch (c) {
case 'A': case 'B': case 'C': case 'D': case 'E': case 'F': case 'G':		case 'A': case 'B': case 'C': case 'D': case 'E': case 'F': case 'G':
case 'H': case 'I': case 'J': case 'K': case 'L': case 'M': case 'N':		case 'H': case 'I': case 'J': case 'K': case 'L': case 'M': case 'N':
case 'O': case 'P': case 'Q': case 'R': case 'S': case 'T': case 'U':		case 'O': case 'P': case 'Q': case 'R': case 'S': case 'T': case 'U':
case 'V': case 'W': case 'X': case 'Y': case 'Z':		case 'V': case 'W': case 'X': case 'Y': case 'Z':
case 'a': case 'b': case 'c': case 'd': case 'e': case 'f': case 'g':		case 'a': case 'b': case 'c': case 'd': case 'e': case 'f': case 'g':
case 'h': case 'i': case 'j': case 'k': case 'l': case 'm': case 'n':		case 'h': case 'i': case 'j': case 'k': case 'l': case 'm': case 'n':
case 'o': case 'p': case 'q': case 'r': case 's': case 't': case 'u':		case 'o': case 'p': case 'q': case 'r': case 's': case 't': case 'u':
case 'v': case 'w': case 'x': case 'y': case 'z':		case 'v': case 'w': case 'x': case 'y': case 'z':
case '_': case '.': case '$': case '/': case '\\':		case '_': case '.': case '$': case '/': case '\\':
		case '*':
return true;		return true;
default:		default:
return false;		return false;
}		}
}		}

bool Lexer::canContinueName(char c) const {		bool Lexer::canContinueName(char c) const {
switch (c) {		switch (c) {
Show All 19 Lines	bool Lexer::canContinueName(char c) const {
default:		default:
return false;		return false;
}		}
}		}

void Lexer::lex(Token &tok) {		void Lexer::lex(Token &tok) {
skipWhitespace();		skipWhitespace();
if (_buffer.empty()) {		if (_buffer.empty()) {
tok = Token(_buffer, Token::eof);		tok = Token(_buffer, Token::eof);
return;		return;
}		}
switch (_buffer[0]) {		switch (_buffer[0]) {
case 0:		case 0:
		shankarkeUnsubmitted Not Done Reply Inline Actions How will this perform ? For large linker scripts drop_front is going to be repeatedly called for every token. shankarke: How will this perform ? For large linker scripts drop_front is going to be repeatedly called…
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions This should be inlined, in fact, it was just a refactoring suggested by Rui that would enable us to write simpler code in the next lines. You're right though, it will be called a lot. On the other hand, I don't think that this code is bad-performing: substr() will just update the size of in the StringRef object and drop_front() will directly compute a new StringRef pointer and return it (pointer arithmetic) and update size as well. But I can profile this and discover if this is a limitation. rafaelauler: This should be inlined, in fact, it was just a refactoring suggested by Rui that would enable…
tok = Token(_buffer.substr(0, 1), Token::eof);		tok = Token(_buffer.substr(0, 1), Token::eof);
_buffer = _buffer.drop_front();		_buffer = _buffer.drop_front();
return;		return;
case '(':		case '(':
tok = Token(_buffer.substr(0, 1), Token::l_paren);		tok = Token(_buffer.substr(0, 1), Token::l_paren);
_buffer = _buffer.drop_front();		_buffer = _buffer.drop_front();
return;		return;
case ')':		case ')':
tok = Token(_buffer.substr(0, 1), Token::r_paren);		tok = Token(_buffer.substr(0, 1), Token::r_paren);
_buffer = _buffer.drop_front();		_buffer = _buffer.drop_front();
return;		return;
		case '{':
		tok = Token(_buffer.substr(0, 1), Token::l_brace);
		_buffer = _buffer.drop_front();
		ruiuUnsubmitted Not Done Reply Inline Actions I think we need a function something like drop(StringRef &s, int n) that (destructively) drops n characters from s and returns them. ruiu: I think we need a function something like drop(StringRef &s, int n) that (destructively)…
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Good idea. Done. rafaelauler: Good idea. Done.
		return;
		case '}':
		tok = Token(_buffer.substr(0, 1), Token::r_brace);
		_buffer = _buffer.drop_front();
		return;
		case '=':
		if (_buffer.startswith("==")) {
		tok = Token(_buffer.substr(0, 2), Token::equalequal);
		_buffer = _buffer.drop_front(2);
		return;
		}
		tok = Token(_buffer.substr(0, 1), Token::equal);
		_buffer = _buffer.drop_front();
		return;
		case '!':
		if (_buffer.startswith("!=")) {
		tok = Token(_buffer.substr(0, 2), Token::exclaimequal);
		_buffer = _buffer.drop_front(2);
		return;
		}
		tok = Token(_buffer.substr(0, 1), Token::exclaim);
		_buffer = _buffer.drop_front();
		return;
case ',':		case ',':
tok = Token(_buffer.substr(0, 1), Token::comma);		tok = Token(_buffer.substr(0, 1), Token::comma);
_buffer = _buffer.drop_front();		_buffer = _buffer.drop_front();
return;		return;
default:		case ';':
// Handle quoted strings. They are treated as identifiers for		tok = Token(_buffer.substr(0, 1), Token::semicolon);
// simplicity.		_buffer = _buffer.drop_front();
if ((_buffer[0] == '\"') \|\| (_buffer[0] == '\'')) {		return;
char c = _buffer[0];		case ':':
		tok = Token(_buffer.substr(0, 1), Token::colon);
		_buffer = _buffer.drop_front();
		return;
		case '&':
		if (_buffer.startswith("&=")) {
		tok = Token(_buffer.substr(0, 2), Token::ampequal);
		_buffer = _buffer.drop_front(2);
		return;
		}
		tok = Token(_buffer.substr(0, 1), Token::amp);
		_buffer = _buffer.drop_front();
		return;
		case '\|':
		if (_buffer.startswith("\|=")) {
		tok = Token(_buffer.substr(0, 2), Token::pipeequal);
		_buffer = _buffer.drop_front(2);
		return;
		}
		tok = Token(_buffer.substr(0, 1), Token::pipe);
		_buffer = _buffer.drop_front();
		return;
		case '+':
		if (_buffer.startswith("+=")) {
		tok = Token(_buffer.substr(0, 2), Token::plusequal);
		_buffer = _buffer.drop_front(2);
		return;
		}
		tok = Token(_buffer.substr(0, 1), Token::plus);
		_buffer = _buffer.drop_front();
		return;
		case '-': {
		if (_buffer.startswith("-=")) {
		tok = Token(_buffer.substr(0, 2), Token::minusequal);
		_buffer = _buffer.drop_front(2);
		return;
		}
		if (!_buffer.startswith("-l")) {
		tok = Token(_buffer.substr(0, 1), Token::minus);
_buffer = _buffer.drop_front();		_buffer = _buffer.drop_front();
auto quotedStringEnd = _buffer.find(c);
if (quotedStringEnd == StringRef::npos \|\| quotedStringEnd == 0)
break;
StringRef word = _buffer.substr(0, quotedStringEnd);
tok = Token(word, Token::identifier);
_buffer = _buffer.drop_front(quotedStringEnd + 1);
return;		return;
}		}
// -l<lib name>		// -l<lib name>
if (_buffer.startswith("-l")) {
_buffer = _buffer.drop_front(2);		_buffer = _buffer.drop_front(2);
StringRef::size_type start = 0;		StringRef::size_type start = 0;
if (_buffer[start] == ':')		if (_buffer[start] == ':')
++start;		++start;
if (!canStartName(_buffer[start]))		if (!canStartName(_buffer[start]))
// Create 'unknown' token.		// Create 'unknown' token.
break;		break;
auto libNameEnd =		auto libNameEnd = std::find_if(_buffer.begin() + start + 1, _buffer.end(),
std::find_if(_buffer.begin() + start + 1, _buffer.end(),
[=](char c) { return !canContinueName(c); });		[=](char c) { return !canContinueName(c); });
StringRef::size_type libNameLen =		StringRef::size_type libNameLen =
std::distance(_buffer.begin(), libNameEnd);		std::distance(_buffer.begin(), libNameEnd);
tok = Token(_buffer.substr(0, libNameLen), Token::libname);		tok = Token(_buffer.substr(0, libNameLen), Token::libname);
_buffer = _buffer.drop_front(libNameLen);		_buffer = _buffer.drop_front(libNameLen);
return;		return;
}		}
/// keyword or identifer.		case '<':
if (!canStartName(_buffer[0]))		if (_buffer.startswith("<<=")) {
		tok = Token(_buffer.substr(0, 3), Token::lesslessequal);
		_buffer = _buffer.drop_front(3);
		return;
		}
		if (_buffer.startswith("<<")) {
		tok = Token(_buffer.substr(0, 2), Token::lessless);
		_buffer = _buffer.drop_front(2);
		return;
		}
		if (_buffer.startswith("<=")) {
		tok = Token(_buffer.substr(0, 2), Token::lessequal);
		_buffer = _buffer.drop_front(2);
		return;
		}
		tok = Token(_buffer.substr(0, 1), Token::less);
		_buffer = _buffer.drop_front();
		return;
		case '>':
		if (_buffer.startswith(">>=")) {
		tok = Token(_buffer.substr(0, 3), Token::greatergreaterequal);
		_buffer = _buffer.drop_front(3);
		return;
		}
		if (_buffer.startswith(">>")) {
		tok = Token(_buffer.substr(0, 2), Token::greatergreater);
		_buffer = _buffer.drop_front(2);
		return;
		}
		if (_buffer.startswith(">=")) {
		tok = Token(_buffer.substr(0, 2), Token::greaterequal);
		_buffer = _buffer.drop_front(2);
		return;
		}
		tok = Token(_buffer.substr(0, 1), Token::greater);
		_buffer = _buffer.drop_front();
		return;
		default:
		// Handle quoted strings. They are treated as identifiers for
		// simplicity.
		if ((_buffer[0] == '\"') \|\| (_buffer[0] == '\'')) {
		ruiuUnsubmitted Not Done Reply Inline Actions This line should be before default and case '"': case '\'': ruiu: This line should be before default and case '"': case '\'':
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Done rafaelauler: Done
		char c = _buffer[0];
		_buffer = _buffer.drop_front();
		auto quotedStringEnd = _buffer.find(c);
		if (quotedStringEnd == StringRef::npos \|\| quotedStringEnd == 0)
break;		break;
auto endIter =		StringRef word = _buffer.substr(0, quotedStringEnd);
std::find_if(_buffer.begin() + 1, _buffer.end(), [=](char c) {		tok = Token(word, Token::identifier);
return !canContinueName(c);		_buffer = _buffer.drop_front(quotedStringEnd + 1);
		return;
		}
		// Handle literal numbers
		if ((_buffer.startswith("0x") && _buffer.size() > 2 &&
		canContinueNumber(_buffer[2])) \|\|
		ruiuUnsubmitted Not Done Reply Inline Actions This expression (_buffer.startswith("0x") && _buffer.size() > 2 && canContinueNumber(_buffer[2]) seems redundant, because if a string starts with "0x", it will always satisfy canStartNumber(_buffer[0]) ruiu: This expression (_buffer.startswith("0x") && _buffer.size() > 2 && canContinueNumber(_buffer…
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Fixed. rafaelauler: Fixed.
		canStartNumber(_buffer[0])) {
		auto endIter = std::find_if(_buffer.begin(), _buffer.end(), [=](char c) {
		return !canContinueNumber(c);
});		});
StringRef::size_type end =		StringRef::size_type end = endIter == _buffer.end()
endIter == _buffer.end() ? StringRef::npos		? StringRef::npos
: std::distance(_buffer.begin(), endIter);		: std::distance(_buffer.begin(), endIter);
if (end == StringRef::npos \|\| end == 0)		if (end == StringRef::npos \|\| end == 0)
break;		break;
StringRef word = _buffer.substr(0, end);		StringRef word = _buffer.substr(0, end);
Token::Kind kind = llvm::StringSwitch<Token::Kind>(word)		tok = Token(word, Token::number);
		_buffer = _buffer.drop_front(end);
		return;
		}
		/// Handle slashes '/', which can be either an operator inside an expression
		/// or the beginning of an identifier
		if (_buffer.startswith("/=")) {
		tok = Token(_buffer.substr(0, 2), Token::slashequal);
		_buffer = _buffer.drop_front(2);
		return;
		silvasUnsubmitted Not Done Reply Inline Actions Don't use doxygen comments inside the function. Here and elsewhere. silvas: Don't use doxygen comments inside the function. Here and elsewhere.
		}
		if (_buffer[0] == '/' && _buffer.size() > 1 &&
		!canContinueName(_buffer[1])) {
		tok = Token(_buffer.substr(0, 1), Token::slash);
		_buffer = _buffer.drop_front();
		return;
		}
		/// Handle stars '*'
		if (_buffer.startswith("*=")) {
		tok = Token(_buffer.substr(0, 2), Token::starequal);
		_buffer = _buffer.drop_front(2);
		return;
		}
		if (_buffer[0] == '*' && _buffer.size() > 1 &&
		!canContinueName(_buffer[1])) {
		tok = Token(_buffer.substr(0, 1), Token::star);
		_buffer = _buffer.drop_front();
		return;
		}
		/// Handle questions '?'
		if (_buffer[0] == '?' && _buffer.size() > 1 &&
		!canContinueName(_buffer[1])) {
		tok = Token(_buffer.substr(0, 1), Token::question);
		_buffer = _buffer.drop_front();
		return;
		}
		/// keyword or identifier.
		if (!canStartName(_buffer[0]))
		break;
		auto endIter = std::find_if(_buffer.begin() + 1, _buffer.end(),
		[=](char c) { return !canContinueName(c); });
		StringRef::size_type end = endIter == _buffer.end()
		? StringRef::npos
		: std::distance(_buffer.begin(), endIter);
		if (end == StringRef::npos \|\| end == 0)
		break;
		StringRef word = _buffer.substr(0, end);
		Token::Kind kind =
		llvm::StringSwitch<Token::Kind>(word)
.Case("OUTPUT_FORMAT", Token::kw_output_format)		.Case("OUTPUT_FORMAT", Token::kw_output_format)
.Case("OUTPUT_ARCH", Token::kw_output_arch)		.Case("OUTPUT_ARCH", Token::kw_output_arch)
.Case("GROUP", Token::kw_group)		.Case("GROUP", Token::kw_group)
.Case("AS_NEEDED", Token::kw_as_needed)		.Case("AS_NEEDED", Token::kw_as_needed)
		.Case("SEARCH_DIR", Token::kw_search_dir)
		.Case("SECTIONS", Token::kw_sections)
		.Case("HIDDEN", Token::kw_hidden)
		.Case("PROVIDE", Token::kw_provide)
		.Case("PROVIDE_HIDDEN", Token::kw_provide_hidden)
		.Case("OVERLAY", Token::kw_overlay)
		.Case("AT", Token::kw_at)
		.Case("ALIGN", Token::kw_align)
		.Case("ALIGN_WITH_INPUT", Token::kw_align_with_input)
		.Case("SUBALIGN", Token::kw_subalign)
		.Case("EXCLUDE_FILE", Token::kw_exclude_file)
		.Case("SORT_BY_NAME", Token::kw_sort_by_name)
		.Case("SORT_BY_ALIGNMENT", Token::kw_sort_by_alignment)
		.Case("SORT_BY_INIT_PRIORITY", Token::kw_sort_by_init_priority)
		.Case("SORT_NONE", Token::kw_sort_none)
		.Case("SORT", Token::kw_sort_by_name)
		.Case("KEEP", Token::kw_keep)
		.Case("ONLY_IF_RO", Token::kw_only_if_ro)
		.Case("ONLY_IF_RW", Token::kw_only_if_rw)
		.Case("/DISCARD/", Token::kw_discard)
		ruiuUnsubmitted Not Done Reply Inline Actions Sort ruiu: Sort
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Done rafaelauler: Done
.Case("ENTRY", Token::kw_entry)		.Case("ENTRY", Token::kw_entry)
.Default(Token::identifier);		.Default(Token::identifier);
tok = Token(word, kind);		tok = Token(word, kind);
_buffer = _buffer.drop_front(end);		_buffer = _buffer.drop_front(end);
return;		return;
}		}
tok = Token(_buffer.substr(0, 1), Token::unknown);		tok = Token(_buffer.substr(0, 1), Token::unknown);
_buffer = _buffer.drop_front();		_buffer = _buffer.drop_front();
}		}

Show All 29 Lines	case '/':
return;		return;
break;		break;
default:		default:
return;		return;
}		}
}		}
}		}

		// Constant functions
		void Constant::dump(raw_ostream &os) const { os << _num; }

		// Symbol functions
		void Symbol::dump(raw_ostream &os) const { os << _name; }

		shankarkeUnsubmitted Not Done Reply Inline Actions move this inside the class itself. shankarke: move this inside the class itself.
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Should we leave these as virtual anchors in the C++ code or is this not a concern for the LLD project? rafaelauler: Should we leave these as virtual anchors in the C++ code or is this not a concern for the LLD…
		shankarkeUnsubmitted Not Done Reply Inline Actions Leave them as virtual anchors still. shankarke: Leave them as virtual anchors still.
		// FunctionCall functions
		void FunctionCall::dump(raw_ostream &os) const {
		os << _name << "(";
		if (unsigned e = _args.size()) {
		_args[0]->dump(os);
		for (unsigned i = 1; i != e; ++i) {
		os << ", ";
		_args[i]->dump(os);
		}
		}
		silvasUnsubmitted Not Done Reply Inline Actions fwiw, my favorite pattern for doing this "intersperse with commas" operation is: for (int i = 0, e = _args.size(); i != e; ++i) { if (i) os << ", "; _args[i]->dump(os); } It's a bit cleaner than the pattern you're currently using. silvas: fwiw, my favorite pattern for doing this "intersperse with commas" operation is: ``` for (int…
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Nice one, will use it. rafaelauler: Nice one, will use it.
		os << ")";
		}

		// BinOp functions
		void BinOp::dump(raw_ostream &os) const {
		os << "(";
		_LHS->dump(os);
		os << " ";
		switch (_op) {
		case Sum:
		os << "+";
		break;
		case Sub:
		os << "-";
		break;
		case Mul:
		os << "*";
		break;
		case Div:
		os << "/";
		break;
		case Shl:
		os << "<<";
		break;
		case Shr:
		os << ">>";
		break;
		case And:
		os << "&";
		break;
		case Or:
		os << "\|";
		break;
		case CompareEqual:
		os << "==";
		break;
		case CompareDifferent:
		os << "!=";
		break;
		case CompareLess:
		os << "<";
		break;
		case CompareGreater:
		os << ">";
		break;
		case CompareLessEqual:
		os << "<=";
		break;
		case CompareGreaterEqual:
		os << ">=";
		break;
		}
		os << " ";
		_RHS->dump(os);
		os << ")";
		}

		// TernaryConditional functions
		void TernaryConditional::dump(raw_ostream &os) const {
		_conditional->dump(os);
		os << " ? ";
		_trueExpr->dump(os);
		os << " : ";
		_falseExpr->dump(os);
		}

		// SymbolAssignment functions
		void SymbolAssignment::dump(raw_ostream &os) const {
		int numParen = 0;

		if (_assignmentVisibility != Normal) {
		switch (_assignmentVisibility) {
		case Hidden:
		os << "HIDDEN(";
		break;
		case Provide:
		os << "PROVIDE(";
		break;
		case ProvideHidden:
		os << "PROVIDE_HIDDEN(";
		break;
		default:
		llvm_unreachable("Unknown visibility");
		}
		++numParen;
		}

		os << _symbol << " ";
		switch (_assignmentKind) {
		case Simple:
		os << "=";
		break;
		case Sum:
		os << "+=";
		break;
		case Sub:
		os << "-=";
		break;
		case Mul:
		os << "*=";
		break;
		case Div:
		os << "/=";
		break;
		case Shl:
		os << "<<=";
		break;
		case Shr:
		os << ">>=";
		break;
		case And:
		os << "&=";
		break;
		case Or:
		os << "\|=";
		break;
		}

		os << " ";
		_expression->dump(os);
		if (numParen)
		os << ")";
		}

		static int dumpSortDirectives(raw_ostream &os, WildcardSortMode sortMode) {
		int numParen = 0;
		switch (sortMode) {
		case WSM_NA:
		break;
		case WSM_ByName:
		os << "SORT_BY_NAME(";
		numParen = 1;
		break;
		ruiuUnsubmitted Not Done Reply Inline Actions return 1; ruiu: return 1;
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Done rafaelauler: Done
		case WSM_ByAlignment:
		os << "SORT_BY_ALIGNMENT(";
		numParen = 1;
		break;
		case WSM_ByInitPriority:
		os << "SORT_BY_INIT_PRIORITY(";
		numParen = 1;
		break;
		case WSM_ByNameAndAlignment:
		os << "SORT_BY_NAME(SORT_BY_ALIGNMENT";
		ruiuUnsubmitted Not Done Reply Inline Actions Did you forget to add the trailing "("? ruiu: Did you forget to add the trailing "("?
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Right, thanks rafaelauler: Right, thanks
		numParen = 2;
		break;
		case WSM_ByAlignmentAndName:
		os << "SORT_BY_ALIGNMENT(SORT_BY_NAME";
		ruiuUnsubmitted Not Done Reply Inline Actions Ditto ruiu: Ditto
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Fixed rafaelauler: Fixed
		numParen = 2;
		break;
		case WSM_None:
		os << "SORT_NONE(";
		numParen = 1;
		break;
		}
		return numParen;
		}

		// InputSectionName functions
		void InputSectionName::dump(raw_ostream &os) const {
		if (_excludeFile)
		os << "EXCLUDE_FILE(";
		os << _name;
		if (_excludeFile)
		os << ")";
		}
		shankarkeUnsubmitted Not Done Reply Inline Actions move this inside the class itself. shankarke: move this inside the class itself.

		// InputSectionSortedGroup functions
		void InputSectionSortedGroup::dump(raw_ostream &os) const {
		int numParen = dumpSortDirectives(os, _sortMode);
		for (auto &secName : _sections) {
		secName->dump(os);
		os << " ";
		}
		for (int i = 0; i < numParen; ++i)
		os << ")";
		}

		// InputSectionFile functions
		void InputSectionFile::dump(raw_ostream &os) const {
		if (_keep)
		os << "KEEP(";
		int numParen = dumpSortDirectives(os, _fileSortMode);
		os << _fileName;
		for (int i = 0; i < numParen; ++i)
		os << ")";
		os << ":";
		numParen = dumpSortDirectives(os, _archiveSortMode);
		os << _archiveName;
		for (int i = 0; i < numParen; ++i)
		os << ")";
		os << "(";
		for (auto &command : _sections) {
		command->dump(os);
		os << " ";
		}
		os << ")";
		if (_keep)
		os << ")";
		}

		// OutputSectionDescription functions
		void OutputSectionDescription::dump(raw_ostream &os) const {
		if (_discard)
		os << "/DISCARD/";
		else
		os << _sectionName;

		if (_address) {
		os << " ";
		_address->dump(os);
		}
		os << " :\n";

		if (_at) {
		os << " AT(";
		_at->dump(os);
		os << ")\n";
		}

		if (_align) {
		os << " ALIGN(";
		_align->dump(os);
		os << ")\n";
		} else if (_alignWithInput) {
		os << " ALIGN_WITH_INPUT\n";
		}

		if (_subAlign) {
		os << " SUBALIGN(";
		_subAlign->dump(os);
		os << ")\n";
		}

		switch (_constraint) {
		case C_None:
		break;
		case C_OnlyIfRO:
		os << "ONLY_IF_RO";
		break;
		case C_OnlyIfRW:
		os << "ONLY_IF_RW";
		break;
		}

		os << " { \n";
		ruiuUnsubmitted Not Done Reply Inline Actions remove space before \n ruiu: remove space before \n
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Fixed rafaelauler: Fixed
		for (auto &command : _outputSectionCommands) {
		os << " ";
		command->dump(os);
		os << "\n";
		}
		os << " }";

		if (_fillExpr) {
		os << " =";
		_fillExpr->dump(os);
		os << ";";
		}
		}

		// Sections functions
		void Sections::dump(raw_ostream &os) const {
		os << "SECTIONS\n{\n";
		for (auto &command : _sectionsCommands) {
		command->dump(os);
		os << "\n";
		}
		os << "}\n";
		}

		// Parser functions
LinkerScript *Parser::parse() {		LinkerScript *Parser::parse() {
// Get the first token.		// Get the first token.
_lex.lex(_tok);		_lex.lex(_tok);
// Parse top level commands.		// Parse top level commands.
while (true) {		while (true) {
switch (_tok._kind) {		switch (_tok._kind) {
case Token::eof:		case Token::eof:
return &_script;		return &_script;
		case Token::semicolon:
		consumeToken();
		break;
case Token::kw_output_format: {		case Token::kw_output_format: {
auto outputFormat = parseOutputFormat();		auto outputFormat = parseOutputFormat();
if (!outputFormat)		if (!outputFormat)
return nullptr;		return nullptr;
_script._commands.push_back(outputFormat);		_script._commands.push_back(outputFormat);
break;		break;
}		}
case Token::kw_output_arch: {		case Token::kw_output_arch: {
auto outputArch = parseOutputArch();		auto outputArch = parseOutputArch();
if (!outputArch)		if (!outputArch)
return nullptr;		return nullptr;
_script._commands.push_back(outputArch);		_script._commands.push_back(outputArch);
break;		break;
}		}
case Token::kw_group: {		case Token::kw_group: {
auto group = parseGroup();		auto group = parseGroup();
if (!group)		if (!group)
return nullptr;		return nullptr;
_script._commands.push_back(group);		_script._commands.push_back(group);
break;		break;
}		}
case Token::kw_as_needed:		case Token::kw_as_needed:
// Not allowed at top level.		// Not allowed at top level.
		error(_tok, "AS_NEEDED not allowed at top level.");
return nullptr;		return nullptr;
case Token::kw_entry: {		case Token::kw_entry: {
Entry *entry = parseEntry();		Entry *entry = parseEntry();
if (!entry)		if (!entry)
return nullptr;		return nullptr;
_script._commands.push_back(entry);		_script._commands.push_back(entry);
break;		break;
}		}
		case Token::kw_search_dir: {
		SearchDir *searchDir = parseSearchDir();
		if (!searchDir)
		return nullptr;
		_script._commands.push_back(searchDir);
		break;
		}
		case Token::kw_sections: {
		Sections *sections = parseSections();
		if (!sections)
		return nullptr;
		_script._commands.push_back(sections);
		break;
		}
default:		default:
// Unexpected.		// Unexpected.
		error(_tok, "Unrecognized token.");
return nullptr;		return nullptr;
}		}
}		}

return nullptr;		return nullptr;
}		}

		const Expression *Parser::parseFunctionCall() {
		assert((_tok._kind == Token::identifier \|\| _tok._kind == Token::kw_align) &&
		"expected function call first tokens");
		std::vector<const Expression *> params;
		StringRef name = _tok._range;

		consumeToken();
		if (!expectAndConsume(Token::l_paren, "expected ("))
		return nullptr;

		if (_tok._kind == Token::r_paren) {
		consumeToken();
		return new (_alloc) FunctionCall(_tok._range, params);
		}

		if (const Expression *firstParam = parseExpression())
		params.push_back(firstParam);
		else
		return nullptr;

		while (_tok._kind == Token::comma) {
		consumeToken();
		if (const Expression *param = parseExpression())
		shankarkeUnsubmitted Not Done Reply Inline Actions can you convert the assert messages to appropriate error messages that can be displayed to the user ? shankarke: can you convert the assert messages to appropriate error messages that can be displayed to the…
		params.push_back(param);
		else
		return nullptr;
		}

		if (!expectAndConsume(Token::r_paren, "expected )"))
		return nullptr;
		return new (_alloc) FunctionCall(name, params);
		}

		bool Parser::expectExprTerminal() {
		if (!(_tok._kind == Token::identifier \|\| _tok._kind == Token::number \|\|
		_tok._kind == Token::kw_align \|\| _tok._kind == Token::l_paren)) {
		error(_tok, "expected symbol, number or left parenthesis.");
		return false;
		}
		return true;
		}

		const Expression *Parser::parseExprTerminal() {
		if (!expectExprTerminal())
		return nullptr;

		switch (_tok._kind) {
		case Token::identifier:
		switch (peek(1)._kind) {
		case Token::l_paren:
		ruiuUnsubmitted Not Done Reply Inline Actions This can be if (peek(1)._kind == Token::l_paren) return parseFunctionCall(); Symbol sym = ... ... return sym; ruiu:* This can be if (peek(1)._kind == Token::l_paren) return parseFunctionCall(); Symbol…
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Fixed rafaelauler: Fixed
		return parseFunctionCall();
		default: {
		Symbol *sym = new (_alloc) Symbol(_tok._range);
		consumeToken();
		return sym;
		}
		}
		break;
		case Token::kw_align:
		return parseFunctionCall();
		case Token::number: {
		auto val = parseNum(_tok._range);
		if (val.getError()) {
		error(_tok, "Unrecognized number constant");
		return nullptr;
		}
		Constant c = new (_alloc) Constant(val);
		consumeToken();
		return c;
		}
		case Token::l_paren: {
		consumeToken();
		const Expression *expr = parseExpression();
		if (!expectAndConsume(Token::r_paren, "expected )"))
		return nullptr;
		return expr;
		}
		default:
		llvm_unreachable("Unknown token");
		}
		}

		static bool TokenToBinOp(const Token &tok, BinOp::Operation &op,
		unsigned &precedence) {
		switch (tok._kind) {
		case Token::star:
		op = BinOp::Mul;
		precedence = 3;
		return true;
		case Token::slash:
		op = BinOp::Div;
		precedence = 3;
		return true;
		case Token::plus:
		op = BinOp::Sum;
		precedence = 4;
		return true;
		case Token::minus:
		op = BinOp::Sub;
		precedence = 4;
		return true;
		case Token::lessless:
		op = BinOp::Shl;
		precedence = 5;
		return true;
		case Token::greatergreater:
		op = BinOp::Shr;
		precedence = 5;
		return true;
		case Token::less:
		op = BinOp::CompareLess;
		precedence = 6;
		return true;
		case Token::greater:
		op = BinOp::CompareGreater;
		precedence = 6;
		return true;
		case Token::lessequal:
		op = BinOp::CompareLessEqual;
		precedence = 6;
		return true;
		case Token::greaterequal:
		op = BinOp::CompareGreaterEqual;
		precedence = 6;
		return true;
		case Token::equalequal:
		op = BinOp::CompareEqual;
		precedence = 7;
		return true;
		case Token::exclaimequal:
		op = BinOp::CompareDifferent;
		precedence = 7;
		return true;
		case Token::amp:
		op = BinOp::And;
		precedence = 8;
		return true;
		case Token::pipe:
		op = BinOp::Or;
		precedence = 10;
		return true;
		default:
		break;
		}
		return false;
		}

		const Expression *Parser::parseExpression(unsigned precedence) {
		assert(precedence <= 13 && "Invalid precedence value");
		if (!expectExprTerminal())
		return nullptr;

		const Expression *expr = parseExprTerminal();
		if (!expr)
		return nullptr;

		BinOp::Operation op;
		unsigned binOpPrecedence = 0;
		if (TokenToBinOp(_tok, op, binOpPrecedence)) {
		if (precedence >= binOpPrecedence)
		return parseOperatorOperandLoop(expr, precedence);
		return expr;
		}

		// Non-binary operators
		if (_tok._kind == Token::question && precedence >= 13)
		return parseOperatorOperandLoop(expr, precedence);
		return expr;
		}

		const Expression Parser::parseOperatorOperandLoop(const Expression LHS,
		ruiuUnsubmitted Not Done Reply Inline Actions s/LHS/lhs/ ruiu: s/LHS/lhs/
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Fixed rafaelauler: Fixed
		unsigned highestPrecedence) {
		assert(highestPrecedence <= 13 && "Invalid precedence value");
		unsigned precedence = 0;
		const Expression *binOp = nullptr;

		while (1) {
		BinOp::Operation op;
		if (!TokenToBinOp(_tok, op, precedence)) {
		if (_tok._kind == Token::question && highestPrecedence >= 13)
		return parseTernaryCondOp(LHS);
		return binOp;
		}

		if (precedence > highestPrecedence)
		return binOp;

		consumeToken();
		const Expression *RHS = parseExpression(precedence - 1);
		ruiuUnsubmitted Not Done Reply Inline Actions s/RHS/rhs/ ruiu: s/RHS/rhs/
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Fixed rafaelauler: Fixed
		if (!RHS)
		return nullptr;
		binOp = new (_alloc) BinOp(LHS, op, RHS);
		LHS = binOp;
		}
		}

		const Expression Parser::parseTernaryCondOp(const Expression LHS) {
		ruiuUnsubmitted Not Done Reply Inline Actions lhs ruiu: lhs
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Fixed rafaelauler: Fixed
		assert(_tok._kind == Token::question && "Expected question mark");

		consumeToken();

		// The ternary conditional operator has right-to-left associativity.
		// To implement this, we allow our children to contain ternary conditional
		// operators themselves (precedence 13).
		const Expression *trueExpr = parseExpression(13);
		if (!trueExpr)
		return nullptr;

		if (!expectAndConsume(Token::colon, "expected :"))
		return nullptr;

		const Expression *falseExpr = parseExpression(13);
		if (!falseExpr)
		return nullptr;

		return new (_alloc) TernaryConditional(LHS, trueExpr, falseExpr);
		}

// Parse OUTPUT_FORMAT(ident)		// Parse OUTPUT_FORMAT(ident)
OutputFormat *Parser::parseOutputFormat() {		OutputFormat *Parser::parseOutputFormat() {
assert(_tok._kind == Token::kw_output_format && "Expected OUTPUT_FORMAT!");		assert(_tok._kind == Token::kw_output_format && "Expected OUTPUT_FORMAT!");
consumeToken();		consumeToken();
if (!expectAndConsume(Token::l_paren, "expected ("))		if (!expectAndConsume(Token::l_paren, "expected ("))
return nullptr;		return nullptr;

if (_tok._kind != Token::identifier) {		if (_tok._kind != Token::identifier) {
▲ Show 20 Lines • Show All 120 Lines • ▼ Show 20 Lines	Entry *Parser::parseEntry() {
}		}
StringRef entryName(_tok._range);		StringRef entryName(_tok._range);
consumeToken();		consumeToken();
if (!expectAndConsume(Token::r_paren, "expected )"))		if (!expectAndConsume(Token::r_paren, "expected )"))
return nullptr;		return nullptr;
return new (_alloc) Entry(entryName);		return new (_alloc) Entry(entryName);
}		}

		// Parse SEARCH_DIR(ident)
		SearchDir *Parser::parseSearchDir() {
		assert(_tok._kind == Token::kw_search_dir && "Expected SEARCH_DIR!");
		consumeToken();
		if (!expectAndConsume(Token::l_paren, "expected ("))
		return nullptr;
		if (_tok._kind != Token::identifier) {
		error(_tok, "expected identifier in SEARCH_DIR");
		return nullptr;
		}
		StringRef searchPath(_tok._range);
		consumeToken();
		if (!expectAndConsume(Token::r_paren, "expected )"))
		return nullptr;
		return new (_alloc) SearchDir(searchPath);
		}

		const SymbolAssignment *Parser::parseSymbolAssignment() {
		assert((_tok._kind == Token::identifier \|\| _tok._kind == Token::kw_hidden \|\|
		_tok._kind == Token::kw_provide \|\|
		_tok._kind == Token::kw_provide_hidden) &&
		"Expected identifier!");
		SymbolAssignment::AssignmentVisibility visibility = SymbolAssignment::Normal;
		SymbolAssignment::AssignmentKind kind;
		int numParen = 0;

		if (_tok._kind == Token::kw_hidden \|\| _tok._kind == Token::kw_provide \|\|
		_tok._kind == Token::kw_provide_hidden) {
		ruiuUnsubmitted Not Done Reply Inline Actions Remove this if ruiu: Remove this if
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Done rafaelauler: Done
		switch (_tok._kind) {
		case Token::kw_hidden:
		visibility = SymbolAssignment::Hidden;
		break;
		case Token::kw_provide:
		visibility = SymbolAssignment::Provide;
		break;
		case Token::kw_provide_hidden:
		visibility = SymbolAssignment::ProvideHidden;
		break;
		default:
		llvm_unreachable("Unknown token");
		}
		++numParen;
		consumeToken();
		if (!expectAndConsume(Token::l_paren, "expected ("))
		return nullptr;
		}

		StringRef name = _tok._range;
		consumeToken();

		// Parse assignment operator (=, +=, -= etc.)
		switch (_tok._kind) {
		case Token::equal:
		kind = SymbolAssignment::Simple;
		consumeToken();
		ruiuUnsubmitted Not Done Reply Inline Actions Move consumeToken()s after this switch statement. ruiu: Move consumeToken()s after this switch statement.
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Done rafaelauler: Done
		break;
		case Token::plusequal:
		kind = SymbolAssignment::Sum;
		consumeToken();
		break;
		case Token::minusequal:
		kind = SymbolAssignment::Sub;
		consumeToken();
		break;
		case Token::starequal:
		kind = SymbolAssignment::Mul;
		consumeToken();
		break;
		case Token::slashequal:
		kind = SymbolAssignment::Div;
		consumeToken();
		break;
		case Token::ampequal:
		kind = SymbolAssignment::And;
		consumeToken();
		break;
		case Token::pipeequal:
		kind = SymbolAssignment::Or;
		consumeToken();
		break;
		case Token::lesslessequal:
		kind = SymbolAssignment::Shl;
		consumeToken();
		break;
		case Token::greatergreaterequal:
		kind = SymbolAssignment::Shr;
		consumeToken();
		break;
		default:
		error(_tok, "unexpected token");
		return nullptr;
		}

		const Expression *expr = nullptr;
		switch (_tok._kind) {
		case Token::number:
		case Token::kw_align:
		case Token::identifier:
		case Token::l_paren:
		expr = parseExpression();
		if (!expr)
		return nullptr;
		break;
		default:
		error(_tok, "unexpected token while parsing assignment value.");
		return nullptr;
		}

		for (int i = 0; i < numParen; ++i)
		if (!expectAndConsume(Token::r_paren, "expected )"))
		return nullptr;

		return new (_alloc) SymbolAssignment(name, expr, kind, visibility);
		}

		llvm::ErrorOr<InputSectionFile::VectorTy> Parser::parseExcludeFile() {
		assert(_tok._kind == Token::kw_exclude_file && "Expected EXCLUDE_FILE!");
		InputSectionFile::VectorTy res;
		consumeToken();

		if (!expectAndConsume(Token::l_paren, "expected ("))
		return llvm::ErrorOr<InputSectionFile::VectorTy>(
		std::make_error_code(std::errc::io_error));

		while (_tok._kind == Token::identifier) {
		res.push_back(new (_alloc) InputSectionName(_tok._range, true));
		consumeToken();
		}

		if (!expectAndConsume(Token::r_paren, "expected )"))
		return llvm::ErrorOr<InputSectionFile::VectorTy>(
		std::make_error_code(std::errc::io_error));
		return llvm::ErrorOr<InputSectionFile::VectorTy>(std::move(res));
		}

		int Parser::parseSortDirectives(WildcardSortMode &sortMode) {
		int numParsedDirectives = 0;
		sortMode = WSM_NA;

		if (_tok._kind == Token::kw_sort_by_name) {
		consumeToken();
		if (!expectAndConsume(Token::l_paren, "expected ("))
		return -1;
		++numParsedDirectives;
		sortMode = WSM_ByName;
		}

		if (_tok._kind == Token::kw_sort_by_init_priority) {
		consumeToken();
		if (!expectAndConsume(Token::l_paren, "expected ("))
		return -1;
		++numParsedDirectives;
		sortMode = WSM_ByInitPriority;
		}

		if (_tok._kind == Token::kw_sort_by_alignment) {
		consumeToken();
		if (!expectAndConsume(Token::l_paren, "expected ("))
		return -1;
		++numParsedDirectives;
		if (sortMode != WSM_ByName)
		sortMode = WSM_ByAlignment;
		else
		sortMode = WSM_ByNameAndAlignment;
		}

		if (numParsedDirectives < 2 && _tok._kind == Token::kw_sort_by_name) {
		consumeToken();
		if (!expectAndConsume(Token::l_paren, "expected ("))
		return -1;
		++numParsedDirectives;
		if (sortMode == WSM_ByAlignment)
		sortMode = WSM_ByAlignmentAndName;
		}

		if (numParsedDirectives < 2 && _tok._kind == Token::kw_sort_by_alignment) {
		consumeToken();
		if (!expectAndConsume(Token::l_paren, "expected ("))
		return -1;
		++numParsedDirectives;
		}

		if (numParsedDirectives == 0 && _tok._kind == Token::kw_sort_none) {
		consumeToken();
		if (!expectAndConsume(Token::l_paren, "expected ("))
		return -1;
		++numParsedDirectives;
		sortMode = WSM_None;
		}

		return numParsedDirectives;
		}

		const InputSection *Parser::parseSortedInputSections() {
		assert((_tok._kind == Token::kw_sort_by_name \|\|
		_tok._kind == Token::kw_sort_by_alignment \|\|
		_tok._kind == Token::kw_sort_by_init_priority \|\|
		_tok._kind == Token::kw_sort_none) &&
		"Expected SORT directives!");

		WildcardSortMode sortMode = WSM_NA;
		int numParen = parseSortDirectives(sortMode);
		if (numParen == -1)
		return nullptr;

		std::vector<const InputSection *> inputSections;

		while (_tok._kind == Token::identifier) {
		inputSections.push_back(new (_alloc) InputSectionName(_tok._range, false));
		consumeToken();
		}

		// Eat "numParen" rparens
		for (int i = 0, e = numParen; i != e; ++i)
		if (!expectAndConsume(Token::r_paren, "expected )"))
		return nullptr;

		return new (_alloc) InputSectionSortedGroup(sortMode, inputSections);
		}

		const InputSectionFile *Parser::parseInputSectionFile() {
		assert((_tok._kind == Token::identifier \|\| _tok._kind == Token::colon \|\|
		_tok._kind == Token::star \|\| _tok._kind == Token::kw_keep \|\|
		_tok._kind == Token::kw_sort_by_name \|\|
		_tok._kind == Token::kw_sort_by_alignment \|\|
		_tok._kind == Token::kw_sort_by_init_priority \|\|
		_tok._kind == Token::kw_sort_none) &&
		"Expected input section first tokens!");
		int numParen = 1;
		bool keep = false;
		WildcardSortMode fileSortMode = WSM_NA;
		WildcardSortMode archiveSortMode = WSM_NA;
		StringRef fileName;
		StringRef archiveName;

		if (_tok._kind == Token::kw_keep) {
		consumeToken();
		if (!expectAndConsume(Token::l_paren, "expected ("))
		return nullptr;
		++numParen;
		keep = true;
		}

		// Input name
		if (_tok._kind != Token::colon) {
		int numParen = parseSortDirectives(fileSortMode);
		if (numParen == -1)
		return nullptr;
		fileName = _tok._range;
		consumeToken();
		if (numParen) {
		while (numParen--)
		if (!expectAndConsume(Token::r_paren, "expected )"))
		return nullptr;
		}
		}
		if (_tok._kind == Token::colon) {
		consumeToken();
		if (_tok._kind == Token::identifier \|\|
		_tok._kind == Token::kw_sort_by_name \|\|
		_tok._kind == Token::kw_sort_by_alignment \|\|
		_tok._kind == Token::kw_sort_by_init_priority \|\|
		_tok._kind == Token::kw_sort_none) {
		int numParen = parseSortDirectives(archiveSortMode);
		if (numParen == -1)
		return nullptr;
		archiveName = _tok._range;
		consumeToken();
		if (numParen) {
		while (numParen--)
		if (!expectAndConsume(Token::r_paren, "expected )"))
		silvasUnsubmitted Not Done Reply Inline Actions You use this simpler pattern elsewhere: for (int i = 0; i != numParen; ++i) if (!expectAndConsume(Token::r_paren, "expected )")) return nullptr; also, just above you use this more complicated if-while pattern. silvas: You use this simpler pattern elsewhere: ``` for (int i = 0; i != numParen; ++i) if (!
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Makes sense, I will change this code to use the simpler pattern rafaelauler: Makes sense, I will change this code to use the simpler pattern
		return nullptr;
		}
		}
		}

		std::vector<const InputSection *> inputSections;

		if (_tok._kind != Token::l_paren)
		return new (_alloc)
		InputSectionFile(fileName, archiveName, keep, fileSortMode,
		archiveSortMode, inputSections);
		;
		ruiuUnsubmitted Not Done Reply Inline Actions remove ruiu: remove
		rafaelaulerAuthorUnsubmitted Not Done Reply Inline Actions Removed rafaelauler: Removed
		consumeToken();

		while (_tok._kind == Token::identifier \|\|
		_tok._kind == Token::kw_exclude_file \|\|
		_tok._kind == Token::kw_sort_by_name \|\|
		_tok._kind == Token::kw_sort_by_alignment \|\|
		_tok._kind == Token::kw_sort_by_init_priority \|\|
		_tok._kind == Token::kw_sort_none) {
		switch (_tok._kind) {
		case Token::kw_exclude_file: {
		auto vec = parseExcludeFile();
		if (vec.getError())
		return nullptr;
		inputSections.insert(inputSections.end(), vec->begin(), vec->end());
		break;
		}
		case Token::star:
		case Token::identifier: {
		inputSections.push_back(new (_alloc)
		InputSectionName(_tok._range, false));
		consumeToken();
		break;
		}
		case Token::kw_sort_by_name:
		case Token::kw_sort_by_alignment:
		case Token::kw_sort_by_init_priority:
		case Token::kw_sort_none: {
		const InputSection *group = parseSortedInputSections();
		if (!group)
		return nullptr;
		inputSections.push_back(group);
		break;
		}
		default:
		llvm_unreachable("Unknown token");
		}
		}

		for (int i = 0; i < numParen; ++i)
		if (!expectAndConsume(Token::r_paren, "expected )"))
		return nullptr;
		return new (_alloc)
		InputSectionFile(fileName, archiveName, keep, fileSortMode,
		archiveSortMode, inputSections);
		}

		const OutputSectionDescription *Parser::parseOutputSectionDescription() {
		assert((_tok._kind == Token::kw_discard \|\| _tok._kind == Token::identifier) &&
		"Expected /DISCARD/ or identifier!");
		StringRef sectionName;
		const Expression *address = nullptr;
		const Expression *align = nullptr;
		const Expression *subAlign = nullptr;
		const Expression *at = nullptr;
		const Expression *fillExpr = nullptr;
		bool alignWithInput = false;
		bool discard = false;
		OutputSectionDescription::Constraint constraint =
		OutputSectionDescription::C_None;
		std::vector<const Command *> outputSectionCommands;

		if (_tok._kind == Token::kw_discard)
		discard = true;
		else
		sectionName = _tok._range;
		consumeToken();

		if (_tok._kind == Token::number \|\| _tok._kind == Token::identifier \|\|
		_tok._kind == Token::kw_align \|\| _tok._kind == Token::l_paren) {
		address = parseExpression();
		if (!address)
		return nullptr;
		}

		if (!expectAndConsume(Token::colon, "expected :"))
		return nullptr;

		if (_tok._kind == Token::kw_at) {
		consumeToken();
		at = parseExpression();
		if (!at)
		return nullptr;
		}

		if (_tok._kind == Token::kw_align) {
		consumeToken();
		align = parseExpression();
		if (!align)
		return nullptr;
		}

		if (_tok._kind == Token::kw_align_with_input) {
		consumeToken();
		alignWithInput = true;
		}

		if (_tok._kind == Token::kw_subalign) {
		consumeToken();
		subAlign = parseExpression();
		if (!subAlign)
		return nullptr;
		}

		if (_tok._kind == Token::kw_only_if_ro) {
		consumeToken();
		constraint = OutputSectionDescription::C_OnlyIfRO;
		} else if (_tok._kind == Token::kw_only_if_rw) {
		consumeToken();
		constraint = OutputSectionDescription::C_OnlyIfRW;
		}

		if (!expectAndConsume(Token::l_brace, "expected {"))
		return nullptr;

		// Parse zero or more output-section-commands
		while (_tok._kind != Token::r_brace) {
		switch (_tok._kind) {
		case Token::semicolon:
		consumeToken();
		break;
		case Token::identifier:
		switch (peek(1)._kind) {
		case Token::equal:
		case Token::plusequal:
		case Token::minusequal:
		case Token::starequal:
		case Token::slashequal:
		case Token::ampequal:
		case Token::pipeequal:
		case Token::lesslessequal:
		case Token::greatergreaterequal:
		if (const Command *cmd = parseSymbolAssignment())
		outputSectionCommands.push_back(cmd);
		else
		return nullptr;
		break;
		default:
		if (const Command *cmd = parseInputSectionFile())
		outputSectionCommands.push_back(cmd);
		else
		return nullptr;
		break;
		}
		break;
		case Token::kw_keep:
		case Token::star:
		case Token::colon:
		case Token::kw_sort_by_name:
		case Token::kw_sort_by_alignment:
		case Token::kw_sort_by_init_priority:
		case Token::kw_sort_none:
		if (const Command *cmd = parseInputSectionFile())
		outputSectionCommands.push_back(cmd);
		else
		return nullptr;
		break;
		case Token::kw_hidden:
		case Token::kw_provide:
		case Token::kw_provide_hidden:
		if (const Command *cmd = parseSymbolAssignment())
		outputSectionCommands.push_back(cmd);
		else
		return nullptr;
		break;
		default:
		error(_tok, "expected symbol assignment or input file name.");
		return nullptr;
		}
		}

		if (!expectAndConsume(Token::r_brace, "expected }"))
		return nullptr;

		if (_tok._kind == Token::equal) {
		consumeToken();
		fillExpr = parseExpression();
		if (!fillExpr)
		return nullptr;
		}

		return new (_alloc) OutputSectionDescription(
		sectionName, address, align, subAlign, at, fillExpr, alignWithInput,
		discard, constraint, outputSectionCommands);
		}

		const Overlay *Parser::parseOverlay() {
		assert(_tok._kind == Token::kw_overlay && "Expected OVERLAY!");
		error(_tok, "Overlay description is not yet supported.");
		return nullptr;
		}

		Sections *Parser::parseSections() {
		assert(_tok._kind == Token::kw_sections && "Expected SECTIONS!");
		consumeToken();
		if (!expectAndConsume(Token::l_brace, "expected {"))
		return nullptr;
		std::vector<const Command *> sectionsCommands;

		bool unrecognizedToken = false;
		// Parse zero or more sections-commands
		while (!unrecognizedToken) {
		switch (_tok._kind) {
		case Token::semicolon:
		consumeToken();
		break;

		case Token::identifier:
		switch (peek(1)._kind) {
		case Token::equal:
		case Token::plusequal:
		case Token::minusequal:
		case Token::starequal:
		case Token::slashequal:
		case Token::ampequal:
		case Token::pipeequal:
		case Token::lesslessequal:
		case Token::greatergreaterequal:
		if (const Command *cmd = parseSymbolAssignment())
		sectionsCommands.push_back(cmd);
		else
		return nullptr;
		break;
		default:
		if (const Command *cmd = parseOutputSectionDescription())
		sectionsCommands.push_back(cmd);
		else
		return nullptr;
		break;
		}
		break;

		case Token::kw_discard:
		case Token::star:
		if (const Command *cmd = parseOutputSectionDescription())
		sectionsCommands.push_back(cmd);
		else
		return nullptr;
		break;

		case Token::kw_entry:
		if (const Command *cmd = parseEntry())
		sectionsCommands.push_back(cmd);
		else
		return nullptr;
		break;

		case Token::kw_hidden:
		case Token::kw_provide:
		case Token::kw_provide_hidden:
		if (const Command *cmd = parseSymbolAssignment())
		sectionsCommands.push_back(cmd);
		else
		return nullptr;
		break;

		case Token::kw_overlay:
		if (const Command *cmd = parseOverlay())
		sectionsCommands.push_back(cmd);
		else
		return nullptr;
		break;

		default:
		unrecognizedToken = true;
		break;
		}
		}

		if (!expectAndConsume(
		Token::r_brace,
		"expected symbol assignment, entry, overlay or output section name."))
		return nullptr;

		return new (_alloc) Sections(sectionsCommands);
		}

} // end namespace script		} // end namespace script
} // end namespace lld		} // end namespace lld

test/LinkerScript/expr-precedence.test

This file was added.

				/*
				RUN: linker-script-test %s \| FileCheck %s
				*/
				SECTIONS {
				. = foo >= bar + 1 ? bar : 1;
				}

				/*
				CHECK: kw_sections: SECTIONS
				CHECK: l_brace: {
				CHECK: identifier: .
				CHECK: equal: =
				CHECK: identifier: foo
				CHECK: greaterequal: >=
				CHECK: identifier: bar
				CHECK: plus: +
				CHECK: number: 1
				CHECK: question: ?
				CHECK: identifier: bar
				CHECK: colon: :
				CHECK: number: 1
				CHECK: semicolon: ;
				CHECK: r_brace: }
				CHECK: eof:
				CHECK: SECTIONS
				CHECK: {
				CHECK: . = (foo >= (bar + 1)) ? bar : 1
				CHECK: }
				*/

test/LinkerScript/incomplete-ternary.test

This file was added.

				/*
				RUN: linker-script-test %s 2> %t \| FileCheck %s
				RUN: FileCheck -input-file %t -check-prefix=CHECK-ERR %s
				*/
				SECTIONS {
				. = foo ? bar;
				}

				/*
				CHECK: kw_sections: SECTIONS
				CHECK: l_brace: {
				CHECK: identifier: .
				CHECK: equal: =
				CHECK: identifier: foo
				CHECK: question: ?
				CHECK: identifier: bar
				CHECK: semicolon: ;
				CHECK: r_brace: }
				CHECK: eof:
				CHECK-ERR: 6:18: error: expected :
				*/

test/LinkerScript/missing-entry-symbol.test

This file was added.

				/*
				RUN: linker-script-test %s 2> %t \| FileCheck %s
				RUN: FileCheck -input-file %t -check-prefix=CHECK-ERR %s
				*/
				SECTIONS {
				ENTRY()
				}

				/*
				CHECK: l_brace: {
				CHECK: kw_entry: ENTRY
				CHECK: l_paren: (
				CHECK: r_paren: )
				CHECK: r_brace: }
				CHECK: eof:
				CHECK-ERR: 6:11: error: expected identifier in ENTRY
				*/

test/LinkerScript/missing-input-file-name.test

This file was added.

				/*
				RUN: linker-script-test %s 2> %t \| FileCheck %s
				RUN: FileCheck -input-file %t -check-prefix=CHECK-ERR %s
				*/
				SECTIONS {
				.text : { ()}
				}

				/*
				CHECK: kw_sections: SECTIONS
				CHECK: l_brace: {
				CHECK: identifier: .text
				CHECK: colon: :
				CHECK: l_brace: {
				CHECK: l_paren: (
				CHECK: r_paren: )
				CHECK: r_brace: }
				CHECK: r_brace: }
				CHECK: eof:
				CHECK-ERR: 6:15: error: expected symbol assignment or input file name.
				*/

test/LinkerScript/missing-input-sections.test

This file was added.

				/*
				RUN: linker-script-test %s 2> %t \| FileCheck %s
				RUN: FileCheck -input-file %t -check-prefix=CHECK-ERR %s
				*/
				SECTIONS {
				.text : { *(+)}
				}

				/*
				CHECK: kw_sections: SECTIONS
				CHECK: l_brace: {
				CHECK: identifier: .text
				CHECK: colon: :
				CHECK: l_brace: {
				CHECK: star: *
				CHECK: l_paren: (
				CHECK: plus: +
				CHECK: r_paren: )
				CHECK: r_brace: }
				CHECK: r_brace: }
				CHECK: eof:
				CHECK-ERR: 6:16: error: expected )
				*/

test/LinkerScript/missing-operand.test

This file was added.

				/*
				RUN: linker-script-test %s 2> %t \| FileCheck %s
				RUN: FileCheck -check-prefix=CHECK-ERR -input-file %t %s
				*/
				SECTIONS {
				. = foo / ;
				}

				/*
				CHECK: kw_sections: SECTIONS
				CHECK: l_brace: {
				CHECK: identifier: .
				CHECK: equal: =
				CHECK: identifier: foo
				CHECK: slash: /
				CHECK: semicolon: ;
				CHECK: r_brace: }
				CHECK: eof:
				CHECK-ERR: 6:15: error: expected symbol, number or left parenthesis.
				*/
				silvasUnsubmitted Not Done Reply Inline Actions FileCheck has some special functionality for checking diagnostics that avoids the need to hard-code absolute line numbers. You should use it: http://llvm.org/docs/CommandGuide/FileCheck.html#filecheck-expressions Also, it should be pretty easy to do caret diagnostics. silvas: FileCheck has some special functionality for checking diagnostics that avoids the need to hard…

test/LinkerScript/missing-output-section-name.test

This file was added.

				/*
				RUN: linker-script-test %s 2> %t \| FileCheck %s
				RUN: FileCheck -input-file %t -check-prefix=CHECK-ERR %s
				*/
				SECTIONS {
				: { *()}
				}

				/*
				CHECK: kw_sections: SECTIONS
				CHECK: l_brace: {
				CHECK: colon: :
				CHECK: l_brace: {
				CHECK: star: *
				CHECK: l_paren: (
				CHECK: r_paren: )
				CHECK: r_brace: }
				CHECK: r_brace: }
				CHECK: eof:
				CHECK-ERR: 6:5: error: expected symbol assignment, entry, overlay or output section name
				*/

test/LinkerScript/missing-symbol.test

This file was added.

				/*
				RUN: linker-script-test %s 2> %t \| FileCheck %s
				RUN: FileCheck -input-file %t -check-prefix=CHECK-ERR %s
				*/
				SECTIONS {
				= foo + bar;
				}

				/*
				CHECK: kw_sections: SECTIONS
				CHECK: l_brace: {
				CHECK: equal: =
				CHECK: identifier: foo
				CHECK: plus: +
				CHECK: identifier: bar
				CHECK: semicolon: ;
				CHECK: r_brace: }
				CHECK: eof:
				CHECK-ERR: 6:5: error: expected symbol assignment, entry, overlay or output section name
				*/

test/LinkerScript/sections.test

This file was added.

				/*
				This test exercises parsing typical commands found in GNU ld linker scripts.
				RUN: linker-script-test %s \| FileCheck %s
				*/

				SEARCH_DIR("/usr/x86_64-linux-gnu/lib64"); SEARCH_DIR("=/usr/local/lib/x86_64-linux-gnu");
				SECTIONS
				{
				PROVIDE (__executable_start = SEGMENT_START("text-segment", 0x400000)); . = SEGMENT_START("text-segment", 0x400000) + SIZEOF_HEADERS;
				.interp : { *(.interp) }
				.note.gnu.build-id : { *(.note.gnu.build-id) }
				.hash : { *(.hash) }
				.rela.dyn :
				{
				*(.rela.init)
				(.rela.text .rela.text. .rela.gnu.linkonce.t.*)
				*(.rela.fini)
				(.rela.rodata .rela.rodata. .rela.gnu.linkonce.r.*)
				}
				.rela.plt :
				{
				*(.rela.plt)
				PROVIDE_HIDDEN (__rela_iplt_start = .);
				*(.rela.iplt)
				PROVIDE_HIDDEN (__rela_iplt_end = .);
				}
				.init :
				{
				KEEP (*(SORT_NONE(.init)))
				}
				PROVIDE (__etext = .);
				.eh_frame : ONLY_IF_RO { KEEP (*(.eh_frame)) }
				.exception_ranges : ONLY_IF_RO { *(.exception_ranges
				.exception_ranges*) }
				. = ALIGN (CONSTANT (MAXPAGESIZE)) - ((CONSTANT (MAXPAGESIZE) - .) & (CONSTANT (MAXPAGESIZE) - 1)); . = DATA_SEGMENT_ALIGN (CONSTANT (MAXPAGESIZE), CONSTANT (COMMONPAGESIZE));
				/* Exception handling */
				.eh_frame : ONLY_IF_RW { KEEP (*(.eh_frame)) }
				.ctors :
				{
				KEEP (*crtbegin.o(.ctors))
				KEEP (*crtbegin?.o(.ctors))
				KEEP ((EXCLUDE_FILE (crtend.o *crtend?.o ) .ctors))
				KEEP ((SORT(.ctors.)))
				KEEP (*(.ctors))
				}
				.dtors :
				{
				KEEP (*crtbegin.o(.dtors))
				KEEP (*crtbegin?.o(.dtors))
				KEEP ((EXCLUDE_FILE (crtend.o *crtend?.o ) .dtors))
				KEEP ((SORT(.dtors.)))
				KEEP (*(.dtors))
				}
				. = DATA_SEGMENT_RELRO_END (SIZEOF (.got.plt) >= 24 ? 24 : 0, .);
				.got.plt : { (.got.plt) (.igot.plt) }
				.lrodata ALIGN(CONSTANT (MAXPAGESIZE)) + (. & (CONSTANT (MAXPAGESIZE) - 1)) :
				{
				(.lrodata .lrodata. .gnu.linkonce.lr.*)
				}
				.ldata ALIGN(CONSTANT (MAXPAGESIZE)) + (. & (CONSTANT (MAXPAGESIZE) - 1)) :
				{
				(.ldata .ldata. .gnu.linkonce.l.*)
				. = ALIGN(. != 0 ? 64 / 8 : 1);
				}
				. = ALIGN(64 / 8);
				_end = .; PROVIDE (end = .);
				. = DATA_SEGMENT_END (.);
				/DISCARD/ : { (.note.GNU-stack) (.gnu_debuglink) (.gnu.lto_) }
				}

				/*
				CHECK: kw_search_dir: SEARCH_DIR
				CHECK: l_paren: (
				CHECK: identifier: /usr/x86_64-linux-gnu/lib64
				CHECK: r_paren: )
				CHECK: semicolon: ;
				CHECK: kw_search_dir: SEARCH_DIR
				CHECK: l_paren: (
				CHECK: identifier: =/usr/local/lib/x86_64-linux-gnu
				CHECK: r_paren: )
				CHECK: semicolon: ;
				CHECK: kw_sections: SECTIONS
				CHECK: l_brace: {
				CHECK: kw_provide: PROVIDE
				CHECK: l_paren: (
				CHECK: identifier: __executable_start
				CHECK: equal: =
				CHECK: identifier: SEGMENT_START
				CHECK: l_paren: (
				CHECK: identifier: text-segment
				CHECK: comma: ,
				CHECK: number: 0x400000
				CHECK: r_paren: )
				CHECK: r_paren: )
				CHECK: semicolon: ;
				CHECK: identifier: .
				CHECK: equal: =
				CHECK: identifier: SEGMENT_START
				CHECK: l_paren: (
				CHECK: identifier: text-segment
				CHECK: comma: ,
				CHECK: number: 0x400000
				CHECK: r_paren: )
				CHECK: plus: +
				CHECK: identifier: SIZEOF_HEADERS
				CHECK: semicolon: ;
				CHECK: identifier: .interp
				CHECK: colon: :
				CHECK: l_brace: {
				CHECK: star: *
				CHECK: l_paren: (
				CHECK: identifier: .interp
				CHECK: r_paren: )
				CHECK: r_brace: }
				CHECK: identifier: .note.gnu.build-id
				CHECK: colon: :
				CHECK: l_brace: {
				CHECK: star: *
				CHECK: l_paren: (
				CHECK: identifier: .note.gnu.build-id
				CHECK: r_paren: )
				CHECK: r_brace: }
				CHECK: identifier: .hash
				CHECK: colon: :
				CHECK: l_brace: {
				CHECK: star: *
				CHECK: l_paren: (
				CHECK: identifier: .hash
				CHECK: r_paren: )
				CHECK: r_brace: }
				CHECK: identifier: .rela.dyn
				CHECK: colon: :
				CHECK: l_brace: {
				CHECK: star: *
				CHECK: l_paren: (
				CHECK: identifier: .rela.init
				CHECK: r_paren: )
				CHECK: star: *
				CHECK: l_paren: (
				CHECK: identifier: .rela.text
				CHECK: identifier: .rela.text.*
				CHECK: identifier: .rela.gnu.linkonce.t.*
				CHECK: r_paren: )
				CHECK: star: *
				CHECK: l_paren: (
				CHECK: identifier: .rela.fini
				CHECK: r_paren: )
				CHECK: star: *
				CHECK: l_paren: (
				CHECK: identifier: .rela.rodata
				CHECK: identifier: .rela.rodata.*
				CHECK: identifier: .rela.gnu.linkonce.r.*
				CHECK: r_paren: )
				CHECK: r_brace: }
				CHECK: identifier: .rela.plt
				CHECK: colon: :
				CHECK: l_brace: {
				CHECK: star: *
				CHECK: l_paren: (
				CHECK: identifier: .rela.plt
				CHECK: r_paren: )
				CHECK: kw_provide_hidden: PROVIDE_HIDDEN
				CHECK: l_paren: (
				CHECK: identifier: __rela_iplt_start
				CHECK: equal: =
				CHECK: identifier: .
				CHECK: r_paren: )
				CHECK: semicolon: ;
				CHECK: star: *
				CHECK: l_paren: (
				CHECK: identifier: .rela.iplt
				CHECK: r_paren: )
				CHECK: kw_provide_hidden: PROVIDE_HIDDEN
				CHECK: l_paren: (
				CHECK: identifier: __rela_iplt_end
				CHECK: equal: =
				CHECK: identifier: .
				CHECK: r_paren: )
				CHECK: semicolon: ;
				CHECK: r_brace: }
				CHECK: identifier: .init
				CHECK: colon: :
				CHECK: l_brace: {
				CHECK: kw_keep: KEEP
				CHECK: l_paren: (
				CHECK: star: *
				CHECK: l_paren: (
				CHECK: kw_sort_none: SORT_NONE
				CHECK: l_paren: (
				CHECK: identifier: .init
				CHECK: r_paren: )
				CHECK: r_paren: )
				CHECK: r_paren: )
				CHECK: r_brace: }
				CHECK: kw_provide: PROVIDE
				CHECK: l_paren: (
				CHECK: identifier: __etext
				CHECK: equal: =
				CHECK: identifier: .
				CHECK: r_paren: )
				CHECK: semicolon: ;
				CHECK: identifier: .eh_frame
				CHECK: colon: :
				CHECK: kw_only_if_ro: ONLY_IF_RO
				CHECK: l_brace: {
				CHECK: kw_keep: KEEP
				CHECK: l_paren: (
				CHECK: star: *
				CHECK: l_paren: (
				CHECK: identifier: .eh_frame
				CHECK: r_paren: )
				CHECK: r_paren: )
				CHECK: r_brace: }
				CHECK: identifier: .exception_ranges
				CHECK: colon: :
				CHECK: kw_only_if_ro: ONLY_IF_RO
				CHECK: l_brace: {
				CHECK: star: *
				CHECK: l_paren: (
				CHECK: identifier: .exception_ranges
				CHECK: identifier: .exception_ranges*
				CHECK: r_paren: )
				CHECK: r_brace: }
				CHECK: identifier: .
				CHECK: equal: =
				CHECK: kw_align: ALIGN
				CHECK: l_paren: (
				CHECK: identifier: CONSTANT
				CHECK: l_paren: (
				CHECK: identifier: MAXPAGESIZE
				CHECK: r_paren: )
				CHECK: r_paren: )
				CHECK: minus: -
				CHECK: l_paren: (
				CHECK: l_paren: (
				CHECK: identifier: CONSTANT
				CHECK: l_paren: (
				CHECK: identifier: MAXPAGESIZE
				CHECK: r_paren: )
				CHECK: minus: -
				CHECK: identifier: .
				CHECK: r_paren: )
				CHECK: amp: &
				CHECK: l_paren: (
				CHECK: identifier: CONSTANT
				CHECK: l_paren: (
				CHECK: identifier: MAXPAGESIZE
				CHECK: r_paren: )
				CHECK: minus: -
				CHECK: number: 1
				CHECK: r_paren: )
				CHECK: r_paren: )
				CHECK: semicolon: ;
				CHECK: identifier: .
				CHECK: equal: =
				CHECK: identifier: DATA_SEGMENT_ALIGN
				CHECK: l_paren: (
				CHECK: identifier: CONSTANT
				CHECK: l_paren: (
				CHECK: identifier: MAXPAGESIZE
				CHECK: r_paren: )
				CHECK: comma: ,
				CHECK: identifier: CONSTANT
				CHECK: l_paren: (
				CHECK: identifier: COMMONPAGESIZE
				CHECK: r_paren: )
				CHECK: r_paren: )
				CHECK: semicolon: ;
				CHECK: identifier: .eh_frame
				CHECK: colon: :
				CHECK: kw_only_if_rw: ONLY_IF_RW
				CHECK: l_brace: {
				CHECK: kw_keep: KEEP
				CHECK: l_paren: (
				CHECK: star: *
				CHECK: l_paren: (
				CHECK: identifier: .eh_frame
				CHECK: r_paren: )
				CHECK: r_paren: )
				CHECK: r_brace: }
				CHECK: identifier: .ctors
				CHECK: colon: :
				CHECK: l_brace: {
				CHECK: kw_keep: KEEP
				CHECK: l_paren: (
				CHECK: identifier: *crtbegin.o
				CHECK: l_paren: (
				CHECK: identifier: .ctors
				CHECK: r_paren: )
				CHECK: r_paren: )
				CHECK: kw_keep: KEEP
				CHECK: l_paren: (
				CHECK: identifier: *crtbegin?.o
				CHECK: l_paren: (
				CHECK: identifier: .ctors
				CHECK: r_paren: )
				CHECK: r_paren: )
				CHECK: kw_keep: KEEP
				CHECK: l_paren: (
				CHECK: star: *
				CHECK: l_paren: (
				CHECK: kw_exclude_file: EXCLUDE_FILE
				CHECK: l_paren: (
				CHECK: identifier: *crtend.o
				CHECK: identifier: *crtend?.o
				CHECK: r_paren: )
				CHECK: identifier: .ctors
				CHECK: r_paren: )
				CHECK: r_paren: )
				CHECK: kw_keep: KEEP
				CHECK: l_paren: (
				CHECK: star: *
				CHECK: l_paren: (
				CHECK: kw_sort_by_name: SORT
				CHECK: l_paren: (
				CHECK: identifier: .ctors.*
				CHECK: r_paren: )
				CHECK: r_paren: )
				CHECK: r_paren: )
				CHECK: kw_keep: KEEP
				CHECK: l_paren: (
				CHECK: star: *
				CHECK: l_paren: (
				CHECK: identifier: .ctors
				CHECK: r_paren: )
				CHECK: r_paren: )
				CHECK: r_brace: }
				CHECK: identifier: .dtors
				CHECK: colon: :
				CHECK: l_brace: {
				CHECK: kw_keep: KEEP
				CHECK: l_paren: (
				CHECK: identifier: *crtbegin.o
				CHECK: l_paren: (
				CHECK: identifier: .dtors
				CHECK: r_paren: )
				CHECK: r_paren: )
				CHECK: kw_keep: KEEP
				CHECK: l_paren: (
				CHECK: identifier: *crtbegin?.o
				CHECK: l_paren: (
				CHECK: identifier: .dtors
				CHECK: r_paren: )
				CHECK: r_paren: )
				CHECK: kw_keep: KEEP
				CHECK: l_paren: (
				CHECK: star: *
				CHECK: l_paren: (
				CHECK: kw_exclude_file: EXCLUDE_FILE
				CHECK: l_paren: (
				CHECK: identifier: *crtend.o
				CHECK: identifier: *crtend?.o
				CHECK: r_paren: )
				CHECK: identifier: .dtors
				CHECK: r_paren: )
				CHECK: r_paren: )
				CHECK: kw_keep: KEEP
				CHECK: l_paren: (
				CHECK: star: *
				CHECK: l_paren: (
				CHECK: kw_sort_by_name: SORT
				CHECK: l_paren: (
				CHECK: identifier: .dtors.*
				CHECK: r_paren: )
				CHECK: r_paren: )
				CHECK: r_paren: )
				CHECK: kw_keep: KEEP
				CHECK: l_paren: (
				CHECK: star: *
				CHECK: l_paren: (
				CHECK: identifier: .dtors
				CHECK: r_paren: )
				CHECK: r_paren: )
				CHECK: r_brace: }
				CHECK: identifier: .
				CHECK: equal: =
				CHECK: identifier: DATA_SEGMENT_RELRO_END
				CHECK: l_paren: (
				CHECK: identifier: SIZEOF
				CHECK: l_paren: (
				CHECK: identifier: .got.plt
				CHECK: r_paren: )
				CHECK: greaterequal: >=
				CHECK: number: 24
				CHECK: question: ?
				CHECK: number: 24
				CHECK: colon: :
				CHECK: number: 0
				CHECK: comma: ,
				CHECK: identifier: .
				CHECK: r_paren: )
				CHECK: semicolon: ;
				CHECK: identifier: .got.plt
				CHECK: colon: :
				CHECK: l_brace: {
				CHECK: star: *
				CHECK: l_paren: (
				CHECK: identifier: .got.plt
				CHECK: r_paren: )
				CHECK: star: *
				CHECK: l_paren: (
				CHECK: identifier: .igot.plt
				CHECK: r_paren: )
				CHECK: r_brace: }
				CHECK: identifier: .lrodata
				CHECK: kw_align: ALIGN
				CHECK: l_paren: (
				CHECK: identifier: CONSTANT
				CHECK: l_paren: (
				CHECK: identifier: MAXPAGESIZE
				CHECK: r_paren: )
				CHECK: r_paren: )
				CHECK: plus: +
				CHECK: l_paren: (
				CHECK: identifier: .
				CHECK: amp: &
				CHECK: l_paren: (
				CHECK: identifier: CONSTANT
				CHECK: l_paren: (
				CHECK: identifier: MAXPAGESIZE
				CHECK: r_paren: )
				CHECK: minus: -
				CHECK: number: 1
				CHECK: r_paren: )
				CHECK: r_paren: )
				CHECK: colon: :
				CHECK: l_brace: {
				CHECK: star: *
				CHECK: l_paren: (
				CHECK: identifier: .lrodata
				CHECK: identifier: .lrodata.*
				CHECK: identifier: .gnu.linkonce.lr.*
				CHECK: r_paren: )
				CHECK: r_brace: }
				CHECK: identifier: .ldata
				CHECK: kw_align: ALIGN
				CHECK: l_paren: (
				CHECK: identifier: CONSTANT
				CHECK: l_paren: (
				CHECK: identifier: MAXPAGESIZE
				CHECK: r_paren: )
				CHECK: r_paren: )
				CHECK: plus: +
				CHECK: l_paren: (
				CHECK: identifier: .
				CHECK: amp: &
				CHECK: l_paren: (
				CHECK: identifier: CONSTANT
				CHECK: l_paren: (
				CHECK: identifier: MAXPAGESIZE
				CHECK: r_paren: )
				CHECK: minus: -
				CHECK: number: 1
				CHECK: r_paren: )
				CHECK: r_paren: )
				CHECK: colon: :
				CHECK: l_brace: {
				CHECK: star: *
				CHECK: l_paren: (
				CHECK: identifier: .ldata
				CHECK: identifier: .ldata.*
				CHECK: identifier: .gnu.linkonce.l.*
				CHECK: r_paren: )
				CHECK: identifier: .
				CHECK: equal: =
				CHECK: kw_align: ALIGN
				CHECK: l_paren: (
				CHECK: identifier: .
				CHECK: exclaimequal: !=
				CHECK: number: 0
				CHECK: question: ?
				CHECK: number: 64
				CHECK: slash: /
				CHECK: number: 8
				CHECK: colon: :
				CHECK: number: 1
				CHECK: r_paren: )
				CHECK: semicolon: ;
				CHECK: r_brace: }
				CHECK: identifier: .
				CHECK: equal: =
				CHECK: kw_align: ALIGN
				CHECK: l_paren: (
				CHECK: number: 64
				CHECK: slash: /
				CHECK: number: 8
				CHECK: r_paren: )
				CHECK: semicolon: ;
				CHECK: identifier: _end
				CHECK: equal: =
				CHECK: identifier: .
				CHECK: semicolon: ;
				CHECK: kw_provide: PROVIDE
				CHECK: l_paren: (
				CHECK: identifier: end
				CHECK: equal: =
				CHECK: identifier: .
				CHECK: r_paren: )
				CHECK: semicolon: ;
				CHECK: identifier: .
				CHECK: equal: =
				CHECK: identifier: DATA_SEGMENT_END
				CHECK: l_paren: (
				CHECK: identifier: .
				CHECK: r_paren: )
				CHECK: semicolon: ;
				CHECK: kw_discard: /DISCARD/
				CHECK: colon: :
				CHECK: l_brace: {
				CHECK: star: *
				CHECK: l_paren: (
				CHECK: identifier: .note.GNU-stack
				CHECK: r_paren: )
				CHECK: star: *
				CHECK: l_paren: (
				CHECK: identifier: .gnu_debuglink
				CHECK: r_paren: )
				CHECK: star: *
				CHECK: l_paren: (
				CHECK: identifier: .gnu.lto_*
				CHECK: r_paren: )
				CHECK: r_brace: }
				CHECK: r_brace: }
				CHECK: eof:
				CHECK: SEARCH_DIR(/usr/x86_64-linux-gnu/lib64)
				CHECK: SEARCH_DIR(=/usr/local/lib/x86_64-linux-gnu)
				CHECK: SECTIONS
				CHECK: {
				CHECK: PROVIDE(__executable_start = SEGMENT_START(text-segment, 4194304))
				CHECK: . = (SEGMENT_START(text-segment, 4194304) + SIZEOF_HEADERS)
				CHECK: .interp :
				CHECK: {
				CHECK: *:(.interp )
				CHECK: }
				CHECK: .note.gnu.build-id :
				CHECK: {
				CHECK: *:(.note.gnu.build-id )
				CHECK: }
				CHECK: .hash :
				CHECK: {
				CHECK: *:(.hash )
				CHECK: }
				CHECK: .rela.dyn :
				CHECK: {
				CHECK: *:(.rela.init )
				CHECK: :(.rela.text .rela.text. .rela.gnu.linkonce.t.* )
				CHECK: *:(.rela.fini )
				CHECK: :(.rela.rodata .rela.rodata. .rela.gnu.linkonce.r.* )
				CHECK: }
				CHECK: .rela.plt :
				CHECK: {
				CHECK: *:(.rela.plt )
				CHECK: PROVIDE_HIDDEN(__rela_iplt_start = .)
				CHECK: *:(.rela.iplt )
				CHECK: PROVIDE_HIDDEN(__rela_iplt_end = .)
				CHECK: }
				CHECK: .init :
				CHECK: {
				CHECK: KEEP(*:(SORT_NONE(.init ) ))
				CHECK: }
				CHECK: PROVIDE(__etext = .)
				CHECK: .eh_frame :
				CHECK: ONLY_IF_RO {
				CHECK: KEEP(*:(.eh_frame ))
				CHECK: }
				CHECK: .exception_ranges :
				CHECK: ONLY_IF_RO {
				CHECK: :(.exception_ranges .exception_ranges )
				CHECK: }
				CHECK: . = (ALIGN(CONSTANT(MAXPAGESIZE)) - ((CONSTANT(MAXPAGESIZE) - .) & (CONSTANT(MAXPAGESIZE) - 1)))
				CHECK: . = DATA_SEGMENT_ALIGN(CONSTANT(MAXPAGESIZE), CONSTANT(COMMONPAGESIZE))
				CHECK: .eh_frame :
				CHECK: ONLY_IF_RW {
				CHECK: KEEP(*:(.eh_frame ))
				CHECK: }
				CHECK: .ctors :
				CHECK: {
				CHECK: KEEP(*crtbegin.o:(.ctors ))
				CHECK: KEEP(*crtbegin?.o:(.ctors ))
				CHECK: KEEP(:(EXCLUDE_FILE(crtend.o) EXCLUDE_FILE(*crtend?.o) .ctors ))
				CHECK: KEEP(:(SORT_BY_NAME(.ctors. ) ))
				CHECK: KEEP(*:(.ctors ))
				CHECK: }
				CHECK: .dtors :
				CHECK: {
				CHECK: KEEP(*crtbegin.o:(.dtors ))
				CHECK: KEEP(*crtbegin?.o:(.dtors ))
				CHECK: KEEP(:(EXCLUDE_FILE(crtend.o) EXCLUDE_FILE(*crtend?.o) .dtors ))
				CHECK: KEEP(:(SORT_BY_NAME(.dtors. ) ))
				CHECK: KEEP(*:(.dtors ))
				CHECK: }
				CHECK: . = DATA_SEGMENT_RELRO_END((SIZEOF(.got.plt) >= 24) ? 24 : 0, .)
				CHECK: .got.plt :
				CHECK: {
				CHECK: *:(.got.plt )
				CHECK: *:(.igot.plt )
				CHECK: }
				CHECK: .lrodata (ALIGN(CONSTANT(MAXPAGESIZE)) + (. & (CONSTANT(MAXPAGESIZE) - 1))) :
				CHECK: {
				CHECK: :(.lrodata .lrodata. .gnu.linkonce.lr.* )
				CHECK: }
				CHECK: .ldata (ALIGN(CONSTANT(MAXPAGESIZE)) + (. & (CONSTANT(MAXPAGESIZE) - 1))) :
				CHECK: {
				CHECK: :(.ldata .ldata. .gnu.linkonce.l.* )
				CHECK: . = ALIGN((. != 0) ? (64 / 8) : 1)
				CHECK: }
				CHECK: . = ALIGN((64 / 8))
				CHECK: _end = .
				CHECK: PROVIDE(end = .)
				CHECK: . = DATA_SEGMENT_END(.)
				CHECK: :
				CHECK: {
				CHECK: *:(.note.GNU-stack )
				CHECK: *:(.gnu_debuglink )
				CHECK: :(.gnu.lto_ )
				CHECK: }
				CHECK: }
				*/

This is an archive of the discontinued LLVM Phabricator instance.

[lld] Teach LLD how to parse complete linker scriptsAbandonedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 15233

include/lld/ReaderWriter/LinkerScript.h

lib/ReaderWriter/LinkerScript.cpp

test/LinkerScript/expr-precedence.test

test/LinkerScript/incomplete-ternary.test

test/LinkerScript/missing-entry-symbol.test

test/LinkerScript/missing-input-file-name.test

test/LinkerScript/missing-input-sections.test

test/LinkerScript/missing-operand.test

test/LinkerScript/missing-output-section-name.test

test/LinkerScript/missing-symbol.test

test/LinkerScript/sections.test

[lld] Teach LLD how to parse complete linker scripts
AbandonedPublic