swift-mirror

mirror of https://github.com/apple/swift.git synced 2025-12-14 20:36:38 +01:00

Author	SHA1	Message	Date
Argyrios Kyrtzidis	cda7a109bd	[Lexer] Remove the special hack in the Lexer that handles lexing an expression in an interpolated string. -Parse the expression using a begin/end state sub-Lexer instead of a general StringRef Lexer -Introduce a Lexer constructor accepting a BufferID and a range inside the buffer, and use it for swift::tokenize. Swift SVN r6938	2013-08-06 14:58:59 +00:00
Dmitri Hrybenko	7684d6dc2a	Add a good diagnostic for a hashbang line when it is not allowed Also centralizes the knowledge about whether the hashbang is allowed in the SourceManager. This fixes a bug in tokenize() because previously it just had to guess. Swift SVN r6822	2013-08-01 23:07:43 +00:00
Dmitri Hrybenko	5cea4ebf41	Code completion: put CodeCompletionOffset on SourceManager instead of passing around everywhere Fixes: rdar://14585108 Code completion does not work at the beginning of the file rdar://14592634 Code completion returns zero results at EOF in a function without a closing brace Swift SVN r6820	2013-08-01 22:08:58 +00:00
Dmitri Hrybenko	e1c4ae3174	Wrap llvm::SourceMgr in swift::SourceManager so that we can add new members to the source manager. Swift SVN r6815	2013-08-01 20:39:22 +00:00
Dmitri Hrybenko	015cea67a5	Move hashbang line detection to the Lexer implementation, instead of requiring the Lexer client to skip it before constructing the Lexer. Swift SVN r6801	2013-08-01 02:18:25 +00:00
Dmitri Hrybenko	a0cd8e93ae	Lexer: don't consume the first token before we apply all code completion settings This allows us to code complete at EOF when there is no newline. Swift SVN r6640	2013-07-26 03:48:20 +00:00
Dmitri Hrybenko	8363bc3ba4	Remove unused function Lexer::stateRangeHasCodeCompletionToken() Swift SVN r6636	2013-07-26 01:14:07 +00:00
Dmitri Hrybenko	f9fa6aa8fc	Code completion: insert a zero character into the buffer to mark the code completion token This is required to handle cases like fooObject.#^A^#.bar where code completion is invoked inside the ".." token. Previously, the token would not be split and the lexer would produce an incorrect tokenization for this case. Now we produce ".", tok::code_complete, ".". Swift SVN r6635	2013-07-26 00:45:57 +00:00
Argyrios Kyrtzidis	1e597f5d8a	[Parser] When delay-parsing a function body, make sure the closing brace is included. Swift SVN r6599	2013-07-25 16:20:13 +00:00
Argyrios Kyrtzidis	66d1e516c8	Refactor how multiple parsing passes and delayed parsing works. -Introduce PersistentParserState to represent state persistent among multiple parsing passes. The advantage is that PersistentParserState is independent of a particular Parser or Lexer object. -Use PersistentParserState to keep information about delayed function body parsing and eliminate parser-specific state from the AST (ParserTokenRange). -Introduce DelayedParsingCallbacks to abstract out of the parser the logic about which functions should be delayed or skipped. Many thanks to Dmitri for his valuable feedback! Swift SVN r6580	2013-07-25 01:40:16 +00:00
Dmitri Hrybenko	02084efab7	Implement code completion for some function calls and member variable accesses in expr-dot and expr-postfix that can be typechecked without typechecking the beginning of the function body. Swift SVN r6198	2013-07-12 02:00:41 +00:00
Dmitri Hrybenko	3c5b12fc0f	Code completion: add a lot of infrastructure code * Added a mode in swift-ide-test to test code completion. Unlike c-index-test, the code completion token in tests is a real token -- we don't need to count lines and columns anymore. * Added support in lexer to produce a code completion token. * Added a parser interface to code completion. It is passed down from the libFrontend to the parser, but its functions are not called yet. * Added a sketch of the interface of code completion consumer and code completion results. Note: all this is not doing anything useful yet. Swift SVN r6128	2013-07-10 20:53:40 +00:00
Argyrios Kyrtzidis	6e7d0490f7	Allow optionally to produce comment tokens when lexing and add a tokenize() utility function. Swift SVN r6008	2013-07-05 15:02:42 +00:00
Argyrios Kyrtzidis	fe8d9bba2f	[Lexer] Make the order of parameters of the protected constructor significantly different than the public one. This is to make overload resolution for the protected constructor call sites more explicit. Otherwise, adding a parameter in the protected constructor and a default parameter in the public one can inadvertently result in the protected constructor call sites switching to picking the public one. Swift SVN r6007	2013-07-05 15:02:41 +00:00
Dmitri Hrybenko	03074dceb2	StringLiteralExpr: always include a good source range When we are interpreting escape sequences in the lexer, we copy the string literal bytes to ASTContext instead of storing a pointer to the source buffer. But then we used to try to get a source location for that string in the heap, which is not a valid source buffer. It succeeds during parsing, but breaks when we try to print a diagnostic using this location. Added a verifier check for this. Also added a real source range for StringLiteralExpr, instead of a source range with begin == end, produced from the beginning location. Swift SVN r5961	2013-07-02 17:19:27 +00:00
Dmitri Hrybenko	0b8f72b9c1	Introduce an artificial EOF in the Lexer This ensures that we don't go past the end of a subrange of a buffer when doing delayed parsing. Swift SVN r5952	2013-07-01 21:48:38 +00:00
Dmitri Hrybenko	f73d866d91	Implement delayed parsing for function bodies In order to do this, we need to save and restore parser state easily. The important pieces of state are: * lexer position; * lexical scope stack. Lexer position can be saved/restored easily. We don't need to store the tokens for the function body because swift does not have a preprocessor and we can easily re-lex everything we need. We just store the lexer state for the beginning and the end of the body. To save the lexical scope stack, we had to change the underlying data structure. Originally, the parser used the ScopedHashTable, which supports only a stack of scopes. But we need a tree of scopes. I implemented TreeScopedHashTable based on ScopedHashTable. It has an optimization for pushing/popping scopes in a stack fashion -- these scopes will not be allocated on the heap. While ‘detached’ scopes that we want to re-enter later, and all their parent scopes, are moved to the heap. In parseIntoTranslationUnit() we do a second pass over the 'structural AST' that does not contain function bodies to actually parse them from saved token ranges. Swift SVN r5886	2013-06-28 22:38:10 +00:00
Dmitri Hrybenko	f4d6053ebd	Fix typo in 'getStateForBeginnigOfToken' Swift SVN r5761	2013-06-21 22:36:40 +00:00
Dmitri Hrybenko	a2ddd4e488	Refactor lexer backtracking. Introduce opaque types that encapsulate lexer and parser state. Backtracking will be used a lot when we implement delayed parsing for function bodies, and we don't want to leak lexer and parser state details to AST classes when we store the state for the first and last token for the function body. Swift SVN r5759	2013-06-21 22:26:41 +00:00
Chris Lattner	bde8f753da	wire up minimal sil function parsing (still not validating or creating a SIL function). This introduces contextual lexing state for SIL function bodies for SIL-specific lexing rules. Swift SVN r5179	2013-05-16 18:46:46 +00:00
Chris Lattner	5b4c31dc94	reland r4968, with a bugfix to avoid breaking the lexer measuring token lengths. Original message: SIL Parsing: add plumbing to know when we're parsing a .sil file Enhance the lexer to lex "sil" as a keyword in sil mode. Swift SVN r4988	2013-04-30 00:28:37 +00:00
Chris Lattner	aafe3bdbdc	revert r4968, it apparently breaks the world. I'll recommit it when I have time to investigate. Swift SVN r4971	2013-04-29 16:58:43 +00:00
Chris Lattner	b503206bec	SIL Parsing: add plumbing to know when we're parsing a .sil file Enhance the lexer to lex "sil" as a keyword in sil mode. Swift SVN r4970	2013-04-29 05:42:59 +00:00
Joe Groff	88d4284178	Parser: Relax whitespace rules for ( and [. Now that we enforce semicolon or newline separation between statements, we can relax the whitespace requirements on '(' and '[' tokens. A "following" token is now just a token that isn't at the start of a line, and any token can be a "starting" token. This allows for: a(b) a (b) a[b] a [b] to parse as applications and subscripts, and: a (b) a [b] to parse as an expr followed by a tuple or an expr followed by a container literal. Swift SVN r4573	2013-04-02 16:43:20 +00:00
Dave Zarzycki	f043cdca5a	Revert "wipUnifyFormToken" This reverts commit 39e8ace65c29d9e83f4fbebc24be9c9896e9a3e1. Sorry! Swift SVN r4185	2013-02-25 03:13:14 +00:00
Dave Zarzycki	06d3c09d97	wipUnifyFormToken Swift SVN r4183	2013-02-25 03:11:11 +00:00
Joe Groff	dc016b8d7e	Parse: Fake parsing type parameters in expressions When parsing an expression, if we see the production [[identifier '<']], use the following heuristic to choose whether to parse it as a type list or as an operator expression: - Speculatively parse the subsequent production as a type parameter list. - If the parse succeeds, examine the token after the closing '>'. If it is one of the following: l_paren_following l_square_following r_paren r_square l_brace r_brace comma semicolon period then accept the parse as a type list. - If the parse fails, or if the type list is not followed by one of those tokens, reject the type list and parse as an operator expression. This only implements the parsing rule. The type parameters are just dropped on the floor--the AST representation and Sema changes are forthcoming. Encouragingly, no test or library code appears to be broken by this rule. Swift SVN r4044	2013-02-14 00:34:55 +00:00
Joe Groff	081787c3d5	Lexer: Lex C99-style hexadecimal float literals. APFloat's parser gives us the parsing for free. Unlike C99 we require at least one digit on both sides of the hexadecimal point in order to allow '0x1.method()' expressions, similar to Dave's proposed change to float lexing. Also, we were requiring a sign after 'e' in the exponent, which is inconsistent with C, C++, and the Java regex we claim to follow, so I made the exponent sign optional. Swift SVN r3940	2013-02-03 19:06:08 +00:00
Dave Zarzycki	cd3f31fa9d	Formalize "starting" vs "following" token generation If we generalize John's insight about l_(paren\|square) being about "starting" and "following" tokens, then we can detect many statement or declaration boundaries that are lacking either white space or a semicolon. Ensuring some amount of whitespace between statements and declarations is good for future proofing. Swift SVN r3914	2013-01-31 18:57:53 +00:00
Dave Zarzycki	5fe85d7020	Formalize unary prefix '&' to mean "make ref" This makes reserved operator parsing more robust and easier to understand. Swift SVN r3884	2013-01-27 21:20:06 +00:00
Dave Zarzycki	735294a5c9	Make the lexing of '(', '[', and '.' consistent The lexer now models tuples, patterns, subscripting, function calls, and field access robustly. The output tokens are now better named as well: l_paren and l_paren_call, and l_square and l_square_subscript. It should be much more clear now which one to use. Also, the use of l_paren or l_square will not arbitrarily flip flop if the token before it is a keyword or if the token before it was the trailing ']' of an attribute list. Similarly, tuples will always cause the lexer to produce l_paren, regardless if the user typed '((x,y))' or '( (x,y))'. When we someday add array literals, the right token is now naturally falling out of the lexer. Swift SVN r3840	2013-01-23 03:23:17 +00:00
Dave Zarzycki	7a25fe61fa	Robustness: tighten up ".42" float literal lexing Swift SVN r3704	2013-01-07 22:03:22 +00:00
Doug Gregor	35e6e56595	When we split a token starting with '<' or '>', make sure that the resulting token goes back through the lexer to get the appropriate token kind. Thanks to Chris for spotting this. Also, document the '<' and '>' splitting behavior in LangRef. Swift SVN r2192	2012-06-18 16:40:59 +00:00
John McCall	8c46c69efa	Lexically distinguish prefix, postfix, and binary operators and use this information as cues in the language. Right now, we do not accept things like "-- i" because the prefix operator is not correctly right-bound; instead you have to write "--(i)". I'm okay with that; I did add a specialized diagnostic recognizing operator-binary in a place where we're expecting a potential operator-prefix. Swift SVN r2161	2012-06-07 01:00:06 +00:00
Chris Lattner	22300ed57f	improve diagnostic for a " in the middle of an interpolated string, including a range: t.swift:4:19: error: unexpected '"' character in string interpolation "Hello \(x+1 world" ~~~~~~~~~~^ Swift SVN r1759	2012-05-05 23:12:07 +00:00
Chris Lattner	024e592beb	Now that the parser's lexer can be changed, just change it, instead of mutating the lexer, simplifying things a bit. Swift SVN r1741	2012-05-04 06:16:29 +00:00
Chris Lattner	2f44c0038c	Initial stab at implementing string literal interpolation for simple expressions, e.g. "foo is \(i+j)". This implements rdar://11223686 Doug implemented all the hard parts of this. I ripped out support for nested string literals (i.e. string literals within an interpolated string), which simplified the approach and defined away some problems with his patch in progress. I plan a few refinements on top of this basic patch. Swift SVN r1738	2012-05-04 05:53:50 +00:00
Chris Lattner	615ca4a360	implement the rest of character literal support, and enhance Char to be character literal compatible. This wraps up rdar://11305635, though some cleanup of the testsuite can now be done. Swift SVN r1672	2012-04-27 06:18:30 +00:00
Chris Lattner	c7e68a4a8b	langref and lexer support for character literals. Swift SVN r1671	2012-04-27 05:51:30 +00:00
John McCall	e6d56fd718	Require an unspaced [ to start a subscript or array-type suffix. Per discussion, this should probably be "no newline since the last token", but that decision should be made simultaneously for ( and [. Swift SVN r1461	2012-04-18 08:08:58 +00:00
Chris Lattner	ce7403dd1e	implement IRGen of escapes. High unicode escapes aren't handled yet, but all the basics are. Swift SVN r1363	2012-04-10 20:59:45 +00:00
Chris Lattner	c78bdcff0e	Initial lexer support for string literals. The regex is trivial and will be extended in the future. Swift SVN r1348	2012-04-10 00:38:13 +00:00
John McCall	b8b1694564	Complain about file names that aren't valid identifiers. When we divide the world into scripts and modules, this won't matter for the former. Recognize <stdin> as a special case; it should instead just always be a script. Fixes rdar://problem/10986311. Swift SVN r1181	2012-03-11 09:15:17 +00:00
Doug Gregor	b1c98e1731	Introduce Lexer::getLocForEndOfToken() to adjust source locations to the end of the current token, and use it for proper translation from SourceRange to SMRange when printing diagnostics. Swift SVN r853	2011-11-09 22:03:30 +00:00
Doug Gregor	fc183b9cc3	Decouple the lexer from the ASTContext and specific BufferID. They aren't needed for the lexer proper (which just needs a buffer to dig through). Also, make it possible to suppress lexer diagnostics merely by not giving it a diagnostic engine to work with. Swift SVN r852	2011-11-09 21:49:40 +00:00
Doug Gregor	d0dd4d44bc	Surface the Token and Lexer headers as public headers, since we're going to need to be able to use the lexer from the 'swift' tool itself. Swift SVN r851	2011-11-09 21:09:49 +00:00
Chris Lattner	26f66a8b24	move all the parser headers into lib/Parse since they are now all private. I chose to just delete the -lex action in swift, since it was only useful for about 10 minutes during bringup and probably never will be again. Swift SVN r557	2011-08-13 22:51:04 +00:00
Chris Lattner	b8fd157450	rip out the dead "isPrecededByIdentifier" logic from the AST and Lexer. Swift SVN r529	2011-08-12 21:36:25 +00:00
Chris Lattner	0871b128f8	add support for /**/ comments to the lexer, allow nesting of them since we won't have the #if 0 hack to handle nesting. Swift SVN r506	2011-08-12 00:37:01 +00:00
Chris Lattner	23759c8624	rename Lex -> lex too. Swift SVN r484	2011-08-03 00:08:21 +00:00

1 2 3 4 5

218 Commits