swift-mirror

mirror of https://github.com/apple/swift.git synced 2025-12-14 20:36:38 +01:00

Author	SHA1	Message	Date
Michael Ilseman	f7cdda2720	[gardening] Clean up many String computed vars	2019-04-08 15:16:48 -07:00
Michael Ilseman	415cc8fb0c	[String.Index] Deprecate encodedOffset var/init String.Index has an encodedOffset-based initializer and computed property that exists for serialization purposes. It was documented as UTF-16 in the SE proposal introducing it, which was String's underlying encoding at the time, but the dream of String even then was to abstract away whatever encoding happened to be used. Serialization needs an explicit encoding for serialized indices to make sense: the offsets need to align with the view. With String utilizing UTF-8 encoding for native contents in Swift 5, serialization isn't necessarily the most efficient in UTF-16. Furthermore, the majority of usage of encodedOffset in the wild is buggy and operates under the assumption that a UTF-16 code unit was a Swift Character, which isn't even valid if the String is known to be all-ASCII (because CR-LF). This change introduces a pair of semantics-preserving alternatives to encodedOffset that explicitly call out the UTF-16 assumption. These serve as a gentle off-ramp for current mis-uses of encodedOffset.	2019-02-13 18:42:40 -08:00
Michael Ilseman	b01ee7267a	[String] Custom iterator for UTF16View (#20929) Defining a custom iterator for the UTF16View avoids some redundant computation over the indexing model. This speeds up iteration by around 40% on non-ASCII strings.	2018-12-01 09:35:27 -08:00
Michael Ilseman	034f76d10b	[String] Remove some unneeded inlinable annotations	2018-11-15 09:43:34 -08:00
Maxim Moiseev	cbf83ac04f	[NFC][stdlib] Add FIXME markers to simplify audit	2018-11-14 11:58:42 -08:00
Slava Pestov	f6c2caf64b	stdlib: Add @inlinable to @inline(__always) declarations These should be audited since some might not actually need to be @inlinable, but for now: - Anything public and @inline(__always) is now also @inlinable - Anything @usableFromInline and @inline(__always) is now @inlinable	2018-11-13 15:15:07 -05:00
Michael Ilseman	75943350d2	[String] Give String a custom iterator Gives us modest wins on complex grapheme strings, but up to 40% on heavy-ASCII strings.	2018-11-08 18:25:01 -08:00
Michael Ilseman	948655e850	[String] Cleanups, comments, documentation After rebasing on master and incorporating more 32-bit support, perform a bunch of cleanup, documentation updates, comments, move code back to String declaration, etc.	2018-11-04 10:42:42 -08:00
Michael Ilseman	7aea40680d	[String] NFC iterator fast-paths Refactor and rename _StringGutsSlice, apply NFC-aware fast paths to a new buffered iterator. Also, fix bug in _typeName which used to assume ASCIIness and better SIL optimizations on StringObject.	2018-11-04 10:42:41 -08:00
Michael Ilseman	a0e639eaf5	[String] Grapheme breaking fast-paths Add in our scalar-based fast-paths for UTF-8 and foreign strings, and update the grapheme cache.	2018-11-04 10:42:40 -08:00
Michael Ilseman	fe7c3ce2e4	[String] Refactorings and cleanup * Refactor out RRC implementation into dedicated file. * Change our `_invariantCheck` pattern to generate efficient code in asserts builds and make the optimizer's job easier. * Drop a few Bidi shims we no longer need. * Restore View decls to String, workaround no longer needed * Cleaner unicode helper facilities	2018-11-04 10:42:40 -08:00
Michael Ilseman	f23a3c19b8	[String] Bounds checking and Index cleanup	2018-11-04 10:42:40 -08:00
Michael Ilseman	89d18e1a3a	[String] Refactor helper code into UnicodeHelpers.swift. Clean up some of the index assumptions, stick index-aware methods on _StringGuts, and otherwise migrate code over to UnicodeHelpers.swift.	2018-11-04 10:42:40 -08:00
Michael Ilseman	4ab45dfe20	[String] Drop in initial UTF-8 String prototype This is a giant squashing of a lot of individual changes prototyping a switch of String in Swift 5 to be natively encoded as UTF-8. It includes what's necessary for a functional prototype, dropping some history, but still leaves plenty of history available for future commits. My apologies to anyone trying to do code archeology between this commit and the one prior. This was the lesser of evils.	2018-11-04 10:42:40 -08:00
Michael Ilseman	463e3747a8	[gardening] Factor out String bidi conformance Add StringCharacterView.swift for String's bidi conformance. NFC.	2018-07-25 14:14:37 -07:00
Ben Cohen	a4230ab2ad	[stdlib] Update stdlib to 4.0 and reorganize compatibility shims (#17580) * Update stdlib to 4.0 and move all compatibility shims into a dedicated source file	2018-06-29 06:26:52 -07:00
Ben Cohen	a51cc89b11	Replace _CharacterView with a typealias (#17472)	2018-06-25 13:22:09 -07:00
Karoy Lorentey	23c630ac92	[stdlib] Add @usableFromInline to internal typealiases that need it This fixes 3659 warnings in the standard library.	2018-06-18 16:34:19 +01:00
Michael Ilseman	3ee17102ed	[String.Index] Restore compound offsets. Move the shifts to index creation time rather than index comparison time. This seems to benefit micro benchmarks and cover up inefficiencies in our generic index distance calculations.	2018-05-25 09:54:35 -07:00
Michael Ilseman	614016fecd	[String.Index] Simplify and prepare for more resilience. Simplify String.Index by sinking transcoded offsets into the .utf8 variant. This is in preparation for a more resilient index type capable of supporting existential string indices.	2018-05-24 14:47:04 -07:00
Michael Ilseman	4a368ab46c	[string] Drop many @inlinable from big API. Drop append-related @inlinable annotations for String, StringGuts, StringStorage, and the Views. Drop several for larger operations, such as case conversion. Drop as many as we can from StringGuts for now.	2018-05-13 07:38:55 -07:00
Nate Cook	58933d88c5	[stdlib] Rename index(...) methods to firstIndex(...) A la SE-204.	2018-04-21 18:07:25 -05:00
Slava Pestov	2e5aef9c8d	stdlib: Remove redundant @usableFromInline attributes	2018-04-06 00:02:30 -07:00
Slava Pestov	e1f50b2d36	SE-0193: Rename @_inlineable to @inlinable, @_versioned to @usableFromInline	2018-03-30 21:55:30 -07:00
Karoy Lorentey	e6afe829a1	[stdlib] Silence deprecation warnings about CharacterView in stdlib - Rename `Substring.CharacterView` to `Substring._CharacterView`, adding a deprecated typealias for the original name, like we do for `String.CharacterView`. - Add a non-deprecated `Substring._characters` property, emulating `String.characters`. - Explicitly deprecate the following members: * String.withMutableCharacters<R>(_: (inout CharacterView) -> R) -> R * String.subscript(Range<Index>) -> String.CharacterView * Substring._CharacterView.subscript(Range<Index>) -> Substring.CharacterView * Substring.init(_: CharacterView) * String.init(_: Substring.CharacterView)	2018-01-24 21:16:48 +00:00
Michael Ilseman	3be2faf5d3	[String] Initial implementation of 64-bit StringGuts. Include the initial implementation of _StringGuts, a 2-word replacement for _LegacyStringCore. 64-bit Darwin supported, 32-bit and Linux support in subsequent commits.	2018-01-21 12:32:26 -08:00
Michael Ilseman	75463e30f3	[stdlib] Rename _StringCore to _LegacyStringCore. NFC. In grand LLVM tradition, the first step to redesigning _StringCore is to first rename it to _LegacyStringCore. Subsequent commits will introduce the replacement, and eventually all uses of the old one will be moved to the new one. NFC.	2018-01-21 12:28:56 -08:00
Ben Cohen	4ddac3fbbd	[stdlib] Eradicate IndexDistance associated type (#12641) * Eradicate IndexDistance associated type, replacing with Int everywhere * Consistently use Int for ExistentialCollection’s IndexDistance type. * Fix test for IndexDistance removal * Remove a handful of no-longer-needed explicit types * Add compatibility shims for non-Int index distances * Test compatibility shim * Move IndexDistance typealias into the Collection protocol	2017-12-08 12:00:23 -08:00
Ben Cohen	dcab9493ae	Removed some warnings (#12753)	2017-11-30 15:12:56 -08:00
Max Moiseev	a24998a5b1	[stdlib] Add missing @_fixed_layout attributes to fix resilience build	2017-10-02 15:19:06 -07:00
Max Moiseev	ef6b5c4795	Add missing @_inlineable attributes and deinits	2017-09-29 11:26:56 -07:00
Max Moiseev	53b8419279	[stdlib] Make all the stdlib APIs @_inlineable This change in theory should allow us to remove a special stdlib-only sil-serialize-all compilation mode. <rdar://problem/34138683>	2017-09-29 11:26:56 -07:00
swift-ci	79a3f9c415	Merge pull request #11670 from natecook1000/nc-rev-77-2	2017-09-19 10:15:59 -07:00
Nate Cook	050268d876	[stdlib] Documentation revisions - Update NSRange -> Range guidance - Fix example in Optional - Improve RangeExpression docs - Fix issue in UnsafeRawBufferPointer.initializeMemory - Code point -> scalar value most places - Reposition the dot above the scripty `i' - Fix ExpressibleByArrayLiteral code sample	2017-08-29 09:41:55 -05:00
Maxim Moiseev	ee5fb33656	[stdlib] Remove the Grand Renaming artifacts of Swift 3 era	2017-08-28 15:54:11 -07:00
Michael Ilseman	7c705c3a75	[stdlib] Deprecate String/Substring.CharacterView CharacterView is now entirely redundant in Swift 4. Deprecate its use. This also allows us to schedule the unbreaking of String.CharacterView leakiness without a hard source break.	2017-08-10 17:24:06 -07:00
Dave Abrahams	9159239995	Un-revert "[stdlib] String index interchange, etc." (#10812) I failed to merge the upstream changes to swift-corelibs-foundation at the same time as I merged that #9806, and it broke on linux. Going to get it right this time.	2017-07-07 12:13:25 -07:00
Xi Ge	d9fb110674	Revert "[stdlib] String index interchange, etc." (#10812) rdar://33186295	2017-07-07 12:03:16 -07:00
Dave Abrahams	e523c80339	[stdlib] Index interchange, part I	2017-07-07 00:59:04 -07:00
Michael Ilseman	5bc20cba08	[stdlib] Clean up non-contiguous string grapheme breaking code. Removes the legacy grapheme breaking code paths. Simplifies and clarifies the non-contiguous grapheme breaking code through consistent naming and handling of absolute positions vs relative offsets.	2017-06-28 15:46:44 -07:00
Michael Ilseman	b3b28e0c50	[gardening] 80 columns; NFC	2017-06-28 15:46:39 -07:00
Michael Ilseman	a37a823e6e	[stdlib] Update non-contiguous NSStrings to Unicode 9 This adds Unicode 9 grapheme breaking support for non-contiguous NSStrings. Non-contiguous NSStrings that don't hit our fast paths are very rare, but should still behave identically to contiguous strings. We first copy a fixed number of code units into a fixed size buffer (currently 16 in size) and try to grapheme break inside of that buffer. This is sufficient storage for all known non-pathological graphemes. Any graphemes larger than the buffer are handled by copying larger portions of the string into an Array. Test cases added, including pathological "zalgo" text that stresses extremely long graphemes.	2017-06-28 15:35:25 -07:00
Michael Ilseman	4c0ba61e53	[gardening] Remove done TODO comments	2017-06-27 20:37:16 -07:00
Michael Ilseman	bd5189c25a	[String] Grapheme fast paths for punctuation: 5-8x speedup. Many strings use non-sub-300 punctuation characters (e.g. unicode hyphen, CJK quotes, etc). This can cause switching between fast and slow paths for grapheme breaking. Add in fast-paths for general punctuation characters and CJK punctuation and symbol characters. This results in about a 5-8x speedup for heavily (unicode) punctuated Latiny and CJKy workloads.	2017-06-27 19:18:51 -07:00
Nate Cook	825e9d077d	[stdlib] More documentation revisions / consistency fixes.	2017-06-13 14:08:00 -05:00
Dave Abrahams	562fd79aa6	[stdlib] Encode small Characters as UTF-16 This takes care of the standard library portion, but we need a new BuiltinUTF16ExtendedGraphemeClusterLiteralConvertible protocol in order to fully recover the performance of character literals. Note that part of the character_literals.swift test is currently disabled. That will need to be fixed before we can merge this work.	2017-06-01 20:57:25 -07:00
Michael Ilseman	44cccba22d	[stdlib] Change dynamic check to sanity check. Double-checking for CR-LF is redundant in _internalExtraCheckGraphemeBreakBetween. Add in a sanity check and omit the overly conservative CR check.	2017-05-31 14:55:24 -07:00
Michael Ilseman	0a88de53d3	[stdlib] Grapheme break fast-paths for Cyrillic, Arabic, Hangul Add in more grapheme break fast paths for scripts based on Cyrillic, Arabic, or Hangul. Generates significant performance wins, similar to those for the unihan fast paths. While every extra check does slow down the runtime of _internalExtraCheckGraphemeBreakBetween as currently implemented, I've not found the performance cost to be relevant for workloads with occasional mixed emoji contents, nor for workloads that hit the earlier checks. A pure Korean workload (currently the last check) does pay a rather noticeable price for the previous checks, but this is only because the workload is now so greatly improved. Optimizing this implementation is interesting future work, but not urgent.	2017-05-31 11:09:43 -07:00
Dave Abrahams	801b9c5544	[stdlib] Move specialization from init to append Since init just calls append anyway, it's 2 birds/1 stone	2017-05-24 16:10:34 -07:00
Dave Abrahams	794a287c27	Kill a stray TAB How'd that get in there? Thanks, @moiseev	2017-05-24 04:10:25 -07:00

191 Commits