Commit Graph

6 Commits

Author SHA1 Message Date
Alejandro Alonso
b49466de06 Update Unicode generators to support 16 2025-01-14 14:16:29 -08:00
Alejandro Alonso
70a56e60f3 Add Unicode Word Break generator 2024-02-16 17:11:00 -08:00
Alejandro Alonso
3b402f0179 [stdlib] Implement native Unicode.Scalar binary properties (#39597)
* Generate Unicode data for Scalar Binary Properties

* Use native scalar binary property lookup

* Add _BinaryProperties to Scalar Properties

narrow access control

* Upgrade the notice to a warning in UnicodeScalarProperties
2021-11-15 15:20:46 -08:00
Alejandro Alonso
5a0bbb9f89 [stdlib] Implement native grapheme breaking for String (#37864)
* Implement GraphemeWalker that does native grapheme breaking

* Bridged strings use native grapheme breaking for forward strides

* Implement bidirectional native grapheme breaking for native and foreign strings

* Remove ICU's grapheme breaking support

* Use UnicodeScalarView to implement GraphemeWalker

use an Iterator approach

remove Iterator conformance

* Incorporate Michael's feedback

more comments addressed

fix crlf bug

* Try bringing back some old fast paths

* Parameterize nextBoundary and previousBoundary

Parameterize nextBoundary and previousBoundary

* Implement Michael's suggestions
2021-11-01 16:52:28 -07:00
Alejandro Alonso
9b25a2d02b Add a Unicode normalization data generator 2021-09-29 14:20:21 -07:00
Alejandro Alonso
b189084f4f [utils] Add a Unicode data generator util package (#39213)
* Add a Unicode data generator util package

* Add a common GenUtils library

* Use string methods for reading/writing files
2021-09-09 15:47:37 -07:00