commit | 1eef3623293115fe2cd2fc52ccf85a723f7ad70f | [log] [tgz] |
---|---|---|
author | Andy Heninger <andy.heninger@gmail.com> | Tue Jun 09 13:19:17 2020 -0700 |
committer | Andy Heninger <andy.heninger@gmail.com> | Wed Jun 17 12:00:14 2020 -0700 |
tree | f23996397151a11f1ef0979b3f5724b7c4ce863c | |
parent | 85aee40cc3b25eeaccba6ef59fca124fd9ee5100 [diff] |
ICU-13565 Break Iteration, remove the dictionary bit from the implementation. For identifying text that needs to be handled by a word dictionary for Break Iteration, change from using a bit in the character category to sorting all dictionary categories together, and recording the boundary between the non-dictionary and dictionary ranges. This is internal to the implementaion. It does not affect behavior. It does increase the number of character categories that can be handled using a compact 8 bit Trie, from 127 to 255.
This is the repository for the International Components for Unicode. The ICU project is under the stewardship of The Unicode Consortium.
master
branch)Build | Status |
---|---|
TravisCI | |
Azure Pipelines | |
Azure Pipelines (Exhaustive Tests) | |
Azure Pipelines (Valgrind ICU4C) | |
AppVeyor | |
Fuzzing |
icu4c/
ICU for C/C++icu4j/
ICU for Javatools/
Toolsvendor/
Vendor dependenciesPlease see ./icu4c/LICENSE (C and J are under an identical license file.)
Copyright © 2016 and later Unicode, Inc. and others. All Rights Reserved. Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries. Terms of Use and License