Dan Gohman 479c6f0da9 Handle multiple starters in the stream-safe fuzzer.
NFC compositions can involve multiple starters, such as `\u{11347}` and
`\u{11357}`. Adjust the counting iterator in the streaming fuzzer to
only count non-starters, so that it doesn't over-count.

Fixes #76.
2021-05-17 21:20:24 -07:00
2021-02-08 17:06:21 -08:00
2021-01-05 22:51:46 -08:00
2020-04-10 23:37:38 +03:00
2021-02-08 10:02:55 -08:00
2020-11-18 13:33:10 -08:00

unicode-normalization

Build Status Docs

Unicode character composition and decomposition utilities as described in Unicode Standard Annex #15.

This crate requires Rust 1.36+.

extern crate unicode_normalization;

use unicode_normalization::char::compose;
use unicode_normalization::UnicodeNormalization;

fn main() {
    assert_eq!(compose('A','\u{30a}'), Some('Å'));

    let s = "ÅΩ";
    let c = s.nfc().collect::<String>();
    assert_eq!(c, "ÅΩ");
}

crates.io

You can use this package in your project by adding the following to your Cargo.toml:

[dependencies]
unicode-normalization = "0.1.16"

no_std + alloc support

This crate is completely no_std + alloc compatible. This can be enabled by disabling the std feature, i.e. specifying default-features = false for this crate on your Cargo.toml.

S
Description
Unicode字符组成和分解工具
Readme 2.9 MiB
Languages
Rust 96.5%
Python 3.5%