third_party_rust_regex

mirror of https://gitee.com/openharmony/third_party_rust_regex synced 2025-04-14 00:20:48 +00:00

Author	SHA1	Message	Date
Andrew Gallant	56ea4a835c	Fixes #99. TL;DR - The combination of case folding, character classes and nested negation is darn tricky. The problem presented in #99 was related to how we're storing case folded character classes. Namely, we only store the canonical representation of each character (which means that when we match text, we must apply case folding to the input). But when this representation is negated, information is lost. From #99, consider the negated class with a single range `x`. The class is negated before applying case folding. The negated class includes `X`, so that case folding includes both `X` and `x` even though the regex in #99 is specifically trying to not match either `X` or `x`. The solution is to apply case folding after negation. But given our representation, this doesn't work. Namely, case folding the range `x` yields `x` with a case insensitive flag set. Negating this class ends up matching all characters sans `x`, which means it will match `X`. So I've backtracked the representation to include all case folding variants. This means we can negate case folded classes and get the expected result. e.g., case folding the class `[x]` yields `[xX]`, and negating `[xX]` gives the desired result for the regex in #99.	2015-07-05 11:46:11 -04:00
Andrew Gallant	1e79c4d9ee	regex-syntax: version bump	2015-06-02 18:16:43 -04:00
Andrew Gallant	f9fc8614d2	Optimize case folding. When `regex-syntax` is compiled under debug mode, case folding can take a significant amount of time. This path is easily triggered by using case insensitive regexes. This commit speeds up the case folding process by skipping binary searches, although it is still not optimal. It could probably benefit from a fresh approach, but let's leave it alone for now.	2015-06-02 18:16:04 -04:00
Andrew Gallant	7a72b1fc57	version bump. Actually, I don't think I needed to bump `regex` proper. Whoops.	2015-05-28 19:14:55 -04:00
Pascal Hertleif	c427a3f4ff	Adjust Some Formatting, Use `checkadd` More Related to #88	2015-05-29 00:52:43 +02:00
Pascal Hertleif	13eb7bef5f	Add '\#' Escaping Fixes #88	2015-05-28 20:22:54 +02:00
Pascal Hertleif	349158ed27	[WIP] Treat '#' as Punctuation Relates to #88	2015-05-28 18:31:06 +02:00
Andrew Gallant	6d5e909e5e	Fixes from code review. The big change here is the addition of a non-public variant in the error enums. This will hint to users that one shouldn't exhaustively match the enums in case new variants are added.	2015-05-27 18:43:28 -04:00
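
The ordering problem described in commit 56ea4a835c (case folding versus negation) can be illustrated with a small standalone sketch. This is not the `regex-syntax` implementation: the `Class` type, the `case_fold` and `negate` helpers, and the restriction to ASCII are all assumptions made for illustration. It only shows why negating `[x]` before folding still matches `x` and `X`, while folding to `[xX]` first and then negating excludes both.

```rust
use std::collections::BTreeSet;

/// Hypothetical toy character class: the explicit set of ASCII chars it matches.
/// (The real crate stores ranges; a set keeps the sketch short.)
type Class = BTreeSet<char>;

/// Adds both ASCII case variants of every member.
/// This is a stand-in for full Unicode case folding.
fn case_fold(class: &Class) -> Class {
    let mut out = Class::new();
    for &c in class {
        out.insert(c.to_ascii_lowercase());
        out.insert(c.to_ascii_uppercase());
    }
    out
}

/// Complements the class within the ASCII range 0..=127.
fn negate(class: &Class) -> Class {
    (0u8..=127)
        .map(char::from)
        .filter(|c| !class.contains(c))
        .collect()
}

fn main() {
    let x: Class = std::iter::once('x').collect();

    // Negate first, then fold: the negated class contains 'X',
    // and folding 'X' brings 'x' back in.
    let negate_then_fold = case_fold(&negate(&x));
    assert!(negate_then_fold.contains(&'x'));
    assert!(negate_then_fold.contains(&'X'));

    // Fold first (giving {x, X}), then negate: neither variant survives.
    let fold_then_negate = negate(&case_fold(&x));
    assert!(!fold_then_negate.contains(&'x'));
    assert!(!fold_then_negate.contains(&'X'));
    assert!(fold_then_negate.contains(&'y'));
}
```

Storing every case variant explicitly, as the commit message describes, is what makes the second ordering work: the negation then has complete information about which characters to exclude.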

158 Commits