1181 Commits

Author SHA1 Message Date
openharmony_ci
c67d65738e
!2 Add OAT.xml and README.OpenSource
Merge pull request !2 from fangting/master
OpenHarmony-v4.0.3-Release OpenHarmony-v4.0.2-Release OpenHarmony-v5.0-Beta1 OpenHarmony-v4.1-Beta1 OpenHarmony-v4.0-Beta1 OpenHarmony-v4.0-Beta2 OpenHarmony-v4.0-Release OpenHarmony-v4.0.1-Release OpenHarmony-v4.1-Release OpenHarmony-v4.1.1-Release master-v weekly_20240115-v OpenHarmony-v5.0.0-Release OpenHarmony-v4.0.4-Release OpenHarmony-v5.0.1-Release OpenHarmony-v4.1.2-Release OpenHarmony-v5.0.2-Release OpenHarmony-v4.1.3-Release OpenHarmony-v5.0.3-Release
2023-04-14 08:10:33 +00:00
fangting
f0653fd6b8 Add OAT.xml and README.OpenSource
Signed-off-by: fangting <fangting12@huawei.com>
2023-04-14 14:15:21 +08:00
openharmony_ci
b36eed6b75
!1 [regex]Add GN Build Files and Custom Modifications to Rust Third-party Libraries
Merge pull request !1 from lubinglun/master
2023-04-13 11:33:38 +00:00
lubinglun
683028773a Add GN Build Files and Custom Modifications
Issue:https://gitee.com/openharmony/build/issues/I6UFTP
Signed-off-by: lubinglun <lubinglun@huawei.com>
2023-04-12 17:26:48 +08:00
Andrew Gallant
a9b2e02352
1.7.1 github.com/rust-lang/regex/1.7.1 github.com/rust-lang/regex.git/1.7.1 2023-01-09 08:19:50 -05:00
Andrew Gallant
98c1b63ffe
changelog: 1.7.1 2023-01-09 08:19:34 -05:00
Alex
9330ea50f5
ci: harden configuration
This makes it so the permissions are locked down by default. The threat
model here is something like, "what happens if an authorized party gains
control of the non-PR CI configuration somehow."

To be honest, I (BurntSushi) don't quite understand how that might
happen without also the ability to set the permissions itself. But
locking permissions down by default does seem like a good and sensible
thing to do.

Closes #932
2022-12-08 14:53:25 -05:00
Jonas Berlin
ac2d0e1b33
impl: optimize replacen loop
The previous implementation didn't bail out of the replace
loop when the limit was reached until one more than the
total number of 'find' operations had completed. By moving
the limit check to the end of the loop body, we execute only
the number of 'find' operations that is necessary, instead of
one extra.

This optimization only applies to 'replacen' calls with a limit
not equal to '0'. That includes 'replace' but not 'replace_all'.

PR #930
2022-11-28 13:37:22 -05:00
Andrew Gallant
f871a8eb1d
1.7.0 2022-11-05 13:35:07 -04:00
Andrew Gallant
ca7b99c647
changelog: 1.7.0 2022-11-05 13:34:25 -04:00
Andrew Gallant
ea3b132080
regex-syntax-0.6.28 2022-11-05 13:32:31 -04:00
Andrew Gallant
9a1892737b syntax: update to Unicode 15
Closes #916
2022-11-05 13:32:03 -04:00
Andrew Gallant
0d0023e412
rure-0.2.2 2022-10-05 10:56:19 -04:00
Andrew Gallant
3bac5c8075
capi: add 'rlib' crate type
I didn't realize this was a useful build output for a C library, but I
guess it is. Namely, it permits it to be built with other rlibs into one
giant single shared library.

Fixes #909
2022-10-05 10:53:46 -04:00
Jakub Kołodziejczak
159a63c85e
doc: add a note about the empty regex
This improvement is a consequence of the Q&A discussion from
https://github.com/rust-lang/regex/discussions/896

PR #897
2022-07-26 07:19:38 -04:00
Andrew Gallant
fc6f5ccc51
readme: re-word usage to remove version number
We cheat a little bit by leveraging 'cargo add'.

Closes #889
2022-07-19 08:34:57 -04:00
Andrew Gallant
67824c7af2 ci: switch to dtolnay/rust-toolchain
The actions-rs/toolchain we were using appears dead:
https://github.com/actions-rs/toolchain/issues/216

Here's dtolnay's take: https://github.com/dtolnay/rust-toolchain

In particular, it looks like a much more maintainable path here. For
this one, if it ever goes unmaintained, I can maintain it myself without
too much fuss.

See also: https://old.reddit.com/r/rust/comments/vyx4oj/actionsrs_organization_became_unmaintained/ig54zv7/
2022-07-14 13:13:46 -04:00
Linda_pp
54660765af
capi: fix 'unused return value' warnings
Somewhat recently, 'CString::from_raw' got a '#[must_use]' slapped
on it. Arguably, writing 'drop' around its return value is indeed much
clearer. So we do that here.

We also do that for 'Box::from_raw' even though it doesn't have a
'#[must_use]' on it. But the same principle applies.

PR #882
2022-07-14 09:36:58 -04:00
Andrew Gallant
fc9ee6a249
1.6.0 2022-07-05 14:00:31 -04:00
Andrew Gallant
8ddc8ece4b
deps: bump regex-syntax to 0.6.27 2022-07-05 13:59:55 -04:00
Andrew Gallant
8c0eccd0c6
regex-syntax-0.6.27 2022-07-05 13:59:34 -04:00
Andrew Gallant
dee5c25d8e
changelog: 1.6.0 2022-07-05 13:59:16 -04:00
Elie ROUDNINSKI
9ca3099037 regex: fix clippy lints up to rust 1.41.1
Some lints have been intentionally ignored, especially:

* any lints that would change public APIs (like &self -> self)
* any lints that would introduce new public APIs (like Default over new)

Closes #780
2022-07-05 13:53:46 -04:00
Elie ROUDNINSKI
de838287bb syntax: fix clippy lints up to rust 1.41.1
Some lints have been intentionally ignored, especially:

* any lints that would change public APIs (like &self -> self)
* any lints that would introduce new public APIs (like Default over new)
2022-07-05 13:53:46 -04:00
Alexander Gonzalez
b87cd88476 syntax: include only the start of the character class on error
This fixes a bug where the caret in some types of error messages was not
quite correct.

Fixes #792, Closes #794
2022-07-05 13:53:46 -04:00
Alexander Gonzalez
9d1478cfb5 doc: fix typos 2022-07-05 13:53:46 -04:00
Christopher Durham
7d21662735 doc: clarify Captures::len includes non-matching captures
We do the same for CaptureLocations too.

Closes #832
2022-07-05 13:53:46 -04:00
Élie ROUDNINSKI
5e98788947
perf: use is_match_at instead of shortest_match_at
Local benchmarks show up to 15% reduction in terms of number of
retired instructions executed and at least 5% reduction in terms
of CPU time.

This is basically a bit of a manual inlining here, instead of doing more
redirection.

PR #788
2022-07-05 13:31:23 -04:00
Alex Chan
0c2774894a
doc: fix spelling typo of "insignificant"
PR #839
2022-07-05 13:07:24 -04:00
James Mills
fb4b1c1cce
api: impl ExactSizeIterator for SubCaptureMatches
PR #857
2022-07-05 13:06:39 -04:00
Danny McClanahan
039bb4ddf6
doc: demonstrate a separately-compiled regex set
This adds an example of an intended pattern where the regex
set is used to find a matching regex, and then a separately
compiled regex is used to find where it matched.

PR #861
2022-07-05 13:04:55 -04:00
Alexander Beedie
30dba7422d
doc: fix typo in 'is_meta_character'
PR #873
2022-07-05 13:02:32 -04:00
Andrew Gallant
c31428ad59 ci: various CI updates
Mostly updating various "actions" and doing a little tidying.
2022-07-05 13:00:10 -04:00
Andrew Gallant
95af74d8d9 syntax: update to Unicode 14
Closes #878
2022-07-05 13:00:10 -04:00
Andrew Gallant
8fe3716e29 scripts: update docs for 'generate-unicode-tables'
The docs are now updated to work with Unicode 14. (In particular,
emoji-data.txt no longer needs to be downloaded separately.) We also
include a note about adding a new case for "age" in regex-syntax.
2022-07-05 13:00:10 -04:00
Andrew Gallant
6ff285e375 tests: add Unicode tests for 'Vithkuqi'
Vithkuqi support was added to Unicode 14.

Fixes #877
2022-07-05 13:00:10 -04:00
Andrew Gallant
c01b633804
bench: add new benchmark baseline
I added this so that I can compare the results of the old benchmark
suite with the new one I'm working on in regex-automata. (The idea is to
port all or most of the benchmarks from the old suite and make sure the
results are at least roughly consistent.)
2022-07-01 09:15:49 -04:00
Andrew Gallant
ea525cd1bf
bench: remove D and C++ regex engines
Neither of them were particularly competitive and they make building the
benchmark harness more trouble than it's worth.
2022-07-01 09:15:49 -04:00
Linda_pp
2f23f6034e
impl: remove duplicate of 'cfg' attribute
PR #871
2022-06-18 12:14:09 -04:00
Laurenz
284d846f70
doc: fix return value of match_indices in pattern example
PR #870
2022-06-16 09:05:22 -04:00
Guillaume Gomez
a6ec74f076
doc: fix rustdoc warning about automatic link
PR #868
2022-06-14 09:21:53 -04:00
Andrew Gallant
8e8b2fcdfe ci: attempt to fix cross in CI
This is a perfect example of why adding support for more targets is
costly. Every so often, the cross compiling toolchain breaks for
mysterious reasons.
2022-06-14 09:06:42 -04:00
Andrew Gallant
9aef5b1edc
1.5.6 2022-05-20 14:06:04 -04:00
Andrew Gallant
2931b070fd
syntax: bump minimum regex-syntax version to 0.6.26 2022-05-20 14:05:37 -04:00
Andrew Gallant
b41bde0b85
regex-syntax-0.6.26 2022-05-20 14:05:16 -04:00
Andrew Gallant
d98da65bb3
changelog: 1.5.6 2022-05-20 14:03:31 -04:00
Andrew Gallant
1c19619672 syntax: fix literal extraction for 'ab??'
Previously, 'ab??' returned [Complete(ab), Complete(a)], but the order
matters here because of greediness. The correct result is [Complete(a),
Complete(ab)].

Instead of trying to actually fix literal extraction (which is a mess),
we just rewrite 'ab?' (and 'ab??') as 'ab*'. 'ab*' still produces
literals in the incorrect order, i.e., [Cut(ab), Complete(a)], but since
one is cut we are guaranteed that the regex engine will be called to
confirm the match. In so doing, it will correctly report 'a' as a match
for 'ab??' in 'ab'.

Fixes #862
2022-05-20 14:02:08 -04:00
Andrew Gallant
88a2a62d86 syntax: fix 'is_match_empty' predicate
This was incorrectly defined for \b. Previously, I had erroneously made
it return true only for \B since \B matches '' and \b does not match
''. However, \b does match the empty string. Like \B, it only matches a
subset of empty strings, depending on what the surrounding context is.
The important bit is that it can match *an* empty string, not that it
matches *the* empty string.

We were not yet using this predicate anywhere in the regex crate, so we
just fix the implementation and update the tests.

This does present a compatibility hazard for anyone who was using this
function, but as of this time, I'm considering this a bug fix since \b
clearly matches an empty string.

Fixes #859
2022-05-18 08:18:14 -04:00
Andrew Gallant
72f09f1aeb syntax: fix ascii class union bug
This fixes a bug in how ASCII class unioning was implemented. Namely, it
previously and erroneously unioned together two classes and then applied
negation/case-folding based on the most recently added class, even if
the class added previously wasn't negated. So for example, given the
regex '[[:alnum:][:^ascii:]]', this would initialize the class with
'[:alnum:]', then add all '[:^ascii:]' codepoints and then negate the
entire thing because of the negation in '[:^ascii:]'. Negating the
entire thing is clearly wrong and not the intended semantics.

We fix this by applying negation/case-folding only to the class we're
dealing with, and then we union it with whatever existing class we're
building.

Fixes #680
2022-05-18 08:18:14 -04:00
cui fliter
b5372864e2
doc: fix some typos
PR #856
2022-04-24 13:24:49 -04:00