Commits · 17361482108e2a8757dc4aa69ed36b002251a08f · OpenHarmony / Third Party Musl

08 May, 2012 · 3 commits

fix copy and paste error in regex code causing mishandling of \) in BRE · 17361482
Committed by Rich Felker on May 07, 2012

fix regex breakage in last commit (failure to handle empty regex, etc.) · a5a47783
Committed by Rich Felker on May 07, 2012

fix ugly bugs in TRE regex parser · d7a90b35

Committed by Rich Felker on May 07, 2012

1. * in BRE is not special at the beginning of the regex or a
subexpression. this broke ncurses' build scripts.

2. \\( in BRE is a literal \ followed by a literal (, not a literal \
followed by a subexpression opener.

3. the ^ in \\(^ in BRE is a literal ^ only at the beginning of the
entire BRE. POSIX allows treating it as an anchor at the beginning of
a subexpression, but TRE's code for checking if it was at the
beginning of a subexpression was wrong, and fixing it for the sake of
supporting a non-portable usage was too much trouble when just
removing this non-portable behavior was much easier.

this patch also moved lots of the ugly logic for empty atom checking
out of the default/literal case and into new cases for the relevant
characters. this should make parsing faster and make the code smaller.
if nothing else it's a lot more readable/logical.

at some point i'd like to revisit and overhaul lots of this code...
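The first two points of this commit message can be checked against any POSIX `regcomp` implementation. The following is a minimal sketch, not code from musl or TRE: `bre_matches` is a hypothetical helper introduced here for illustration, and it compiles the pattern as a BRE simply by not passing `REG_EXTENDED`.

```c
#include <assert.h>
#include <regex.h>
#include <stddef.h>

/* Hypothetical helper (not part of musl or TRE): compile `pattern` as a
 * POSIX BRE (no REG_EXTENDED flag) and report whether it matches anywhere
 * in `text`. Returns 1 on match, 0 on no match, -1 if regcomp fails. */
int bre_matches(const char *pattern, const char *text)
{
    regex_t re;
    int rc;

    assert(pattern != NULL && text != NULL);

    /* REG_NOSUB: only the match/no-match result is needed. */
    if (regcomp(&re, pattern, REG_NOSUB) != 0)
        return -1;

    rc = regexec(&re, text, 0, NULL, 0);
    regfree(&re);
    return rc == 0 ? 1 : 0;
}
```

With a conforming implementation, `bre_matches("*a", "x*a")` should report a match and `bre_matches("*a", "xa")` should not, because a leading `*` in a BRE is an ordinary character. Likewise the BRE `\\(` (written `"\\\\("` as a C string literal) should match the text `a\(b`, since it is a literal backslash followed by a literal parenthesis.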

14 Apr, 2012 · 1 commit

remove invalid code from TRE · 386b34a0

Committed by Rich Felker on Apr 13, 2012

TRE wants to treat + and ? after a +, ?, or * as special; ? means
ungreedy and + is reserved for future use. however, this is
non-conformant. although redundant, these redundant characters have
well-defined (no-op) meaning for POSIX ERE, and are actually _literal_
characters (which TRE is wrongly ignoring) in POSIX BRE mode.

the simplest fix is to simply remove the unneeded nonstandard
functionality. as a plus, this shaves off a small amount of bloat.
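The BRE half of this description is easy to observe from user code. The sketch below is an illustration under stated assumptions rather than anything taken from the patch: `posix_regex_matches` is a hypothetical helper, and only the BRE case (`cflags` of 0) is discussed, where `+` and `?` are ordinary characters even when they follow `*`.

```c
#include <assert.h>
#include <regex.h>
#include <stddef.h>

/* Hypothetical helper (not part of musl or TRE): compile `pattern` with the
 * given regcomp flags (0 for BRE, REG_EXTENDED for ERE) and search `text`.
 * Returns 1 on match, 0 on no match, -1 if regcomp fails. */
int posix_regex_matches(const char *pattern, int cflags, const char *text)
{
    regex_t re;
    int rc;

    assert(pattern != NULL && text != NULL);

    if (regcomp(&re, pattern, cflags | REG_NOSUB) != 0)
        return -1;

    rc = regexec(&re, text, 0, NULL, 0);
    regfree(&re);
    return rc == 0 ? 1 : 0;
}
```

In BRE mode, `posix_regex_matches("^a*?$", 0, "aaa?")` should match while the same pattern against `"aaa"` should not, because the trailing `?` must appear literally in the text. The same holds for `^a+$` against `"a+"` versus `"aa"`.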

21 Mar, 2012 · 1 commit

upgrade to latest upstream TRE regex code (0.8.0) · ad47d45e

Committed by Rich Felker on Mar 20, 2012

the main practical results of this change are
1. the regex code is no longer subject to LGPL; it's now 2-clause BSD
2. most (all?) popular nonstandard regex extensions are supported

I hesitate to call this a "sync" since both the old and new code are
heavily modified. in one sense, the old code was "more severely"
modified, in that it was actively hostile to non-strictly-conforming
expressions. on the other hand, the new code has eliminated the
useless translation of the entire regex string to wchar_t prior to
compiling, and now only converts multibyte character literals as
needed.

in the future i may use this modified TRE as a basis for writing the
long-planned new regex engine that will avoid multibyte-to-wide
character conversion entirely by compiling multibyte bracket
expressions specific to UTF-8.

17 Jun, 2011 · 1 commit

duplicate re_nsub in LSB/glibc ABI compatible location · 32aea208
Committed by Rich Felker on Jun 16, 2011

12 Feb, 2011 · 1 commit

initial check-in, version 0.5.0 · 0b44a031
Committed by Rich Felker on Feb 12, 2011

OpenHarmony / Third Party Musl · synced successfully about 1 year ago
