提交 · 06d4075a50b84d4b80e09eb5662fc1153bd559f7 · OpenHarmony / Third Party Musl

26 10月, 2019 4 次提交

R

update case mappings to unicode 12.1.0 · 06d4075a
由 Rich Felker 提交于 10月 25, 2019

06d4075a
U

update ctype data to unicode 12.1.0 · e95538fa
由 u_quark 提交于 10月 12, 2019

e95538fa

overhaul wide character case mapping implementation · a11a6246

由 Rich Felker 提交于 10月 25, 2019

the existing implementation of case mappings was very small (typically
around 1.5k), but unmaintainable, requiring manual addition of new
case mappings with each new edition of Unicode. often, it turned out
that newly-added case mappings were not easily representable in the
existing tightly-constrained table structures, requiring new hacks to
be invented and delaying support for new characters.

the new implementation added here follows the pattern used for
character class membership, with a two-level table allowing Unicode
blocks for which no data is needed to be elided. however, rather than
single-bit data, each character maps to a one of up to 6 case-mapping
rules available to its block, where 6 is floor(cbrt(256)) and allow 3
characters to be represented per byte (vs 8 with bit tables). blocks
that would need more than 6 rules designate one as an exception and
let lookup pass into a binary search of exceptional cases for the
block.

the number 6 was chosen empirically; many blocks would be ok with 4
rules (uncased, lower, upper, possible exceptions), some even just
with 2, but the latter are rare and fitting 4 characters per byte
rather than 3 does not save significant space. moreover, somewhat
surprisingly, there are sufficiently many blocks where even 4 rules
don't suffice without a lot of exceptions (blocks where some case
pairs are laced, others offset) that originally I was looking at
supporting variable-width tables, with 1-, 2-, or 3-bit entries,
thereby allowing blocks with 8 rules. as implemented in my
experiments, that version was significantly larger and involved more
memory accesses/cache lines.

improvements in size at the expense of some performance might be
possible by utilizing iswalpha data or merging the table of case
mapping identity with alphabetic identity. these were explored
somewhat when the code was first written, and might be worth
revisiting in the future.

a11a6246

add missing case mapping between U+03F3 and U+037F · e8aba58a

由 Rich Felker 提交于 10月 25, 2019

somehow this seems to have been overlooked. add it now so that
subsequent overhaul of case mapping implementation will not introduce
a functional change at the same time.

e8aba58a

24 10月, 2019 1 次提交
- R
  fix errno for posix_openpt with no free ptys available · 4fd0f205
  由 Rich Felker 提交于 10月 22, 2019
```
linux fails the open with ENOSPC, but POSIX mandates EAGAIN.
```
  4fd0f205
20 10月, 2019 6 次提交

adjust struct timespec definition to be time64-ready · 9b2921be

由 Rich Felker 提交于 10月 20, 2019

for time64 support on 32-bit archs, the kernel interfaces use a
timespec layout padded to match the representation of a pair of 64-bit
values, which requires endian-specific padding.

use of an ordinary, non-bitfield, named member for the padding is
undesirable because, on big endian archs, it would alter the
interpretation of traditional (non-designated) initializers of the
form {s,ns}, initializing the padding instead of the tv_nsec member.
unnamed bitfield members solve this problem by not taking part in
initialization, and were the expected solution when the kernel
interfaces were designed. however, they also have further advantages
which we take advantage of here:

positioning of the padding could be controlled by having a
preprocessor conditional with separate definitions of struct timespec
for little and big endian, but whether padding should appear at all is
a function of whether time_t is larger than long. this condition is
not something the preprocessor can determine unless we were to define
a new macro specifically for that purpose.

by using unnamed bitfield members instead of ordinary named members,
we can arrange for the size of the padding to collapse to zero when it
should not be present, just by using sizeof(time_t) and sizeof(long)
in the bitfield width expression, which can be any integer constant
expression.

9b2921be

clock_adjtime: generalize time64 not to assume old struct layout match · 928674dc

由 Rich Felker 提交于 10月 20, 2019

commit 2b4fd6f7 added time64 for this
function, but did so with a hidden assumption that the new time64
version of struct timex will be layout-compatible with the old one.
however, there is little benefit to doing it that way, and the cost is
permanent special-casing of 32-bit archs with 64-bit time_t in the
public interface definitions.

instead, do a full translation of the structure going in and out. this
commit is actually a revision to an earlier uncommited version of the
code.

928674dc

wait4, getrusage: add time64/x32 variant · 5850546e

由 Rich Felker 提交于 10月 19, 2019

presently the kernel does not actually define time64 versions of these
syscalls, and they're not really needed except to represent extreme
cpu time usage. however, x32's versions of the syscalls already behave
as time64 ones, meaning the functions were broken on x32 if the caller
used any part of the rusage result other than ru_utime and ru_stime.
commit 7e817114 made it possible to
fix this by treating x32's syscalls as time64 versions.

in the non-time64-syscall case, make the syscall with the rusage
destination pointer adjusted so that all members but the timevals line
up between the libc and kernel structures. on 64-bit archs, or present
32-bit archs with 32-bit time_t, the timevals will line up too and no
further work is needed. for future 32-bit archs with 64-bit time_t,
the timevals are copied into place, contingent on time_t being larger
than long.

5850546e

internally, define time64 rusage syscalls on x32 as the existing ones · 7e817114

由 Rich Felker 提交于 10月 19, 2019

this is analogous to commit 40aa18d5.
so far, there are not any actual time64 versions of the rusage
syscalls (getrusage and wait4) and might never be. however, the
existing x32 ones behave the way time64 versions would if they
existed: using 64-bit slots in place of all longs.

presently, wait4 and getrusage are broken on x32, storing the timevals
correctly but messing up everything else due to the long/kernel-long
mismatch. this would be a huge buffer overflow if not for the 16
reserved slots we left long ago, which suffice to prevent 14
double-sized longs from overflowing into unrelated memory. this commit
will make it possible to fix them.

7e817114

use struct pt_regs * rather than void * for powerpc[64] sigcontext regs · c2518a8e

由 Rich Felker 提交于 10月 19, 2019

this is to match the kernel and glibc interfaces. here, struct pt_regs
is an incomplete type, but that's harmless, and if it's completed by
inclusion of another header then members of the struct pointed to by
the regs member can be accessed directly without going through a cast
or intermediate pointer object.

c2518a8e

fix fpregset_t type on powerpc64 · c9f48cde

由 Rich Felker 提交于 10月 19, 2019

the userspace ucontext API has this as an array rather than a
structure.

commit 3c59a868 fixed the
corresponding mistake for vrregset_t, namely that the original
powerpc64 port used a mix of types from 32-bit powerpc and powerpc64
rather than matching the 64-bit types.

c9f48cde

19 10月, 2019 2 次提交

fix return value of ungetc when argument is outside unsigned char range · f6ecd0c2

由 Rich Felker 提交于 10月 18, 2019

aside from the special value EOF, ungetc is specified to accept and
convert values outside the range of unsigned char. conversion takes
place automatically as part of assignment when storing into the
buffer, but the return value is also required to be the resulting
converted value, and this requirement was not satisfied.

simplified from patch by Wang Jianjian.

f6ecd0c2

fix incorrect use of fabs on long double operand in floatscan.c · bff78954

由 Rich Felker 提交于 10月 18, 2019

based on patch by Dan Gohman, who caught this via compiler warnings.
analysis by Szabolcs Nagy determined that it's a bug, whereby errno
can be set incorrectly for values where the coercion from long double
to double causes rounding. it seems likely that floating point status
flags may be set incorrectly as a result too.

at the same time, clean up use of preprocessor concatenation involving
LDBL_MANT_DIG, which spuriously depends on it being a single unadorned
decimal integer literal, and instead use the equivalent formulation
2/LDBL_EPSILON. an equivalent change on the printf side was made in
commit bff6095d.

bff78954

18 10月, 2019 8 次提交

move pthread types out of per-arch alltypes.h · 2d3083e7