提交 · ffd8ac2dd50f99c3c83d7d9d845df9874ec3e7d5 · OpenHarmony / Third Party Musl

19 5月, 2013 1 次提交

math: fix two fma issues (only affects non-nearest rounding mode, x86) · ffd8ac2d

由 Szabolcs Nagy 提交于 5月 19, 2013

1) in downward rounding fma(1,1,-1) should be -0 but it was 0 with
gcc, the code was correct but gcc does not support FENV_ACCESS ON
so it used common subexpression elimination where it shouldn't have.
now volatile memory access is used as a barrier after fesetround.

2) in directed rounding modes there is no double rounding issue
so the complicated adjustments done for nearest rounding mode are
not needed. the only exception to this rule is raising the underflow
flag: assume "small" is an exactly representible subnormal value in
double precision and "verysmall" is a much smaller value so that
	(long double)(small plus verysmall) == small
then
	(double)(small plus verysmall)
raises underflow because the result is an inexact subnormal, but
	(double)(long double)(small plus verysmall)
does not because small is not a subnormal in long double precision
and it is exact in double precision.
now this problem is fixed by checking inexact using fenv when the
result is subnormal

ffd8ac2d

13 11月, 2012 1 次提交
- S
  
  math: use '#pragma STDC FENV_ACCESS ON' when fenv is accessed · 033a9d6a
  由 Szabolcs Nagy 提交于 11月 13, 2012
  
  033a9d6a
21 6月, 2012 1 次提交

math: fix fma bug on x86 (found by Bruno Haible with gnulib) · e5fb6820

由 nsz 提交于 6月 20, 2012

The long double adjustment was wrong:
The usual check is
  mant_bits & 0x7ff == 0x400
before doing a mant_bits++ or mant_bits-- adjustment since
this is the only case when rounding an inexact ld80 into
double can go wrong. (only in nearest rounding mode)

After such a check the ++ and -- is ok (the mantissa will end
in 0x401 or 0x3ff).

fma is a bit different (we need to add 3 numbers with correct
rounding: hi_xy + lo_xy + z so we should survive two roundings
at different places without precision loss)

The adjustment in fma only checks for zero low bits
  mant_bits & 0x3ff == 0
this way the adjusted value is correct when rounded to
double or *less* precision.
(this is an important piece in the fma puzzle)

Unfortunately in this case the -- is not a correct adjustment
because mant_bits might underflow so further checks are needed
and this was the source of the bug.

e5fb6820

20 3月, 2012 1 次提交

use scalbn or *2.0 instead of ldexp, fix fmal · 2786c7d2

由 nsz 提交于 3月 19, 2012

Some code assumed ldexp(x, 1) is faster than 2.0*x,
but ldexp is a wrapper around scalbn which uses
multiplications inside, so this optimization is
wrong.

This commit also fixes fmal which accidentally
used ldexp instead of ldexpl loosing precision.

There are various additional changes from the
work-in-progress const cleanups.

2786c7d2

19 3月, 2012 2 次提交
- N
  
  remove unnecessary TODO comments from fma.c · 682e4714
  由 nsz 提交于 3月 19, 2012
  
  682e4714
- N
  add fma implementation for x86 · b1cbd707
  由 nsz 提交于 3月 19, 2012
```
correctly rounded double precision fma using extended
precision arithmetics for ld80 systems (x87)
```
  b1cbd707
17 3月, 2012 1 次提交

make fma and lrint functions build without full fenv support · 2e77dc13

由 Rich Felker 提交于 3月 16, 2012

this is necessary to support archs where fenv is incomplete or
unavailable (presently arm). fma, fmal, and the lrint family should
work perfectly fine with this change; fmaf is slightly broken with
respect to rounding as it depends on non-default rounding modes to do
its work.

2e77dc13

13 3月, 2012 1 次提交

first commit of the new libm! · b69f695a

由 Rich Felker 提交于 3月 13, 2012

thanks to the hard work of Szabolcs Nagy (nsz), identifying the best
(from correctness and license standpoint) implementations from freebsd
and openbsd and cleaning them up! musl should now fully support c99
float and long double math functions, and has near-complete complex
math support. tgmath should also work (fully on gcc-compatible
compilers, and mostly on any c99 compiler).

based largely on commit 0376d44a890fea261506f1fc63833e7a686dca19 from
nsz's libm git repo, with some additions (dummy versions of a few
missing long double complex functions, etc.) by me.

various cleanups still need to be made, including re-adding (if
they're correct) some asm functions that were dropped.

b69f695a

OpenHarmony / Third Party Musl 大约 1 年 前同步成功

OpenHarmony / Third Party Musl
大约 1 年前同步成功