1. 22 Sep, 2013 1 commit
    • R
      fix arm atomic store and generate simpler/less-bloated/faster code · 35a6801c
      Rich Felker committed
      atomic store was lacking a barrier, which was fine for legacy arm with
      no real smp and kernel-emulated cas, but unsuitable for more modern
      systems. the kernel provides another "kuser" function, at 0xffff0fa0,
      which could be used for the barrier, but using that would drop support
      for kernels 2.6.12 through 2.6.14 unless an extra conditional were
      added to check for barrier availability. just using the barrier in the
      kernel cas is easier, and, based on my reading of the assembly code in
      the kernel, does not appear to be significantly slower.
      
      at the same time, other atomic operations are adapted to call the
      kernel cas function directly rather than using a_cas; due to small
      differences in their interface contracts, this makes the generated
      code much simpler.
      35a6801c
  2. 20 Sep, 2013 2 commits
    • R
      fix potential deadlock bug in libc-internal locking logic · e803829e
      Rich Felker committed
      if a multithreaded program became non-multithreaded (i.e. all other
      threads exited) while one thread held an internal lock, the remaining
      thread would fail to release the lock. if the program then became
      multithreaded again at a later time, any further attempts to obtain
      the lock would deadlock permanently.
      
      the underlying cause is that the value of libc.threads_minus_1 at
      unlock time might not match the value at lock time. one solution would
      be returning a flag to the caller indicating whether the lock was
      taken and needs to be unlocked, but there is a simpler solution: using
      the lock itself as such a flag.
      
      note that this flag is not needed anyway for correctness; if the lock
      is not held, the unlock code is harmless. however, the memory
      synchronization properties associated with a_store are costly on some
      archs, so it's best to avoid executing the unlock code when it is
      unnecessary.
      e803829e
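The idea described above, using the lock word itself as the flag that decides whether the unlock code runs, can be sketched roughly as follows. This is a simplified illustration with C11 atomics, not the code from the commit: `threads_minus_1`, `sketch_lock`, and `sketch_unlock` are hypothetical stand-ins, and a busy-wait replaces whatever wait/wake mechanism the real implementation uses.

```c
#include <stdatomic.h>

/* Hypothetical stand-in for libc.threads_minus_1. */
static int threads_minus_1;

static void sketch_lock(atomic_int *l)
{
    /* Single-threaded: skip taking the lock entirely. */
    if (threads_minus_1 == 0)
        return;
    /* Spin until the previous value was 0 (lock was free). */
    while (atomic_exchange(l, 1))
        ;
}

static void sketch_unlock(atomic_int *l)
{
    /* Decide based on the lock word, not on the current thread count,
     * so a lock taken while multithreaded is always released. The
     * check only avoids a costly store when the lock was never taken. */
    if (atomic_load_explicit(l, memory_order_relaxed))
        atomic_store(l, 0);
}
```

With this shape, a lock taken while `threads_minus_1` was nonzero is still released after the count drops to zero, so a later return to multithreaded operation does not find the lock stuck.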
    • R
      correct the sysconf value for RTSIG_MAX · d8e283df
      Rich Felker committed
      this is the number of realtime signals available, not the maximum
      signal number or total number of signals.
      d8e283df
  3. 17 Sep, 2013 1 commit
  4. 16 Sep, 2013 5 commits
    • R
      fix clobbering of caller's stack in mips __clone function · cffb9e1e
      Rich Felker committed
      this was resulting in crashes in posix_spawn on mips, and would have
      affected applications calling clone too. since the prototype for
      __clone has it as a variadic function, it may not assume that 16($sp)
      is writable for use in making the syscall. instead, it needs to
      allocate additional stack space, and then adjust the stack pointer
      back in both of the code paths for the parent process/thread.
      cffb9e1e
    • S
      sys/resource.h: add PRIO_MIN and PRIO_MAX for getpriority and setpriority · 90710df5
      Szabolcs Nagy committed
      These constants are not specified by POSIX, but they are in the reserved
      namespace, and glibc and BSD systems seem to provide them as well.
      (Note that POSIX specifies -NZERO and NZERO-1 to be the limits, but
      PRIO_MAX equals NZERO)
      90710df5
    • S
      update include/elf.h following glibc changes · 268375c1
      Szabolcs Nagy committed
      the changes were verified using various sources:
      linux: include/uapi/linux/elf.h
      binutils: include/elf/common.h
      glibc: elf/elf.h
      sysv gabi: http://www.sco.com/developers/gabi/latest/contents.html
      sun linker docs: http://docs.oracle.com/cd/E18752_01/pdf/817-1984.pdf
      and platform specific docs
      
      - fixed:
      EF_MIPS_* E_MIPS_* e_flags: fixed according to glibc and binutils
      
      - added:
      ELFOSABI_GNU for EI_OSABI entry: glibc, binutils and sysv gabi
      EM_* e_machine values: updated according to linux and glibc
      PN_XNUM e_phnum value: from glibc and linux, see oracle docs
      NT_* note types: updated according to linux and glibc
      DF_1_* flags for DT_FLAGS_1 entry: following glibc and oracle docs
      AT_HWCAP2 auxv entry for more hwcap bits according to linux and glibc
      R_386_SIZE32 relocation according to glibc and binutils
      EF_ARM_ABI_FLOAT_* e_flags: added following glibc and binutils
      R_AARCH64_* relocs: added following glibc and aarch64 elf specs
      R_ARM_* relocs: according to glibc, binutils and arm elf specs
      R_X86_64_* relocs: added missing relocs following glibc
      
      - removed:
      HWCAP_SPARC_* flags were moved to arch specific header in glibc
      R_ARM_SWI24 reloc is marked as obsolete in glibc, not present in binutils
        not specified in arm elf spec, R_ARM_TLS_DESC reused its number
        see http://www.codesourcery.com/publications/RFC-TLSDESC-ARM.txt
      
      - glibc changes not pulled in:
      ELFOSABI_ARM_AEABI (bare-metal system, binutils and glibc disagree about the name)
      R_68K_* relocs for unsupported platform
      R_SPARC_* ditto
      EF_SH* ditto (e_flags)
      EF_S390* ditto (e_flags)
      R_390* ditto
      R_MN10300* ditto
      R_TILE* ditto
      268375c1
    • R
      omit CLONE_PARENT flag to clone in pthread_create · 271c2119
      Rich Felker committed
      CLONE_PARENT is not necessary (CLONE_THREAD provides all the useful
      parts of it) and Linux treats CLONE_PARENT as an error in certain
      situations, without noticing that it would be a no-op due to
      CLONE_THREAD. this error case prevents, for example, use of a
      multi-threaded init process and certain usages with containers.
      271c2119
    • R
      use symbolic names for clone flags in pthread_create · f68a3468
      Rich Felker committed
      f68a3468
  5. 15 Sep, 2013 8 commits
  6. 13 Sep, 2013 1 commit
    • R
      fix x86_64 lrintl asm, again · 2f1de805
      Rich Felker committed
      the underlying problem was not incorrect sign extension (fixed in the
      previous commit to this file by nsz) but that code that treats "long"
      as 32-bit was copied blindly from i386 to x86_64.
      
      now lrintl is identical to llrintl on x86_64, as it should be.
      2f1de805
  7. 10 Sep, 2013 1 commit
    • R
      do not use default when dynamic linker fails to open existing path file · ff4be700
      Rich Felker committed
      if fopen fails for a reason other than ENOENT, we must assume the
      intent is that the path file be used. failure may be due to
      misconfiguration or intentional resource-exhaustion attack (against
      suid programs), in which case falling back to loading libraries from
      an unintended path could be dangerous.
      ff4be700
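The decision described above (fall back to the default only when the path file does not exist) might look roughly like the following. This is a sketch under assumptions, not the dynamic linker's actual code: the enum and `choose_path_source` are hypothetical names introduced for illustration.

```c
#include <errno.h>
#include <stdio.h>

/* Hypothetical result type for this sketch. */
enum path_source {
    PATH_FROM_FILE,  /* path file opened; use its contents */
    PATH_DEFAULT,    /* path file does not exist; use built-in default */
    PATH_NONE        /* path file exists or may exist but could not be
                      * opened; do not fall back to the default */
};

static enum path_source choose_path_source(const char *path_file, FILE **out)
{
    *out = fopen(path_file, "r");
    if (*out)
        return PATH_FROM_FILE;
    /* Only a missing file means "no configuration was intended". */
    if (errno == ENOENT)
        return PATH_DEFAULT;
    /* Any other failure (EMFILE, ENOMEM, EACCES, ...) may be a
     * misconfiguration or an attack, so the default is not used. */
    return PATH_NONE;
}
```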
  8. 07 Sep, 2013 2 commits
    • S
      067aea7c
    • S
      math: remove STRICT_ASSIGN macro · 9b0fcb44
      Szabolcs Nagy committed
      gcc did not always drop excess precision according to c99 at assignments
      before version 4.5 even if -std=c99 was requested which caused badly
      broken mathematical functions on i386 when FLT_EVAL_METHOD!=0
      
      but STRICT_ASSIGN was not used consistently and it is worked around for
      old compilers with -ffloat-store so it is no longer needed
      
      the new convention is to get the compiler to respect c99 semantics and,
      when excess precision is not harmful, to use float_t or double_t or to
      specialize code using FLT_EVAL_METHOD
      9b0fcb44
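As a rough illustration of the convention described above, intermediate results can be kept in `double_t` (which is at least as wide as `double` and matches the evaluation format), with a plain assignment to `double` at the point where excess precision must be dropped. The function `dot2` is a hypothetical example, not code from the commit, and it assumes a compiler that follows C99 semantics for assignments.

```c
#include <math.h>   /* double_t */

static double dot2(double a, double b, double c, double d)
{
    /* Intermediates may carry excess precision when
     * FLT_EVAL_METHOD != 0; that is harmless here and avoids
     * forced stores on x87. */
    double_t t = a * b;
    t += c * d;

    /* Under C99 semantics, assigning to a double rounds the value
     * to double precision, so no STRICT_ASSIGN-style macro is used. */
    double r = t;
    return r;
}
```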
  9. 06 Sep, 2013 2 commits
    • S
      math: support invalid ld80 representations in fpclassify · f657fe4b
      Szabolcs Nagy committed
      apparently gnulib requires invalid long double representations
      to be handled correctly in printf so we classify them according
      to how the fpu treats them: bad inf is nan, bad nan is nan,
      bad normal is nan and bad subnormal/zero is minimal normal
      f657fe4b
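The classification rules listed above can be sketched on the raw fields of an x86 80-bit extended value. To stay architecture-independent, this illustration takes the 16-bit sign/exponent word and the 64-bit mantissa (with its explicit integer bit) as plain integers; `classify_ld80` is a hypothetical name and this is not the code from the commit.

```c
#include <math.h>    /* FP_NAN, FP_INFINITE, FP_ZERO, FP_SUBNORMAL, FP_NORMAL */
#include <stdint.h>

/* se: sign and 15-bit exponent; m: 64-bit mantissa, bit 63 is the
 * explicit integer bit of the x86 extended format. */
static int classify_ld80(uint16_t se, uint64_t m)
{
    int e = se & 0x7fff;
    int msb = (int)(m >> 63);

    if (e == 0 && !msb)
        return m ? FP_SUBNORMAL : FP_ZERO;
    if (e == 0x7fff) {
        if (!msb)
            return FP_NAN;           /* bad inf and bad nan are nan */
        return (m << 1) ? FP_NAN : FP_INFINITE;
    }
    if (!msb)
        return FP_NAN;               /* bad normal (integer bit clear) */
    /* Valid normals, plus exponent 0 with the integer bit set
     * (bad subnormal/zero), which behaves like a minimal normal. */
    return FP_NORMAL;
}
```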
    • S
      math: fix atanh (overflow and underflow issues) · f4d9bfb3
      Szabolcs Nagy committed
      in atanh exception handling was left to the called log functions,
      but the argument to those functions could underflow or overflow.
      
      use double_t and float_t to avoid some useless stores on x86
      f4d9bfb3
  10. 05 Sep, 2013 17 commits
    • S
      math: remove libc.h include from libm.h · afa2aacc
      Szabolcs Nagy committed
      libc.h is only for weak_alias so include it directly where it is used
      afa2aacc
    • S
      math: fix acoshf on negative values · 101e6012
      Szabolcs Nagy committed
      acosh(x) is invalid for x<1, acoshf tried to be clever using
      signed comparisons to handle all x<2 the same way, but the
      formula was wrong on large negative values.
      101e6012
    • S
      math: fix expm1l on x86_64 (avoid underflow for large negative x) · 02343946
      Szabolcs Nagy committed
      copy the fix from i386: return -1 instead of exp2l(x)-1 when x <= -65
      02343946
    • S
      math: fix exp2l asm on x86 (raise underflow correctly) · 07039ed8
      Szabolcs Nagy committed
      there were two problems:
      * omitted underflow on subnormal results: exp2l(-16383.5) was calculated
      as sqrt(2)*2^-16384, the last bits of sqrt(2) are zero so the down scaling
      does not underflow even though the result is in subnormal range
      * spurious underflow for subnormal inputs: exp2l(0x1p-16400) was evaluated
      as f2xm1(x)+1 and f2xm1 raised underflow (because inexact subnormal result)
      
      the first issue is fixed by raising underflow manually if x is in
      (-32768,-16382] and not integer (x-0x1p63+0x1p63 != x)
      
      the second issue is fixed by treating x in (-0x1p64,0x1p64) specially
      
      for these fixes the special case handling was completely rewritten
      07039ed8
    • S
      math: remove *_WORD64 macros from libm.h · 63b9cc77
      Szabolcs Nagy committed
      only fma used these macros and the explicit union is clearer
      63b9cc77
    • S
      math: remove old longdbl.h · 94a3d13a
      Szabolcs Nagy committed
      94a3d13a
    • S
      math: long double fix (use ldshape union) · aa0c4a20
      Szabolcs Nagy committed
      * use new ldshape union consistently
      * add ld128 support to frexpl
      * simplify sqrtl comment (ld64 is not just arm)
      aa0c4a20
    • S
      math: use float_t and double_t in scalbnf and scalbn · 2eaed464
      Szabolcs Nagy committed
      remove STRICT_ASSIGN (c99 semantics is assumed) and use the conventional
      union to prepare the scaling factor (so libm.h is no longer needed)
      2eaed464
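Preparing a power-of-two scaling factor with a union, as mentioned above, can be sketched like this. The helper `pow2_factor` is a hypothetical name, the sketch assumes IEEE binary64 `double`, and it only covers exponents in the normal range; it is not the scalbn code itself.

```c
#include <stdint.h>

/* Returns 2^n as a double for -1022 <= n <= 1023 by writing the
 * biased exponent directly into the representation. */
static double pow2_factor(int n)
{
    union { double f; uint64_t i; } u;
    u.i = (uint64_t)(0x3ff + n) << 52;  /* sign 0, mantissa 0 */
    return u.f;
}
```

Multiplying by such a factor scales a value by a power of two without calling into other math functions.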
    • S
      math: fix remaining old long double code (erfl, fmal, lgammal, scalbnl) · 34660d73
      Szabolcs Nagy committed
      in lgammal don't handle 1 and 2 specially, in fma use the new ldshape
      union instead of ld80 one.
      34660d73
    • S
      math: cbrt cleanup and long double fix · 535104ab
      Szabolcs Nagy committed
      * use float_t and double_t
      * cleanup subnormal handling
      * bithacks according to the new convention (ldshape for long double
      and explicit unions for float and double)
      535104ab
    • S
      math: fix underflow in exp*.c and long double handling in exp2l · 39c910fb
      Szabolcs Nagy committed
      * don't care about inexact flag
      * use double_t and float_t (faster, smaller, more precise on x86)
      * exp: underflow when result is zero or subnormal and not -inf
      * exp2: underflow when result is zero or subnormal and not exact
      * expm1: underflow when result is zero or subnormal
      * expl: don't underflow on -inf
      * exp2: fix incorrect comment
      * expm1: simplify special case handling and overflow properly
      * expm1: cleanup final scaling and fix negative left shift ub (twopk)
      39c910fb
    • S
      math: long double trigonometric cleanup (cosl, sinl, sincosl, tanl) · ea9bb95a
      Szabolcs Nagy committed
      ld128 support was added to internal kernel functions (__cosl, __sinl,
      __tanl, __rem_pio2l) from freebsd (not tested, but should be a good
      start for when ld128 arch arrives)
      
      __rem_pio2l had some code cleanup, the freebsd ld128 code seems to
      gather the results of a large reduction with precision loss (fixed
      the bug but a todo comment was added for later investigation)
      
      the old copyright was removed from the non-kernel wrapper functions
      (cosl, sinl, sincosl, tanl) since these are trivial and the interesting
      parts and comments had been already rewritten.
      ea9bb95a
    • S
      math: long double inverse trigonometric cleanup (acosl, asinl, atanl, atan2l) · bcd797a5
      Szabolcs Nagy committed
      * added ld128 support from freebsd fdlibm (untested)
      * using new ldshape union instead of IEEEl2bits
      * inexact status flag is not supported
      bcd797a5
    • S
      math: rewrite hypot · c2a0dfea
      Szabolcs Nagy committed
      method: if there is a large difference between the scale of x and y
      then the larger magnitude dominates, otherwise reduce x,y so the
      argument of sqrt (x*x+y*y) does not overflow or underflow and calculate
      the argument precisely using exact multiplication. If the argument
      has less error than 1/sqrt(2) ~ 0.7 ulp, then the result has less error
      than 1 ulp in nearest rounding mode.
      
      the original fdlibm method was the same, except it used bit hacks
      instead of dekker-veltkamp algorithm, which is problematic for long
      double where different representations are supported. (the new hypot
      and hypotl code should be smaller and faster on 32bit cpu archs with
      fast fpu), the new code behaves differently in non-nearest rounding,
      but the error should be still less than 2ulps.
      
      ld80 and ld128 are supported
      c2a0dfea
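The exact multiplication mentioned above can be illustrated with the classic Veltkamp split followed by Dekker's product, here specialized to squaring a `double`. This is a sketch under assumptions: `exact_square` is a hypothetical name, `double` is assumed to be IEEE binary64 evaluated without excess precision (FLT_EVAL_METHOD == 0), and `x` is assumed to be scaled so that nothing overflows or underflows.

```c
/* 2^27 + 1: splitting constant for a 53-bit significand. */
#define SPLIT (0x1p27 + 1)

/* Computes hi + lo == x*x exactly, with hi = x*x rounded to double. */
static void exact_square(double *hi, double *lo, double x)
{
    double xh, xl, xc;

    xc = x * SPLIT;
    xh = x - xc + xc;   /* high half of x (at most 26 significant bits) */
    xl = x - xh;        /* low half, so x == xh + xl exactly */
    *hi = x * x;
    /* Each partial product below is exact, so lo recovers the
     * rounding error of hi. */
    *lo = xh * xh - *hi + 2 * xh * xl + xl * xl;
}
```

Summing the `hi` and `lo` parts of `x*x` and `y*y` in a suitable order gives the argument of `sqrt` with much less error than a plain `x*x + y*y`.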
    • S
      math: rewrite remainder functions (remainder, remquo, fmod, modf) · ee2ee92d
      Szabolcs Nagy committed
      * results are exact
      * modfl follows truncl (raises inexact flag spuriously now)
      * modf and modff only had cosmetic cleanup
      * remainder is just a wrapper around remquo now
      * using iterative shift+subtract for remquo and fmod
      * ld80 and ld128 are supported as well
      ee2ee92d
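The iterative shift+subtract approach listed above can be sketched on integers. In a real fmod the operands would be the two significands and `k` the difference of the exponents; here `shift_sub_mod` is a hypothetical helper that shows only the reduction loop, assuming `mx < 2*my` on entry and `my < 2^63` so the shifts cannot overflow.

```c
#include <stdint.h>

/* Returns (mx * 2^k) mod my using only compare, subtract and shift.
 * Preconditions (assumed, not checked): my > 0, my < 2^63, mx < 2*my. */
static uint64_t shift_sub_mod(uint64_t mx, uint64_t my, int k)
{
    for (; k > 0; k--) {
        if (mx >= my)
            mx -= my;   /* now mx < my */
        mx <<= 1;       /* now mx < 2*my, invariant restored */
    }
    if (mx >= my)
        mx -= my;
    return mx;
}
```

Because every step is an exact integer operation, the remainder is exact, which matches the "results are exact" property stated in the commit message.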