• E
    hardfloat: implement float32/64 division · 4a629561
    Emilio G. Cota 提交于
    Performance results for fp-bench:
    
    1. Intel(R) Core(TM) i7-6700K CPU @ 4.00GHz
    - before:
    div-single: 34.84 MFlops
    div-double: 34.04 MFlops
    - after:
    div-single: 275.23 MFlops
    div-double: 216.38 MFlops
    
    2. ARM Aarch64 A57 @ 2.4GHz
    - before:
    div-single: 9.33 MFlops
    div-double: 9.30 MFlops
    - after:
    div-single: 51.55 MFlops
    div-double: 15.09 MFlops
    
    3. IBM POWER8E @ 2.1 GHz
    - before:
    div-single: 25.65 MFlops
    div-double: 24.91 MFlops
    - after:
    div-single: 96.83 MFlops
    div-double: 31.01 MFlops
    
    Here setting 2FP64_USE_FP to 1 pays off for x86_64:
    [1] 215.97 vs [0] 62.15 MFlops
    Reviewed-by: NAlex Bennée <alex.bennee@linaro.org>
    Signed-off-by: NEmilio G. Cota <cota@braap.org>
    Signed-off-by: NAlex Bennée <alex.bennee@linaro.org>
    4a629561
softfloat.c 262.3 KB