• B
    Improved the performance of full reductions on GPU. · 03dba169
    Benoit Steiner 提交于
    NEW
    BM_fullReduction/10        4591       4595     153149  20.8M items/s
    BM_fullReduction/64        5073       5075     100000  770.0M items/s
    BM_fullReduction/512       9067       9070      75263  26.9G items/s
    BM_fullReduction/4k      243984     244125       2868  64.0G items/s
    BM_fullReduction/5k      359125     359273       1951  64.8G items/s
    
    OLD
    BM_fullReduction/10        9085       9087      74395  10.5M items/s
    BM_fullReduction/64        9478       9478      72014  412.1M items/s
    BM_fullReduction/512      14643      14646      46902  16.7G items/s
    BM_fullReduction/4k      260338     260384       2678  60.0G items/s
    BM_fullReduction/5k      385076     385178       1818  60.5G items/s
    Change: 124290852
    03dba169
Eigenvalues 54 字节