1. 15 Mar, 2018 3 commits
    • [Speed]implement cudnn sequence softmax cudnn (#8978) · 128adf53
      dzhwinter committed
      * "add softmax cudnn functor support"
      
      * "add testing"
      
      * "refine cmakelist"
      
      * "sequence softmax forward speed up"
      
      * "add softmax grad"
      
      * "fix sequence softmax test"
      
      * "add double precision"
      
      * "fix softmax test"
      
      * "add softmax cudnn support"
      
      * "fix softmax cudnn test"
      
      * "add softmax to nn.py"
      
      * "fix compile bug"
      
      * "refine cmakelist"
      
      * "fix ci"
      
      * "fix based on comment"
      
      * "fix based on comments"
      
      * "fix ci"
    • Add fp16 mul op support and bind paddle fp16 to numpy fp16 (#9017) · e26f1123
      Kexin Zhao committed
      * add fp16 mul op support
      
      * small fix
      
      * fix bug
      
      * small fix
      
      * fix PADDLE_WITH_CUDA compiling issue
      
      * reorg code
      
      * test for pybind
      
      * treat float16 as uint16_t in pybind
      
      * bind np.float16 to paddle float16
      
      * small fix
      
      * clean code
      
      * remove redundancy
      
      * fix mul_op test
      
      * address comments
      
      * small fix
      
      * add is_float16_supported func
    • "exported scatter to python" (#9038) · 71400711
      dzhwinter committed
      * "exported scatter to python"
      
      * Revert ""exported scatter to python""
      
      This reverts commit 38745a62.
      
      * "polish scatter and export to python"
  2. 14 Mar, 2018 7 commits
  3. 13 Mar, 2018 4 commits
  4. 12 Mar, 2018 8 commits
  5. 10 Mar, 2018 1 commit
  6. 09 Mar, 2018 5 commits
  7. 08 Mar, 2018 3 commits
  8. 07 Mar, 2018 7 commits
  9. 06 Mar, 2018 2 commits