1. 13 5月, 2019 2 次提交
    • Y
      Optimize the elementwise op using eigen (#15494) · dcda2023
      Yiqun Liu 提交于
      * Optimize the elementwise op with CUDA kernels.
      test=develop
      
      * Support setting of attr in op config file.
      test=develop
      
      * Add the support the setting dtype and initializer in config.
      test=develop
      
      * Save workspace.
      
      * Add initializer "zeros".
      test=develop
      
      * Fix compiling error.
      
      * Support the use of existed file to initailize tensor in op_tester.
      
      * Use eigen to optimize the elementwise_add/mul for the case that x and y have the same dims.
      test=develop
      dcda2023
    • K
      add double grad for elementwise_mul op (#17255) · 8bae8590
      Kaipeng Deng 提交于
      * add double grad for elementwise_mul. test=develop
      
      * remove comment. test=develop
      
      * fix grad sum. test=develop
      
      * fix for axis expand. test=develop
      
      * add test for axis expand. test=develop
      8bae8590
  2. 09 5月, 2019 1 次提交
    • Z
      Mod floordiv (#17251) · 4292bd86
      zhoukunsheng 提交于
      * test=develop
      add elementwise_mod and elementwise_floordiv, fix equation problem in elementwise_mod
      4292bd86
  3. 08 5月, 2019 2 次提交
  4. 06 5月, 2019 1 次提交
  5. 16 4月, 2019 1 次提交
  6. 12 4月, 2019 1 次提交
  7. 03 4月, 2019 1 次提交
  8. 28 3月, 2019 1 次提交
    • J
      [MKL-DNN] Tensor modifications revert (#16462) · 26323274
      Jacek Czaja 提交于
      * Revert "[MKL-DNN] Fix to crash of Transformer when mkldnn is to be used (#16233)"
      
      This reverts commit 13816dd4.
      Apart from enabling transformer for MKL-DNN
      
      * Revert "- MKL-DNN pooling updated to set_prim_desc"
      
      This reverts commit c63f6b20.
      
      Conflicts:
      	paddle/fluid/operators/mkldnn/concat_mkldnn_op.cc
      
      * Revert "[MKL-DNN] MKL-DNN specific Tensor modification (#15429)"
      
      test=develop
      
      This reverts commit dec9cf53.
      
      * - concat compilation fix
      
      - lint
      
      test=develop
      
      - Lint fixes
      
      test=develop
      
      - Lint fixes
      
      test=develop
      
      - Fix Transpose MKLDNN op
      
      test=develop
      26323274
  9. 27 3月, 2019 1 次提交
    • L
      Memory optimize (#16410) · 8d22bc17
      liuwei1031 提交于
      * fix cdn issue, test=develop
      
      * fix memory optimize bugs, test=develop
      
      * fix memory optimize bugs, test=develop
      
      * remove add/sub_2 op, test=develop
      
      * disable memory_optimize by default, test=develop
      
      * disable inplace activation in python, test=develop
      
      * fix unittests, test=develop
      
      * fix unittests, test=develop
      
      * bug-fix, test=develop
      8d22bc17
  10. 24 3月, 2019 1 次提交
  11. 21 3月, 2019 3 次提交
  12. 08 3月, 2019 1 次提交
  13. 07 3月, 2019 1 次提交
  14. 26 2月, 2019 1 次提交
    • J
      - MKL-DNN pooling updated to set_prim_desc · c63f6b20
      Jacek Czaja 提交于
      - MKLDNN ops revisited
      
      - disabled softmax modifications
      
      - disabled elementwise_add
      
      - reverted LRN modifications
      
      - reverted SUM primitive
      
      - Partial reviing of softmax
      
      - Enable softmax
      
      - Softmax changes
      
      - LRN is back
      
      - LRN partially disabled
      
      - LRN is back
      
      - LRN fix
      
      - compilation fixes
      
      - Sum fixed(hopefully)
      
      - Enabling (partially) elementwise_add
      
      - Fixes to elemenwise_add
      
      - Lint fixes
      
      quantize fix
      
      - compilation fix
      
      test=develop
      
      Disabling pooling
      
      - Disabled quantize op
      
      test=develop
      c63f6b20
  15. 09 2月, 2019 1 次提交
  16. 06 2月, 2019 1 次提交
  17. 29 1月, 2019 2 次提交
  18. 24 1月, 2019 1 次提交
  19. 21 1月, 2019 1 次提交
  20. 10 1月, 2019 1 次提交
    • W
      [Feature] support mix precision training for resnet (#14899) · fd854183
      Wu Yi 提交于
      * clip softmax for fp16
      
      * updates
      
      * fuse xent support fp16 test=develop
      
      * wip
      
      * wip
      
      * add simple row reduce
      
      * wip fp16 accurate softmax
      
      * add accurate softmax kernel for fp16 test=develop
      
      * update test=develop
      
      * fix cpu build test=develop
      
      * update api.spec test=develop
      
      * follow comments test=develop
      
      * fix build test=develop
      
      * fix trt build test=develop
      
      * fix inference build test=develop
      
      * fix merge test=develop
      
      * update test=develop
      
      * try fix build test=develop
      
      * fix build test=develop
      
      * rename real_exp test=develop
      
      * fortest
      
      * remove hacky kernels test=develop
      
      * clean up test=develop
      fd854183
  21. 26 12月, 2018 1 次提交
    • W
      Fp16 training (#14992) · 856f0da0
      Wu Yi 提交于
      * wip
      
      * wip
      
      * wip
      
      * wip for test
      
      * add fp16 tests test=develop
      
      * fix cpu build test=develop
      
      * fix test=develop
      
      * fix py3 tests test=develop
      
      * fix lr_scheduler dtype test=develop
      
      * fix test=dvelop
      
      * test fix ci compile test=develop
      
      * fix build and merge test=develop
      
      * fallback momentumop change to general test=develop
      
      * make fp16 lr schedule simple test=develop
      
      * fix ut test=develop
      
      * fix tests test=develop
      
      * remove fp16 learning rate cast test=develop
      856f0da0
  22. 24 12月, 2018 1 次提交
  23. 21 12月, 2018 2 次提交
  24. 20 12月, 2018 3 次提交
  25. 19 12月, 2018 5 次提交
  26. 18 12月, 2018 3 次提交