1. 24 11月, 2019 1 次提交
  2. 18 9月, 2019 1 次提交
    • L
      Update elementwise double grad to save gpu memory (#19509) · 982e61f5
      Leo Chen 提交于
      * update elementwise double grad to save gpu memory, test=develop
      
      * update elementwise_mul/div_grad_grad to save memory, test=develop
      
      * remove eval function in eigen statement to save memory, test=develop
      
      * add unittest for elementwise_div_grad_grad without dout, test=develop
      
      * add unittest for elementwise_add_grad_grad without ddx, test=develop
      
      * add float16 cuda kernel for elementwise double grad op, test=develop
      982e61f5
  3. 17 9月, 2019 2 次提交
  4. 11 9月, 2019 1 次提交
  5. 12 8月, 2019 1 次提交
    • H
      Add hard swish op (new op) (#19001) · 20f18930
      huangjun12 提交于
      * add hard_swish activation op (new op)
      test=develop
      
      * remove redundancy files
      
      * modify document content of HardSwish OP
      
      * add API test in test_layers.py
      
      * add dynamic_graph for test_hard_swish
      20f18930
  6. 09 8月, 2019 1 次提交
  7. 31 7月, 2019 1 次提交
  8. 22 7月, 2019 1 次提交
  9. 19 7月, 2019 1 次提交
  10. 16 6月, 2019 1 次提交
  11. 15 5月, 2019 1 次提交
    • L
      Double backward sqrt (#17387) · 4ef63101
      lvmengsi 提交于
      * double backward sqrt
      
      * refine unittest. test=develop
      
      * refine test. test=develop
      
      * remove alpha in unittest. test=develop
      4ef63101
  12. 13 5月, 2019 1 次提交
    • K
      add double grad for square op (#17173) · 11d3a38f
      Kaipeng Deng 提交于
      * add double grad for square. test=develop
      
      * formax code. test=develop
      
      * fix for grad sum. test=develop
      
      * refine shape. test=develop
      
      * refine extract. test=develop
      11d3a38f
  13. 28 4月, 2019 1 次提交
    • Z
      Refine dropout gpu memory (#17095) · 28d69d71
      Zeng Jinle 提交于
      * refine_dropout_mem,test=develop
      
      * # This is a combination of 14 commits.
      # The first commit's message is:
      remove ut test_dist_word2vec in mac ci, will fix it in private, test=develop (#17066)
      
      # This is the 2nd commit message:
      
      Fleet unify distributed training (#16791)
      
      * implement distributed transpiler with fleet
      # This is the 3rd commit message:
      
      ParallelDyGraph with GPU collective mode (#16827)
      
      implement dygraph.parallel.DataParallel to hook reduce op.
      
      # This is the 4th commit message:
      
      Init mixed precision training interface (#16856)
      
      * Init mixed precision training interface
      
      * Add fp16 test script
      
      test=develop
      
      * All initializers support float16
      
      test=develop
      
      * Code cleanup & add more code annotations
      
      test=develop
      
      * Update API spec
      
      test=develop
      
      * Add usage example in doc
      
      test=develop
      
      # This is the 5th commit message:
      
      fix reference_count_pass,test=develop (#17060)
      
      test=develop
      # This is the 6th commit message:
      
      Speedup roi_perspective_transform op by caching the information of linear interpolation in forward (#17090)
      
      * Cache the information of linear interpolation in forward and use it in backward.
      test=develop
      
      * Fix cuda kernel.
      test=develop
      
      # This is the 7th commit message:
      
      remove unnecessary prepare_data (#17080)
      
      test=develop
      # This is the 8th commit message:
      
      fix interpolate cu. test=develop (#17101)
      
      # This is the 9th commit message:
      
      test=develop, double backward leaky_relu (#17067)
      
      backward of backward: leaky_relu
      # This is the 10th commit message:
      
      fix fuse optimizer ops (#17102)
      
      test=develop
      # This is the 11th commit message:
      
      truncated_gaussian_random supported in distributed training, test=develop (#17091)
      
      # This is the 12th commit message:
      
       Detailed coordinate description for yolov3 loss (#17007)
      
      * Detailed coordinate description for yolov3 loss
      
      test=develop
      
      * modified api.spec
      
      test=develop
      
      * modified loss name
      
      * fix api.spec
      
      test=develop
      
      * polish description
      
      test=develop
      
      * modified api.spec
      
      test=develop
      
      # This is the 13th commit message:
      
      fix test_weight_decay (#17109)
      
      test=develop
      # This is the 14th commit message:
      
      Path flag (#17105)
      
      * fix python/paddle/fluid/__init__.py detecting problems
      28d69d71
  14. 26 4月, 2019 1 次提交
  15. 23 4月, 2019 1 次提交
    • Q
      Support backward of backward for Relu and add a new gradient checker by... · c1c2633a
      qingqing01 提交于
      Support backward of backward for Relu and add a new gradient checker by comparing theoretical and numerical Jacobian. (#16862)
      
      * Support backward of backward and a new gradient checker
      * Rename decorators.py to decorator_helper.py, since Python on Windows CI has decorators package.
      
      1. Add ReluDoubleGradMaker when register relu_grad.
      2. Add a new gradient checker by comparing theoretical and numerical Jacobian.  Check double gradients by double_grad_check.
      c1c2633a
  16. 12 4月, 2019 1 次提交
  17. 10 4月, 2019 1 次提交
  18. 29 3月, 2019 1 次提交
  19. 28 3月, 2019 1 次提交
  20. 10 3月, 2019 1 次提交
  21. 04 3月, 2019 2 次提交
  22. 27 2月, 2019 1 次提交
    • D
      polish cudnn related code and fix bug. (#15164) · 225c11a9
      dzhwinter 提交于
      * staged.
      
      * polish code
      
      * polish code. test=develop
      
      * polish code. test=develop
      
      * api change. test=develop
      
      * fix default value. test=develop
      
      * fix default value. test=develop
      225c11a9
  23. 26 2月, 2019 1 次提交
  24. 22 2月, 2019 2 次提交
    • T
      Revert 15770 develop a6910f90 gelu mkl opt (#15872) · ee2321de
      tensor-tang 提交于
      * Revert "Optimze Gelu with MKL Erf function (#15770)"
      
      This reverts commit 676995c8.
      
      * test=develop
      ee2321de
    • Y
      Optimze Gelu with MKL Erf function (#15770) · 676995c8
      Yihua Xu 提交于
      * Optimize for gelu operator
      
      * Set up the low accuracy mode of MKL ERF function.
      
      test=develop
      
      * Only enable MKLML ERF when OS is linux
      
      * Use the speical mklml version included vmsErf function to verify gelu mkl kernel.
      
      test=develop
      
      * Add the CUDA macro to avoid NVCC's compile issue.
      
      test=develop
      
      * Add the TODO comments for mklml library modification.
      
      test=develop
      
      * Clean Code
      
      test=develop
      
      * Add the comment of marco for NVCC compiler.
      
      test=develop
      676995c8
  25. 12 12月, 2018 1 次提交
  26. 05 12月, 2018 1 次提交
    • C
      Fix clip.py (#14718) · 04539d4c
      chengduo 提交于
      * expose square
      test=develop
      
      * fix activation
      test=develop
      
      * Add square API
      test=develop
      
      * add necessary op
      
      * code refine
      
      * fix API.spec
      test=develop
      
      * fix unit test
      test=develop
      
      * add unit test sparse_grad_clip
      test=develop
      
      * fix API.spec
      test=develop
      
      * remove mac test for test_gradient_clip
      test=develop
      
      * remove selectedrows_mul_tensor
      test=develop
      04539d4c
  27. 27 11月, 2018 1 次提交
  28. 26 11月, 2018 1 次提交
  29. 08 11月, 2018 1 次提交
  30. 07 11月, 2018 1 次提交
    • C
      Add fp16 backward support (#14202) · a9b5d42d
      chengduo 提交于
      * add fp16 backward support
      test=develop
      
      * add sum_op fp16 test
      
      * disable test_dist_save_load
      test=develop
      
      * add check_grad for sum
      
      * add unit test for softmax_grad fp16
      test=develop
      
      * add scale_op unit test
      
      * add mul_grad_op unit test for fp16
      
      * add cross_entropy_grad and eman_grad unit test for fp16
      test=develop
      
      * fix cross_entropy unit test
      
      * add pool2d fp16 unit test
      
      * refine conv2d fp16 unit test
      test=develop
      
      * refine activation unit test
      test=develop
      
      * fix ci
      test=develop
      
      * follow zhihong's comment, copy from https://github.com/PaddlePaddle/Paddle/pull/12796
      test=develop
      a9b5d42d
  31. 03 9月, 2018 1 次提交
  32. 25 8月, 2018 1 次提交
  33. 17 8月, 2018 1 次提交
  34. 16 8月, 2018 1 次提交
  35. 25 6月, 2018 1 次提交
  36. 12 6月, 2018 1 次提交
  37. 16 4月, 2018 1 次提交