1. 01 8月, 2019 1 次提交
  2. 22 4月, 2019 2 次提交
    • G
      resolve #16988 (#16995) · ae7a2cb8
      guomingz 提交于
      Update the filter generation mechanism that it could generate the negative parameter.
      The original calling(np.random.random()) couldn't simulate the conv/relu fusion case.
      
      test=develop
      ae7a2cb8
    • G
      resolve #16987 (#16994) · 23df084b
      guomingz 提交于
      Rename the testcuda function to has_cuda, it will elimate the unnecessary testing.
      test=develop
      23df084b
  3. 21 1月, 2019 1 次提交
    • D
      Memory optimization of depthwise conv op and group norm op (#15313) · 9f8f0fc2
      Dun 提交于
      * mem opt
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * refine code  test=develop
      
      * refine code  test=develop
      
      * refine code  test=develop
      
      * refine code  test=develop
      
      * refine with cub test=develop
      
      * fix mkldnn test && remove comments && test=develop
      
      * polish code && test=develop
      
      * add only_forward test && test=develop
      9f8f0fc2
  4. 04 1月, 2019 1 次提交
    • X
      Enable basic MKL-DNN INT8 Conv OP (#15124) · bbc93368
      xiaolil1 提交于
      * Enable basic MKL-DNN INT8 Conv OP
      test=develop
      
      * Modify test case
      test=develop
      
      * Clean unittest code
      test=develop
      
      * Fix test
      test=develop
      
      * Modify test
      test=develop
      
      * Modify basic INT8 Conv
      test=develop
      bbc93368
  5. 09 11月, 2018 1 次提交
    • Q
      Exhaustive search for cuDNN conv. (#14286) · abe20923
      qingqing01 提交于
      * exhaustive search for cuDNN conv.
      * Refine code and add unit testing.
      * Fix model load in fluid/inference and unit testing in conv2d
      * Follow comments.
      * Fix compiling test=develop
      abe20923
  6. 08 11月, 2018 1 次提交
  7. 07 11月, 2018 3 次提交
    • C
      Add fp16 backward support (#14202) · a9b5d42d
      chengduo 提交于
      * add fp16 backward support
      test=develop
      
      * add sum_op fp16 test
      
      * disable test_dist_save_load
      test=develop
      
      * add check_grad for sum
      
      * add unit test for softmax_grad fp16
      test=develop
      
      * add scale_op unit test
      
      * add mul_grad_op unit test for fp16
      
      * add cross_entropy_grad and eman_grad unit test for fp16
      test=develop
      
      * fix cross_entropy unit test
      
      * add pool2d fp16 unit test
      
      * refine conv2d fp16 unit test
      test=develop
      
      * refine activation unit test
      test=develop
      
      * fix ci
      test=develop
      
      * follow zhihong's comment, copy from https://github.com/PaddlePaddle/Paddle/pull/12796
      test=develop
      a9b5d42d
    • Q
      Revert " Exhaustive search for cuDNN conv. (#14043)" · db8c52da
      qingqing01 提交于
      This reverts commit ce7d9b07.
      db8c52da
    • Q
      Exhaustive search for cuDNN conv. (#14043) · ce7d9b07
      qingqing01 提交于
      * exhaustive search for cuDNN conv.
      * Refine code and add unit testing.
      * Clean code
      * Fix model load in fluid/inference and unit testing in conv2d
      * Follow comments.
      ce7d9b07
  8. 29 9月, 2018 4 次提交
  9. 15 8月, 2018 1 次提交
  10. 10 8月, 2018 2 次提交
  11. 09 8月, 2018 2 次提交
  12. 26 7月, 2018 2 次提交
  13. 17 4月, 2018 1 次提交
  14. 07 4月, 2018 1 次提交
  15. 17 3月, 2018 2 次提交
  16. 16 3月, 2018 3 次提交
  17. 07 3月, 2018 1 次提交
  18. 27 2月, 2018 1 次提交
  19. 24 2月, 2018 2 次提交
  20. 13 2月, 2018 1 次提交
    • X
      Run Python OP tests in a single Python process to improve test time. (#8362) · cde6241a
      Xin Pan 提交于
      Currently, our tests run with 2 GPUs, the init time is absurdly long:
      about 4s for each process.  Currently, we run each OP test on
      different processes. This PR:
      
      1. create cmake function py_test_modules which will generate the
      Makefile that runs a list of Python unittest module in a single Python
      process.
      
      2. move all "python unittest compatible" (e.g., used the unittest
      package, not just a regular python file). from fluid/tests to
      fluid/tests/unittests.
      
      3. cmake now will run all OP tests in fluid/tests/unittests in a
      single process, except the time-consuming tests, they are separated
      into different processes to utilize parallelism. Please make sure to
      use the unittest package if you put the python test file in
      fluid/tests/unittests
      
      4. remove all exit(0) from fluid/tests/unittests/*.py, exit(0) is used
      to disable unittest, we can not do it when running all tests in a
      single process since it will terminate the process without running the
      other tests. Instead, the test is disabled in
      fluid/tests/unittests/CMakeLists.txt. FIXME is added for each disabled
      item. Please disable the unittest from
      fluid/tests/unittests/CMakeLists.txt, instead of adding exit(0) to the
      Python file, for all Python file in fluid/tests/unittests/.
      
      5. add an option WITH_FAST_BUNDLE_TEST. When OFF, will run the unit
      tests in separate process so that they can be tested individually.
      cde6241a
  21. 12 2月, 2018 1 次提交
  22. 02 2月, 2018 1 次提交
  23. 21 1月, 2018 1 次提交
    • D
      "fix decode bug" (#7711) · e983cc90
      dzhwinter 提交于
      * "fix decode bug"
      
      * "follow commnet"
      
      * "fix error"
      
      * "fix hook bug"
      
      * fix based comment
      
      * fix copyright
      
      * fix based on comment
      e983cc90
  24. 15 1月, 2018 1 次提交
    • D
      Feature/hooks (#7513) · b9b75377
      dzhwinter 提交于
      * add copyright hook
      
      * add copyright hook
      
      * refine copyright hook
      
      * "test copyright hook"
      
      * fix check style
      
      * fix ci
      b9b75377
  25. 14 1月, 2018 1 次提交
    • D
      "cudnn operators change to cudnn kernel" (#6660) · 5ad1aef0
      dzhwinter 提交于
      * "unified operators"
      
      * "add CUDNN register"
      
      * "add use cudnn attribute"
      
      * "add attribute"
      
      * "test conv tranpose op"
      
      * "remove duplicated attr"
      
      * "fix op test"
      
      * "add attribute to set cudnn"
      
      * "add more log"
      
      * "need layout op register support"
      
      * "add more log"
      
      * "change GetExpectedKernelType "
      
      * "fix Get attr in conv_op"
      
      * "fix CI"
      
      * "fix tests"
      
      * "removed kernel priority fallback"
      
      * "fix CI"
      
      * "fix stack pointer bug"
      
      * "refine buggy interface"
      
      * "add const cast to save life"
      
      * "fix get_output_with_grad"
      
      * "fix op test with dataformat"
      
      * ""fix pooling
      
      * "fix pooling test"
      
      * "fix CI"
      
      * "fix with_gpu error"
      
      * "add transform needed functional check"
      
      * "fix unpack list error"
      
      * "comment out parallel.do temporary"
      
      * "fix CI"
      
      * "fix compile doc error"
      
      * "make threshold larger"
      5ad1aef0
  26. 05 1月, 2018 1 次提交
    • D
      Feature/use cudnn (#7141) · 5593858d
      dzhwinter 提交于
      * "add c++ side kernel selection"
      
      * "add multiple kernel op test"
      
      * "kernel selection only support cudnn"
      
      * "better formatter"
      
      * "small fix with UseCPU"
      
      * "depends on change interface Get(Place, Library)"
      
      * "fix CI"
      
      * "fix python cudnn test"
      
      * "leave the register cudnn op to another PR"
      
      * "fix CI"
      
      * "use all kernel by default"
      
      * "fix CI"
      5593858d
  27. 27 11月, 2017 1 次提交
    • Conv cudnn 3d (#5783) · a06bec12
      武毅 提交于
      * conv cudnn 3d
      
      * update test case
      
      * update
      
      * update
      
      * follow comments and remove groups from helper
      
      * update
      
      * refine
      
      * update
      
      * follow comments2
      
      * update
      
      * fix compile
      a06bec12