1. 30 8月, 2019 1 次提交
    • H
      [Cherry-pick] Support memory eager deletion on recurrent OP (#19411) · cb74dac3
      Huihuang Zheng 提交于
      * Support memory eager deletion on recurrent OP (#17710)
      
      Test PaddingRNN on V100 GPU device.
      
      Test configuration: large model, padding mode (which is the mode using recurrentOp), one GPU.
                         
      GPU memory (MiB):   6414 (this PR)     vs   6837 (without this PR)
      Speed (steps/s):         10.28 (this PR)    vs    9.89 (without this PR)
      
      * Fix random test_recurrent_op failure (#18718)
      
      The change includes 3 things:
      
      1. Set CPU_NUM to 1 in the tests because the ParallelExecutor will print warning that CPU_NUM is not set and use default 1.
      
      2. Old tests compare two RNNs, hand written simple RNN and same RNN built by Paddle, but initialized RNN weights in numpy random and Paddle random separately. Fixed it by setting weights and bias values.
      
      3. Also set numpy random seed in the tests. Now the two RNNs diff can be smaller (rtol from 0.1, 0.2 to. 0.01) in the tests.
      cb74dac3
  2. 08 7月, 2019 1 次提交
  3. 10 5月, 2019 1 次提交
    • Q
      Double backward of conv2d. (#17211) · e32c9888
      qingqing01 提交于
      * Add conv2d_grad_grad_op
      * Extracte the cuDNN conv algo searching code in conv_cudnn_helper.h.
          - Now use it in conv2d_grad_grad.
          - Will simply the searching code in conv2d and conv2d_grad in next PR.
      * Enhance and fix bug in unit testing of gradient_checker.
      * Support to fetch empty variables,return None in Python.
      e32c9888
  4. 17 4月, 2019 1 次提交
  5. 16 4月, 2019 3 次提交
  6. 13 4月, 2019 1 次提交
  7. 04 4月, 2019 1 次提交
  8. 28 3月, 2019 1 次提交
  9. 27 3月, 2019 2 次提交
  10. 19 3月, 2019 1 次提交
  11. 18 3月, 2019 2 次提交
  12. 15 3月, 2019 2 次提交
  13. 06 3月, 2019 2 次提交
  14. 05 3月, 2019 1 次提交
  15. 04 3月, 2019 2 次提交
  16. 28 2月, 2019 1 次提交
  17. 27 2月, 2019 1 次提交
  18. 19 2月, 2019 2 次提交
    • T
      fix warnings (#15790) · e1c707fe
      tensor-tang 提交于
      * fix warnings
      
      test=develop
      
      * fix enforce test
      
      test=develop
      e1c707fe
    • Y
      Correct the doc in Python API (#15725) · 56a5039e
      Yiqun Liu 提交于
      * Correct the comment in control_flow.py.
      
      * Correct the argument list of ops.
      test=develop
      
      * Update API.spec.
      test=develop
      
      * Skip op_callstack attr for all op apis.
      test=develop
      
      * Remove use_mkldnn and is_test from python api.
      test=develop
      
      * Remove use_mkldnn and is_test from op_proto_maker and hard-coding them in python when generating doc string.
      test=develop
      56a5039e
  19. 21 1月, 2019 1 次提交
  20. 17 1月, 2019 1 次提交
  21. 16 1月, 2019 1 次提交
    • Y
      Optimize while_op for test (#14764) · 568cc2ff
      Yiqun Liu 提交于
      * Simplify the compare op for CPU.
      
      * Use asynchronous tensor copy in reshape_op's kernel.
      
      * Optimize while_op for test, avoiding creating variables every time.
      test=develop
      
      * Enable the cache of kernel type and kernel function.
      test=develop
      
      * Enable profiling with gperftools.
      
      * Remove flags for testing, and fix the linking error.
      test=develop
      
      * Delete the codes of ChooseKernel.
      test=develop
      
      * Fix bug when preparing ExecutorPrepareContext for while_op.
      
      * Fix missing depending on grpc libraries.
      
      * Remove the redundant print.
      test=develop
      
      * Follow comments.
      
      * Remove the codes related to prepare the ExecutorPrepareContext for while_op.
      test=develop
      568cc2ff
  22. 19 12月, 2018 3 次提交
  23. 18 12月, 2018 1 次提交
  24. 14 12月, 2018 1 次提交
  25. 13 12月, 2018 1 次提交
  26. 12 12月, 2018 2 次提交
  27. 07 12月, 2018 1 次提交
  28. 04 12月, 2018 2 次提交