1. 11 Nov, 2019 1 commit
    • Add the check of lod_level between compile-time and runtime. (#20961) · 35f17ae2
      Yiqun Liu committed
      * Add the check of lod_level between compile-time and runtime.
      test=develop
      
      * Fix bug in check_compile_vs_runtime.
      test=develop
      
      * Fix the check of output when it is dispensable or intermediate.
      test=develop
      
      * Share lod of x to out in match_matrix_tensor op in compile-time.
      
      * Implement GetLoDLevel in InferShapeContext.
      
      * Set the default value of check_compile_vs_runtime to False and enable it in test_sequence_pad_op.
      test=develop
      
      * Enable check_compile_vs_runtime in test_match_matrix_tensor.
      
      * Add the implementation of SetLoDLevel in InferShapeContext.
      
      * Remove the implementation of IncreaseLoDLevel and call Get/SetLoDLevel instead.
      
      * Remove the implementation of DecreaseLoDLevel and call Set/GetLoDLevel instead.
      
      * Refine some ops and unittests.
      test=develop
      
      * Fix a typo.
      test=develop
      
      * Remove the check of var type, and change int to int32_t.
      test=develop
      
      * Add unittest for Get/SetLoDLevel.
      test=develop
  2. 01 Nov, 2019 1 commit
    • Update Tensor.set() to support float16 (#19964) · 9974e407
      Leo Chen committed
      * don't expose numerous Tensor.set(), test=develop
      
      * fix condition, test=develop
      
      * fix float16 bug, test=develop
      
      * feed should be Tensor or np.array, not Variable or number, test=develop
      
      * use forcecast to copy numpy slice to new array, test=develop
      
      * remove float16-uint16 hacking, test=develop
  3. 24 Sep, 2019 1 commit
    • Make OpTest check grad inplace even if forward has no inplace (#19847) · 57606205
      Leo Chen committed
      * make OpTest check grad inplace even if forward has no inplace, test=develop
      
      * do not run PE when enable_inplace is False, test=develop
      
      * add conv3d cuda kernel for float16 type, test=develop
      
      * refactor OpTest for inplace, test=develop
      
      * add comments, test=develop
  4. 17 Sep, 2019 1 commit
  5. 03 Sep, 2019 1 commit
  6. 26 Aug, 2019 1 commit
  7. 22 Aug, 2019 2 commits
    • Enhance OpTest to check the consistency of operators when using and not using inplace (#19101) · a9d5fc51
      Leo Chen committed
      * add pybind interface to get all inplace ops, test=develop
      
      * enhance OpTest to check whether the consistency of operator when using and not using inplace, test=develop
      
      * handle corner cases in op_test, test=develop
      
      * support outputs without tensor holder_, like XShape in reshape_op, test=develop
      
      * fix bug, some op has GradOpMaker, but actually no grad_op in OpInfoMap, test=develop
      
      * use reshape_grad instead of reshape in FlattenGradOp, test=develop
      
      * fix error debug dims info for variables like XShape, test=develop
      
      * change computational order in sum_op to relieve computation difference using inplace, test=develop
      
      * add inplace_atol to check group_norm, and skip inplace_grad for mkldnn, test=develop
      
      * follow sneaxiy's comments, test=develop
      
      * remove unused DefaultGradOpDescMaker in mkldnn op, test=develop
  8. 08 Aug, 2019 1 commit
    • Fix memory overwriting of tensors returned by executor (#19030) · 8f537354
      Leo Chen committed
      * fix memory overlapping of fetch var (return of executor.run), test=develop
      
      * fix wrong usage of ParallelExecutor in op_test, test=develop
      
      * remove useless parameter and simplify code
      
      * avoid tensor destruct untimely, test=develop
      
      * add testcase independent of OpTest, test=develop
  9. 28 Mar, 2019 1 commit
  10. 11 Mar, 2019 2 commits
  11. 19 Feb, 2019 1 commit
  12. 31 Jan, 2019 1 commit
  13. 30 Jan, 2019 4 commits
  14. 29 Jan, 2019 2 commits
  15. 26 Dec, 2018 1 commit
    • Fp16 training (#14992) · 856f0da0
      Wu Yi committed
      * wip
      
      * wip
      
      * wip
      
      * wip for test
      
      * add fp16 tests test=develop
      
      * fix cpu build test=develop
      
      * fix test=develop
      
      * fix py3 tests test=develop
      
      * fix lr_scheduler dtype test=develop
      
      * fix test=dvelop
      
      * test fix ci compile test=develop
      
      * fix build and merge test=develop
      
      * fallback momentumop change to general test=develop
      
      * make fp16 lr schedule simple test=develop
      
      * fix ut test=develop
      
      * fix tests test=develop
      
      * remove fp16 learning rate cast test=develop
  16. 20 Dec, 2018 2 commits
  17. 01 Dec, 2018 2 commits
  18. 22 Nov, 2018 1 commit
  19. 16 Nov, 2018 1 commit
  20. 12 Nov, 2018 1 commit
  21. 10 Nov, 2018 1 commit
  22. 07 Nov, 2018 1 commit
    • Add fp16 backward support (#14202) · a9b5d42d
      chengduo committed
      * add fp16 backward support
      test=develop
      
      * add sum_op fp16 test
      
      * disable test_dist_save_load
      test=develop
      
      * add check_grad for sum
      
      * add unit test for softmax_grad fp16
      test=develop
      
      * add scale_op unit test
      
      * add mul_grad_op unit test for fp16
      
      * add cross_entropy_grad and eman_grad unit test for fp16
      test=develop
      
      * fix cross_entropy unit test
      
      * add pool2d fp16 unit test
      
      * refine conv2d fp16 unit test
      test=develop
      
      * refine activation unit test
      test=develop
      
      * fix ci
      test=develop
      
      * follow zhihong's comment, copy from https://github.com/PaddlePaddle/Paddle/pull/12796
      test=develop
  23. 21 Sep, 2018 2 commits
  24. 20 Sep, 2018 3 commits
    • Revert "Revert "Merge pull request #13431 from chengduoZH/refine_lod"" · 6d2c6f96
      Yu Yang committed
      This reverts commit a6c8d6b9.
    • Revert "Merge pull request #13431 from chengduoZH/refine_lod" · a6c8d6b9
      Yu Yang committed
      This reverts commit bd79e046, reversing
      changes made to 6b4d290c.
    • Feature/op_fuse_pass (#12440) · d402234b
      chengduo committed
      * Add Preface
      
      * Add demo code
      
      * Save file
      
      * Refine code
      
      * seems can work
      
      * use elementwise strategy
      
      * Use ElementwiseComputeEx
      
      * Add comments
      
      * extract functions from operator
      
      * Refine code
      
      * Follow comment
      
      * code refine
      
      * add op_fuse  pass
      
      * add backward
      
      * code refine
      
      * use TopologySortOperations
      
      * follow comments
      
      * refine IsFusible
      
      * code enhance
      
      * fix op_fusion_pass
      
      * refine code
      
      * refine fuse_elemwise_act_op
      
      * adjust the input and output
      
      * refine logic
      
      * add intermediate_edge
      
      * disable inplace
      
      * follow comments
      
      * refine logic
      
      * follow comments
      
      * Remove the removable IntermediateOut
      
      * change strategy
      
      * code refine
      
      * enable fuse backward
      
      * code refine
      
      * code refine
      
      * rename unit test
      
      * follow comments
  25. 18 Sep, 2018 1 commit
  26. 06 Sep, 2018 1 commit
  27. 05 Sep, 2018 1 commit
    • Add centered mode rmsprop (#13161) · 6e03f790
      Qiao Longfei committed
      * rmsprop optimizer support v1 mode
      
      * typo
      
      * optimize code
      
      * refine code
      
      * optimize unit test
      
      * update test_rmsprop_op.py
      
      * update formula of rmsprop
      
      * optimize document
      
      * update API.spec for RMSPropOptimizer
      
      * add default value to check_output_with_place equal_nan
  28. 30 Aug, 2018 1 commit
    • Enhance fused_elementwise_activation_op (#12837) · 3bd1d22a
      chengduo committed
      * Enhance the function of fused_elementwise_activation_op
      
      * enhance unit test
      
      * Clean Code And Add Doc
      
      * Add compound functors
      
      * Fix doc and enhance unit test
      
      * define Dx and Dy for d_binary_func
      
      * add mul_scale
      
      * add mul_scale
      
      * add elementwise_mul
      
      * code refine
      
      * code refine
      
      * add doc
      
      * add  AsIntermediate
  29. 22 Aug, 2018 1 commit