1. 16 1月, 2020 1 次提交
  2. 11 11月, 2019 1 次提交
    • Y
      Add the check of lod_level between compile-time and runtime. (#20961) · 35f17ae2
      Yiqun Liu 提交于
      * Add the check of lod_level between compile-time and runtime.
      test=develop
      
      * Fix bug in check_compile_vs_runtime.
      test=develop
      
      * Fix the check of output when it is dispensiable or intermediate.
      test=develop
      
      * Share lod of x to out in match_matrix_tensor op in compile-time.
      
      * Implement GetLoDLevel in InferShapeContext.
      
      * Set the default value of check_compile_vs_runtime to False and enable it in test_sequence_pad_op.
      test=develop
      
      * Enable check_compile_vs_runtime in test_match_matrix_tensor.
      
      * Add the implementation of SetLoDLevel in InferShapeContext.
      
      * Remove the implementation of IncreaseLoDLevel and call Get/SetLoDLevel instead.
      
      * Remove the implementation of DecreaseLoDLevel and call Set/GetLoDLevel instead.
      
      * Refine some ops and unittests.
      test=develop
      
      * Fix a typo.
      test=develop
      
      * Remove the check of var type, and change int to int32_t.
      test=develop
      
      * Add unittest for Get/SetLoDLevel.
      test=develop
      35f17ae2
  3. 31 10月, 2019 1 次提交
    • H
      GradMaker for dygraph (#19706) · 8c4573a3
      hong 提交于
      * refactor dygraph,test=develop
      
      * fix failed unittest,test=develop
      
      * polish code,test=develop
      
      * check windows ci error,test=develop
      try to fix windows ci error by np.allclose,test=develop
      
      * polish vlog and profiler, test=develop
      
      * try to fix preceding ops order,test=develop
      
      * test transformer in windows ci, test=develop
      
      * use python c-api to speed up tracer.trace,test=develop
      
      * test=develop, fix docker with paddle nccl problem
      
      * test=develop, add ut for debug string and gradient_accumulator
      
      * test=develop, add tests for layer/gradient_accumulator/prepared_op
      
      * test=develop, fix complie error for test_prepared_op
      
      * test=develop, add more ut for dygraph
      
      * test=develop, create API.spec for dygraph api change
      
      * optimize grad maker; test=develop
      
      * optimize grad maker
      
      * test
      
      * grad make optim; test=develop
      
      * fix unittest bugs; test=develop
      
      * add dygraph grad op maker and split_op
      
      * grad op maker refactor; test=develop
      
      * add dygraph grad maker; test=develop
      
      * fix op deformable_conv_v1_op bug; test=develop
      
      * fix deformable_conv prroi pool bugs;
      
      * fix new op grad op maker bug; test=develop
      
      * fix split by ref bug; test=develop
      
      * fix dygraph auto prune bug; test=develop
      
      * fix test_trace bug; test=develop
      
      * fix fused emb seq pool bug; test=develop
      
      * remove useless code in op_desc file; test=develop
      
      * remove useless code, StrVarBaseNode; test=develop
      
      * fix review issues; test=develop
      
      * fix rank_loss grad maker; test=develop
      
      * remove flag in VarBase; test=develop
      
      * fix distributed_notify_op compile bug ; test=develop
      
      * fix reshape op double grad; test=develop
      
      * fix expand as op; test=develop
      
      * add impertive type_defs.h for demo_train; test=develop
      
      * fix inference lib cmake; test=develop
      
      * fix inference lib; test=develop
      
      * fix infernce_lib; test=develop
      
      * fix inference cmake; test=develop
      
      * fix inference lib; test=develop
      
      * fix inference lib; test=develop
      
      * remove condition dygraph grad maker, modify local name; test=develop
      
      * fix split grad maker bug; test=develop
      
      * fix pyramid_op bug; test=develop
      
      * change travis time out limit; test=develop
      
      * restore travis; test=develop
      
      * change timeout limit; test=develop
      8c4573a3
  4. 29 10月, 2019 1 次提交
    • Y
      Check and correct the output's lod_level in DynamicRNN related operators (#19144) · 6fcfd32e
      Yiqun Liu 提交于
      * Refine the InferShape of ReadFrom and WriteTo op, and add comment to explain why not call ShareLoD for runtime.
      test=develop
      
      * Add comment for ReorderLoDTensorByRank op.
      
      * Add comment for lod_tensor_to_tensor_array op to explain why only call DecreaseLoDLevel for compile time.
      test=develop
      
      * ShrinkRNNMemory op should call ShareLoD for compile time.
      test=develop
      
      * Add the implementation of IncreaseLoDLevel and add the compile-time check of lod_level in InferShape of sequence_pool.
      test=develop
      
      * Refine the unittest of DynamicRNN.
      test=develop
      
      * Change PADDLE_ENFORCE to PADDLE_ENFORCE_NE.
      test=develop
      6fcfd32e
  5. 12 12月, 2018 1 次提交
  6. 26 11月, 2018 1 次提交
  7. 08 11月, 2018 1 次提交
  8. 23 10月, 2018 1 次提交
    • C
      Refine Split op (#13967) · a7497653
      chengduo 提交于
      * speedup split_op
      test=develop
      
      * speedup split_op
      test=develop
      
      * rename ConcatGrad to Split
      
      * refine concat and split
      test=develop
      
      * fix compile error
      a7497653
  9. 19 9月, 2018 1 次提交
  10. 18 9月, 2018 1 次提交
  11. 08 5月, 2018 1 次提交
    • Y
      Clean OpProtoAndCheckerMaker · 0e78cb69
      Yu Yang 提交于
      Do not use ctor
      
      * Reduce line of codes.
      * We can use virtual function for Maker now.
      * The implementation does not care what maker holds, it is easier to
      refactor later.
      0e78cb69
  12. 15 2月, 2018 1 次提交
    • Y
      Update tensor_util.h (#8422) · cfffb1a3
      Yi Wang 提交于
      * Update tensor_util.h
      
      * Update with moved TensorDesc
      
      * Fix tensur_utils.cu
      
      * Update
      
      * Update
      
      * Update
      
      * Update
      
      * Make tensor_util.cu a symbolic link
      cfffb1a3
  13. 12 2月, 2018 1 次提交
  14. 10 2月, 2018 2 次提交
  15. 09 2月, 2018 1 次提交
  16. 09 1月, 2018 1 次提交
  17. 27 12月, 2017 4 次提交
  18. 26 12月, 2017 1 次提交
  19. 24 12月, 2017 1 次提交
    • D
      Feature/operator run place (#6783) · 735eba29
      dzhwinter 提交于
      * "change operator interface"
      
      * "move devicepool to device_context"
      
      * "fix operator test"
      
      * "fix op_registry Run interface"
      
      * "net op passed. Need to fix nccl multi-Context"
      
      * "add nccl group function"
      
      * "add nccl group function"
      
      * "fix gpu count exceed 32 error"
      
      * "fix recurrent op, nccl op"
      
      * "change the other operators interface with Place"
      
      * "fix typo"
      
      * "fix pybind"
      
      * "fix device in python side"
      
      * "fix pybind failed"
      
      * "add init for test"
      
      * "fix CI"
      735eba29
  20. 21 12月, 2017 1 次提交
  21. 20 12月, 2017 1 次提交
  22. 26 11月, 2017 1 次提交
    • D
      Feature/copytensor (#5455) · 45062fe5
      dzhwinter 提交于
      * "make global tensor function independently"
      
      * "replace functor"
      
      * "fix inline template error"
      
      * "fix tensor array with CopyFrom"
      
      * "fix other case use CopyFrom"
      
      * "move the op interface hardly"
      
      * "fix operators"
      
      * "fix typo"
      
      * "delete dynamic recurrent rnn and fix gru_unit in debugmode"
      
      * "fix unique_ptr copy"
      
      * "fix cuda copy"
      
      * "fix namespace error"
      
      * "removed nccl python test"
      
      * "fix include error"
      
      * "fix typo"
      
      * fix copy util test
      45062fe5
  23. 09 11月, 2017 1 次提交
    • F
      Add grad for lodtensor array ops (#5461) · b698d19b
      fengjiayi 提交于
      * Add LoDRankTable
      
      LoD Rank Table stores the `level` of `lod` which is ordered by sequence
      length in descending order. It is useful when implement dynamic RNN and
      is shared by dynamic RNN memory, dynamic RNN slice input and dynamic
      RNN slice output operators.
      
      * Add skeleton for array_to_lod_tensor and lod_tensor_to_array
      
      * Add VarType::LoDTensorArray
      
      * Add PyBind of LoDTensorArray
      
      * Add InferVarType
      
      * Add first unittest
      
      * Add ut
      
      * Add unittest
      
      * Add unittest
      
      * Add unittests
      
      * update
      
      * init
      
      * add infershape for lod_tensor_to_array_op
      
      * compelete array_to_lod_tensor_op
      
      * copy data
      
      * clean code
      
      * clean code
      
      * Fix unittest data
      
      * fix bugs
      
      * fix compile error
      
      * Refine TensorToArrayOp
      
      * refactor array_to_lod_tensor
      
      * Unittest
      
      * fix bugs
      
      * Fix unittest
      
      * Fix unittest
      
      * debug
      
      * Debug
      
      * Fix unittest
      
      * Add grad for ops
      
      * Debug
      
      * Fix a bug
      
      * fix a bug
      
      * fix a bug
      b698d19b
  24. 08 11月, 2017 1 次提交
    • Y
      Feature/rnn to array to lod tensor (#5411) · f72729d4
      Yu Yang 提交于
      * Add LoDRankTable
      
      LoD Rank Table stores the `level` of `lod` which is ordered by sequence
      length in descending order. It is useful when implement dynamic RNN and
      is shared by dynamic RNN memory, dynamic RNN slice input and dynamic
      RNN slice output operators.
      
      * Add skeleton for array_to_lod_tensor and lod_tensor_to_array
      
      * Add VarType::LoDTensorArray
      
      * Add PyBind of LoDTensorArray
      
      * Add InferVarType
      
      * Add first unittest
      
      * Add ut
      
      * Add unittest
      
      * Add unittest
      
      * Add unittests
      
      * update
      
      * init
      
      * add infershape for lod_tensor_to_array_op
      
      * compelete array_to_lod_tensor_op
      
      * copy data
      
      * clean code
      
      * clean code
      
      * Fix unittest data
      
      * fix bugs
      
      * fix compile error
      
      * Refine TensorToArrayOp
      
      * refactor array_to_lod_tensor
      
      * Unittest
      
      * fix bugs
      
      * Fix unittest
      
      * Fix unittest
      
      * debug
      
      * Debug
      
      * Fix unittest
      
      * clean code
      
      * refactor
      
      * use ostream
      
      * update test
      
      * fix gpu build error
      
      * make gpu test pass
      f72729d4