1. 23 9月, 2020 1 次提交
  2. 26 4月, 2020 1 次提交
    • L
      improve efficiency of runtime InferVarType (#22778) · 9a93f6aa
      liuwei1031 提交于
      * save InferVarType changes, test=develop
      
      * remove code comments, test=develop
      
      * tweak code, test=develop
      
      * fix compilation warning, update merge_ids_op split_ids_op to new interface, test=develop
      
      * modify fused_bn_activation_op, test=develop
      
      * fix error of fused_bn_activation_op, test=develop
      
      * fix PADDLE_ENFORCE and unittest coverage issue, test=develop
      
      * tweak PADDLE_ENFORCE messages, test=develop
      
      * improve unittest coverage, test=develop
      
      * add StaticGraphInferVarType class, test=develop
      
      * rebase develop branch, test=develop
      
      * fix unittest error, test=develop
      
      * remove comments, test=develop
      
      * improve unittest coverage, test=develop
      
      * imporve error message and imporve unittest coverage, test=develop
      
      * upgrade InferVarType API, test=develop
      
      * tweak pyfunc error message, test=develop
      
      * fix compilation conflict - save_combine_op, test=develop
      9a93f6aa
  3. 16 1月, 2020 1 次提交
  4. 31 10月, 2019 1 次提交
    • H
      GradMaker for dygraph (#19706) · 8c4573a3
      hong 提交于
      * refactor dygraph,test=develop
      
      * fix failed unittest,test=develop
      
      * polish code,test=develop
      
      * check windows ci error,test=develop
      try to fix windows ci error by np.allclose,test=develop
      
      * polish vlog and profiler, test=develop
      
      * try to fix preceding ops order,test=develop
      
      * test transformer in windows ci, test=develop
      
      * use python c-api to speed up tracer.trace,test=develop
      
      * test=develop, fix docker with paddle nccl problem
      
      * test=develop, add ut for debug string and gradient_accumulator
      
      * test=develop, add tests for layer/gradient_accumulator/prepared_op
      
      * test=develop, fix complie error for test_prepared_op
      
      * test=develop, add more ut for dygraph
      
      * test=develop, create API.spec for dygraph api change
      
      * optimize grad maker; test=develop
      
      * optimize grad maker
      
      * test
      
      * grad make optim; test=develop
      
      * fix unittest bugs; test=develop
      
      * add dygraph grad op maker and split_op
      
      * grad op maker refactor; test=develop
      
      * add dygraph grad maker; test=develop
      
      * fix op deformable_conv_v1_op bug; test=develop
      
      * fix deformable_conv prroi pool bugs;
      
      * fix new op grad op maker bug; test=develop
      
      * fix split by ref bug; test=develop
      
      * fix dygraph auto prune bug; test=develop
      
      * fix test_trace bug; test=develop
      
      * fix fused emb seq pool bug; test=develop
      
      * remove useless code in op_desc file; test=develop
      
      * remove useless code, StrVarBaseNode; test=develop
      
      * fix review issues; test=develop
      
      * fix rank_loss grad maker; test=develop
      
      * remove flag in VarBase; test=develop
      
      * fix distributed_notify_op compile bug ; test=develop
      
      * fix reshape op double grad; test=develop
      
      * fix expand as op; test=develop
      
      * add impertive type_defs.h for demo_train; test=develop
      
      * fix inference lib cmake; test=develop
      
      * fix inference lib; test=develop
      
      * fix infernce_lib; test=develop
      
      * fix inference cmake; test=develop
      
      * fix inference lib; test=develop
      
      * fix inference lib; test=develop
      
      * remove condition dygraph grad maker, modify local name; test=develop
      
      * fix split grad maker bug; test=develop
      
      * fix pyramid_op bug; test=develop
      
      * change travis time out limit; test=develop
      
      * restore travis; test=develop
      
      * change timeout limit; test=develop
      8c4573a3
  5. 19 3月, 2019 1 次提交
  6. 18 3月, 2019 1 次提交
  7. 15 3月, 2019 1 次提交
  8. 26 11月, 2018 1 次提交
  9. 08 11月, 2018 1 次提交
  10. 08 5月, 2018 1 次提交
    • Y
      Clean OpProtoAndCheckerMaker · 0e78cb69
      Yu Yang 提交于
      Do not use ctor
      
      * Reduce line of codes.
      * We can use virtual function for Maker now.
      * The implementation does not care what maker holds, it is easier to
      refactor later.
      0e78cb69
  11. 13 2月, 2018 1 次提交
    • A
      Separate VarType from VarDesc in framework.proto and fix all related compiler errors (#8414) · fcadb452
      Abhinav Arora 提交于
      * Refine Type system
      
      * Fixing type inference
      
      * Fixed create_reader_op.cc
      
      * Fix var_desc.h
      
      * Fixed executor.cc
      
      * Fix shape_inference.h
      
      * Fixed create_reader_op.cc
      
      * Fix tensor_util.h
      
      * Fixed var_type_inference_test.cc
      
      * Fix shape_inference.cc
      
      * Fixed sum_op.c
      
      * Fixed read_op.cc
      
      * Fix var_type.h
      
      * Fixed beam_search_decode_op.cc
      
      * sendrecvop_utils.cc
      
      * Fix operator.cc
      
      * Fixed lookup_table_op.cc
      
      * Fixed op_desc.cc
      
      * Fixed get_places_op.cc
      
      * Fixed lod_rank_table_op.cc
      
      * Fixed beam_search_op.cc
      
      * Fix var_desc.cc
      
      * Fixed lod_tensor_to_array_op.cc
      
      * Fixed while_op.cc
      
      * Fix program_desc_test.cc
      
      * tensor_array_read_write_op.cc
      
      * Fix assign_op.cc
      
      * Fix executor.cc
      
      * Fix protobuf.cc
      
      * Fix protobuf.cc
      fcadb452
  12. 12 2月, 2018 1 次提交
  13. 10 2月, 2018 2 次提交
  14. 09 2月, 2018 1 次提交
  15. 08 1月, 2018 1 次提交
  16. 26 12月, 2017 1 次提交
  17. 24 12月, 2017 1 次提交
    • D
      Feature/operator run place (#6783) · 735eba29
      dzhwinter 提交于
      * "change operator interface"
      
      * "move devicepool to device_context"
      
      * "fix operator test"
      
      * "fix op_registry Run interface"
      
      * "net op passed. Need to fix nccl multi-Context"
      
      * "add nccl group function"
      
      * "add nccl group function"
      
      * "fix gpu count exceed 32 error"
      
      * "fix recurrent op, nccl op"
      
      * "change the other operators interface with Place"
      
      * "fix typo"
      
      * "fix pybind"
      
      * "fix device in python side"
      
      * "fix pybind failed"
      
      * "add init for test"
      
      * "fix CI"
      735eba29
  18. 21 12月, 2017 1 次提交
  19. 20 12月, 2017 2 次提交
  20. 10 11月, 2017 1 次提交
    • Y
      feature/while_op (#5502) · 40367d18
      Yang Yang(Tony) 提交于
      * first commit
      
      * Python API for while op
      
      * Python Unittest for simple while_op forward
      
      * fix out to be list
      
      * Fix UT
      
      * VarType
      
      * Fix several bugs
      
      * Fix bug
      
      * Fix bug
      
      * Fix Bug
      
      * Fix bug
      
      * Fix unittest
      
      * Remove debug log
      
      * Add comments
      
      * add PADDLE_ENFORCE
      
      * while_grad_op first commit
      
      * Add `BlockDescBind::FindRecursiveOrCreateVar()` and fix bugs
      
      * refine code
      
      * fix unittest bug
      40367d18
  21. 08 11月, 2017 1 次提交
    • Y
      Feature/rnn to array to lod tensor (#5411) · f72729d4
      Yu Yang 提交于
      * Add LoDRankTable
      
      LoD Rank Table stores the `level` of `lod` which is ordered by sequence
      length in descending order. It is useful when implement dynamic RNN and
      is shared by dynamic RNN memory, dynamic RNN slice input and dynamic
      RNN slice output operators.
      
      * Add skeleton for array_to_lod_tensor and lod_tensor_to_array
      
      * Add VarType::LoDTensorArray
      
      * Add PyBind of LoDTensorArray
      
      * Add InferVarType
      
      * Add first unittest
      
      * Add ut
      
      * Add unittest
      
      * Add unittest
      
      * Add unittests
      
      * update
      
      * init
      
      * add infershape for lod_tensor_to_array_op
      
      * compelete array_to_lod_tensor_op
      
      * copy data
      
      * clean code
      
      * clean code
      
      * Fix unittest data
      
      * fix bugs
      
      * fix compile error
      
      * Refine TensorToArrayOp
      
      * refactor array_to_lod_tensor
      
      * Unittest
      
      * fix bugs
      
      * Fix unittest
      
      * Fix unittest
      
      * debug
      
      * Debug
      
      * Fix unittest
      
      * clean code
      
      * refactor
      
      * use ostream
      
      * update test
      
      * fix gpu build error
      
      * make gpu test pass
      f72729d4
  22. 04 11月, 2017 1 次提交
    • Y
      Add LoDRankTable (#5349) · 74849158
      Yu Yang 提交于
      * Add LoDRankTable
      
      LoD Rank Table stores the `level` of `lod` which is ordered by sequence
      length in descending order. It is useful when implement dynamic RNN and
      is shared by dynamic RNN memory, dynamic RNN slice input and dynamic
      RNN slice output operators.
      
      * Add InferVarType
      74849158