1. 10 Nov, 2021 · 1 commit
    • [PTen] Compatible runtime performance optimization (#36946) · 76d2fd1d
      Chen Weihang committed
      * resolve conflict with develop
      
      * cache kernel context in tracer for perf up
      
      * replace densetensor when build kernel context
      
      * fix detail compile error
      
      * append impl to static mode
      
      * fix conflict error
      
      * clear attrs after run kernel
      
      * fix coverage failed
      
      * fix cycle compile error
      
      * remove multi-in&out adapt code
      
      * remove tensor meta utils
      
      * clear data when throw exception
  2. 13 Oct, 2021 · 1 commit
  3. 17 Sep, 2021 · 1 commit
    • [AMP] Support pure fp16 training mode for dygraph (#35521) · adaeee4d
      zhangbo9674 committed
      * add pure fp16 major function in auto_cast & tracer
      
      * support master weight in dygraph for pure fp16
      
      * check mix dtype of fp16&fp32 for check_finite_and_unscale op
      
      * change pure fp16 function name
      
      * refine some bug in auto_cast
      
      * refine auto_cast interface logic
      
      * add param _casted_by_pure_fp16 for class Layer
      
      * support state_dict hook for save model by user appointed dtype in pure_fp16_decorator
      
      * refine pure_fp16_decorator as decorator
      
      * add unittest
      
      * add comment
      
      * add comment
      
      * support recompute
      
      * add comment for auto_cast and decorator
      
      * support to_static_state_dict for paddle.jit.save
      
      * remove limit on models num and optimizers num
      
      * add lookup_table in black_list
      
      * fix momentum and layer state_dict
      
      * fix bug in layer state_dict
      
      * fix bug in layer state_dict_helper
      
      * refine unittest
      
      * refine test_momentun_op
      
      * refine interface and some code
      
      * refine amp_decorator interface
      
      * refine pure fp16 interface
      
      * refine master weight interface
  4. 05 Aug, 2021 · 1 commit
  5. 30 Jun, 2021 · 1 commit
  6. 24 Jun, 2021 · 1 commit
    • [NPU] support dygraph execution on npu place (#33579) · 6aea6be2
      houj04 committed
      * in NPU environment, use CPUPlace for missing operators.
      
      * in NPU environment, use CPUPlace for missing operators.
      
      * fix TensorCopy bug and add unit test.
      
      * fix code style.
      
      * add more unit tests.
  7. 23 Jun, 2021 · 1 commit
    • optimize attr default value (#33357) · 5d2eb678
      wanghuancoder committed
      * optimize attr default value, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * fix bug in AttrReader, test=develop
      
      * fix bug, test=develop
      
      * fix double_grad, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * fix checker null, test=develop
      
      * for test, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
  8. 28 Apr, 2021 · 1 commit
  9. 21 Apr, 2021 · 1 commit
  10. 15 Apr, 2021 · 1 commit
    • Customizable Python Layer in Dygraph (#32130) · 29f65225
      WeiXin committed
      * custom python backward
      
      * polish up the code
      
      * polish up the code
      
      * polish up the code.
      
      * Fix code format and comments.
      
      * Delete redundant files.
      
      * add unittest.

      * edit unittest.

      * edit unittest.
      
      * Remove redundant header files.
      
      * Improve coverage and remove redundant code.
      
      * support saving for backward.
      
      * polish code according to comments.
      
      * Add support type for PyLayer.
      
      * Modify the DOC.
      
      * polish Doc.
      
      * polish Doc.
      
      * polish Doc.
      
      * polish Doc.
      
      * polish Doc.
      
      * polish Doc.
      
      * polish code and make the code robust.
      
      * Modify the code format.
  11. 24 Feb, 2021 · 1 commit
  12. 22 Feb, 2021 · 1 commit
  13. 04 Feb, 2021 · 1 commit
  14. 20 Jan, 2021 · 1 commit
    • add some RecordEvent, for dygraph timeline (#30299) · d1b25ed9
      wanghuancoder committed
      * add some RecordEvent, for dygraph timeline, test=develop
      
      * change GpuMemcpySync to memory::Copy, test=develop
      
      * fix compile problem, test=develop
      
      * fix compile problem, test=develop
      
      * fix, test=develop
      
      * fix, test=develop
  15. 15 Jan, 2021 · 1 commit
    • Add Inplace strategy (Output reuse Input Varbase) in dygraph (#30103) · 13d75736
      pangyoki committed
      * add view strategy on squeeze,unsqueeze,reshape,flatten
      
      * add squeeze unittest
      
      * add unittests
      
      * use View strategy as name rather than Reuse Allocation
      
      * fix view api doc
      
      * fix format
      
      * use core.ops when input of reshape2 is Tensor
      
      * fix test_cross_entropy_loss error because of reshape2
      
      * fix test_cross_entropy_loss error because of reshape2
      
      * add inplace strategy
      
      * add elementwise_add sub
      
      * let backward op not use inplace
      
      * grad op do not use inplace
      
      * fix memory increase error and add leaf error message
      
      * delete selected_rows
      
      * change op_function
      
      * little change
      
      * solve HandleViewBetweenInputAndOutput
      
      * add unittest and leaf error message
      
      * merge view error
      
      * optimize op_function_generator format and support sum inplace op
      
      * fix format of basic_engine
      
      * fix format for framework
      
      * little change of variable wrapper
      
      * add reshape, squeeze, unsqueeze, scatter api
      
      * add relu elu tanh softmax inplace api
      
      * fix test_squeeze_op unittest
      
      * fix test_relu_op unittest
      
      * fix comment problems
      
      * delete sample code of inplace api
      
      * add reference of grad_pending_nodes in basic_engine
      
      * fix unittest name
      
      * add inplace apis into wlist
      
      * fix error message
      
      * add PADDLE_ENFORCE for set grad op twice
      
      * fix header file error
  16. 08 Jan, 2021 · 1 commit
    • Add callback after TensorCopy (#30123) · 1f97d61c
      Leo Chen committed
      * change to tensor copy sync
      
      * change to tensor copy sync
      
      * make copy_to safe when use TensorCopy
      
      * refine code
      
      * add ut
      
      * add cudapinned garbagecollector
      
      * add testcase: cpu place -> cuda pinned place
  17. 26 Nov, 2020 · 1 commit
  18. 28 Sep, 2020 · 1 commit
  19. 31 Aug, 2020 · 1 commit
    • Add use of global flag 'use_mkldnn' to layer_helper (#26497) · 885c61f0
      arlesniak committed
      * get use of global 'use_mkldnn' in layer_helper
      
      * update for CI
      
      * update for CI, relu test
      
      * update for CI, relu test added, make FLAGS_use_mkldnn a public flag
      
      * added more strict tests, fixes after review
      
      * fixes after review
      
      * fixes after review, CI stuff
  20. 13 Aug, 2020 · 1 commit
    • Feature/Enable Auto-Mixed-Precision in dynamic graph (#24903) · 2d95280e
      Leo Chen committed
      * add auto_cast, test=develop
      
      * add loss scaler, test=develop
      
      * add comments, test=develop
      
      * refine code, test=develop
      
      * refine code, test=develop
      
      * do not set flags automatically, test=develop
      
      * fix custom op bug, test=develop
      
      * add more test, test=develop
      
      * refine enable logic, test=develop
      
      * enable amp test with GPU, test=develop
      
      * add unittest
      
      * add test for found_inf
      
      * follow comments
      
      * follow comments
      
      * remove global variable, use singleton
      
      * add some notes
      
      * update comments
      
      * update comments
      
      * update comments
      
      * add use_dynamic_loss_scaling argument
      
      * refine found_inf
      
      * refine found_inf
  21. 03 Jun, 2020 · 1 commit
  22. 03 Apr, 2020 · 1 commit
  23. 20 Mar, 2020 · 1 commit
    • Add dygraph double grad implementation (#22939) · a31d7328
      Zeng Jinle committed
      * add double grad implementation for dygraph, test=develop
      
      * polish code, add uts, test=develop
      
      * fix place bug, test=develop
      
      * polish codes, add more uts for coverages, test=develop
      
      * add no_grad_set, test=develop
      
      * add star gan ut, test=develop
      
      * follow comments, test=develop
  24. 09 Mar, 2020 · 1 commit
  25. 10 Dec, 2019 · 1 commit
  26. 29 Nov, 2019 · 1 commit
    • Add dygraph execution context (#20157) · ac854670
      hong committed
      * add_dygraph_execution_context
      
      * add dygraph infershape context and execution context; test=develop
      
      * fix imperative bug; test=develop
      
      * remove inputs outputs interface from execution context, because it has the same function as inputNames; test=develop
      
      * remove tracer_test ctest; test=develop
      
      * fix split op bug; test=develop
      
      * fix unittests bug; test=develop
      
      * fix distribute test bug; test=develop
      
      * fix ngraph compile bug; test=develop
      
      * fix grad maker bug; test=develop
      
      * fix load op bugs; test=develop
      
      * fix operator.cc construct bug; test=develop
      
      * remove useless name find in operator; test=develop
      
      * add tracer_test; test=develop
      
      * fix concat, split bug; test=develop
      
      * remove tracer_test unitest; test=develop
      
      * fix attribute check bug; test=develop
      
      * add test code to fix coverage; test=develop

      * remove useless code, change check backward input in engine; test=develop
      
      * unlock var type infer shape;test=develop
      
      * add ShareAllLoD api; test=develop
      
      * add dygraph infershape context unitest; test=develop
      
      * remove increase and decrease lod in dygraph; test=develop
      
      * add override; test=develop

      * fix increase decrease lod; test=develop
      
      * fix paddle_enforce; test=develop
      
      * disable lod op dygraph check; test=develop
      
      * fix paddle enforce error; test=develop
      
      * add comment for op_registry and OperatorBase; test=develop
      
      * optimize the comment of op_registry; test=develop
      
      * fix format of comment; test=develop
      
      * fix format of comment; test=develop
      
      * optimize the format of comment; test=develop
      
      * optimize the format of the comment; test=develop
      
      * optimize comment of op_registry; test=develop
  27. 21 Nov, 2019 · 1 commit
    • open dygraph op test, test=develop (#19787) · c4ede95c
      zhongpu committed
      * open dygraph op test, test=develop
      
      * modify to_variable, test=develop
      
      * modify input and output for dygraph, test=develop
      
      * modify input and output for dygraph(fix bug), test=develop
      
      * fix input processing of dygraph op test, test=develop
      
      * fix bug, test=develop
      
      * fix op test, test=develop
      
      * fix forward bug for dygraph, test=develop
      
      * fix mkldnn op test for forward, test=develop
      
      * update nn.py for dygraph, test=develop
      
      * fix crop_tensor_op, test=develop
      
      * fix elementwise_mul_op, test=develop
      
      * fix fill_op, test=develop
      
      * fix some mkldnn op, test=develop
      
      * open backward op test for dygraph, test=develop
      
      * delete log, test=develop
      
      * close backward op test for dygraph, test=develop
      
      * fix bug for edit_distance_op and test_lstm_cudnn_op, test=develop
      
      * fix optest backward bug for dygraph, test=develop
      
      * fix optest backward bug for dygraph, test=develop
      
      * close backward op test for dygraph, test=develop
      
      * close backward op test for dygraph, test=develop
      
      * open dygraph op test, test=develop
      
      * fix op test for dygraph, fix GradOpDescMaker, test=develop
      
      * fix bug for linear_chain_crf_op.h, test=develop
      
      * remove log, test=develop
      
      * remove log, test=develop
      
      * remove log for op_test.py, test=develop
      
      * remove log for op_test.py, test=develop
      
      * fix bug for var_conv_2d_op, change PADDLE_ENFORCE, test=develop
      
      * fix PADDLE_ENFORCE_EQ for hierarchical_sigmoid_op.cc, test=develop
      
      * fix bug for test_increment_ngraph_op.py, test=develop
      
      * fix lod for op test in dygraph, test=develop
      
      * refactor op_test.py to reduce redundant code, test=develop
      
      * fix lod optest, modify InputVar/OutputVar to HasInput/HasOutput, test=develop
      
      * remove debug log, test=develop
      
      * remove redundant code in base.py, test=develop
      
      * fix some error in optest, test=develop
      
      * fix ClearNoNeedBufferInputs function's bug for LoDTensor, test=develop
      
      * refactor op_test.py, test=develop
      
      * remove redundant writing, test=develop
      
      * fix error(get tensor of the grad variable), test=develop
      
      * fix test_concat_mkldnn test_conv2d_mkldnn, test=develop
      
      * fix optest.py for get tensor of LoDTensor, test=develop
      
      * fix optest.py for get tensor of LoDTensor, test=develop
      
      * fix optest.py for get tensor of LoDTensor, test=develop
      
      * fix some redundant code, test=develop
      
      * resolve conflict and rewrite paddle error message, test=develop
  28. 05 Nov, 2019 · 2 commits
    • Add c++ global current tracer for dygraph (#20882) · 008ed65f
      Leo Chen committed
      * Add c++ global current tracer for dygraph, test=develop
      
      * add tracer property in c++, test=develop
      
      * support different place, test=develop
      
      * add unittest for tracer, test=develop
    • Support NoNeedBufferVarsInference in dygraph backward (#20868) · 878a40f5
      Zeng Jinle committed
      * support no need buffer vars in dygraph, test=develop
      
      * fix inference compilation error, test=develop
      
      * update no_need_buffer_vars_inference, test=develop
      
      * add unittests for no_need_buffer_vars_context, test=develop
      
      * refine no_need_buffer_vars by return ref, test=develop
      
      * polish some codes, test=develop
  29. 04 Nov, 2019 · 1 commit
  30. 31 Oct, 2019 · 1 commit
    • GradMaker for dygraph (#19706) · 8c4573a3
      hong committed
      * refactor dygraph,test=develop
      
      * fix failed unittest,test=develop
      
      * polish code,test=develop
      
      * check windows ci error,test=develop
        try to fix windows ci error by np.allclose,test=develop
      
      * polish vlog and profiler, test=develop
      
      * try to fix preceding ops order,test=develop
      
      * test transformer in windows ci, test=develop
      
      * use python c-api to speed up tracer.trace,test=develop
      
      * test=develop, fix docker with paddle nccl problem
      
      * test=develop, add ut for debug string and gradient_accumulator
      
      * test=develop, add tests for layer/gradient_accumulator/prepared_op
      
      * test=develop, fix compile error for test_prepared_op
      
      * test=develop, add more ut for dygraph
      
      * test=develop, create API.spec for dygraph api change
      
      * optimize grad maker; test=develop
      
      * optimize grad maker
      
      * test
      
      * grad make optim; test=develop
      
      * fix unittest bugs; test=develop
      
      * add dygraph grad op maker and split_op
      
      * grad op maker refactor; test=develop
      
      * add dygraph grad maker; test=develop
      
      * fix op deformable_conv_v1_op bug; test=develop
      
      * fix deformable_conv prroi pool bugs;
      
      * fix new op grad op maker bug; test=develop
      
      * fix split by ref bug; test=develop
      
      * fix dygraph auto prune bug; test=develop
      
      * fix test_trace bug; test=develop
      
      * fix fused emb seq pool bug; test=develop
      
      * remove useless code in op_desc file; test=develop
      
      * remove useless code, StrVarBaseNode; test=develop
      
      * fix review issues; test=develop
      
      * fix rank_loss grad maker; test=develop
      
      * remove flag in VarBase; test=develop
      
      * fix distributed_notify_op compile bug ; test=develop
      
      * fix reshape op double grad; test=develop
      
      * fix expand as op; test=develop
      
      * add imperative type_defs.h for demo_train; test=develop
      
      * fix inference lib cmake; test=develop
      
      * fix inference lib; test=develop
      
      * fix inference_lib; test=develop
      
      * fix inference cmake; test=develop
      
      * fix inference lib; test=develop
      
      * fix inference lib; test=develop
      
      * remove condition dygraph grad maker, modify local name; test=develop
      
      * fix split grad maker bug; test=develop
      
      * fix pyramid_op bug; test=develop
      
      * change travis time out limit; test=develop
      
      * restore travis; test=develop
      
      * change timeout limit; test=develop
  31. 24 Oct, 2019 · 1 commit
  32. 23 Oct, 2019 · 1 commit
  33. 21 Sep, 2019 · 1 commit
    • Feature/auto prune in dygraph (#19757) · 45425411
      Jiabin Yang committed
      * refactor dygraph,test=develop
      
      * fix failed unittest,test=develop
      
      * polish code,test=develop
      
      * check windows ci error,test=develop
        try to fix windows ci error by np.allclose,test=develop
      
      * polish vlog and profiler, test=develop
      
      * try to fix preceding ops order,test=develop
      
      * test transformer in windows ci, test=develop
      
      * use python c-api to speed up tracer.trace,test=develop
      
      * test=develop, fix docker with paddle nccl problem
      
      * test=develop, add ut for debug string and gradient_accumulator
      
      * test=develop, add tests for layer/gradient_accumulator/prepared_op
      
      * test=develop, fix compile error for test_prepared_op
      
      * test=develop, add more ut for dygraph
      
      * test=develop, create API.spec for dygraph api change
      
      * test=develop, refactor name to make it easier to understand

      * test=develop, refactor name to make it easier to understand

      * test=develop, fix multi-gpu failed problem, add Tracer tests, change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ
      
      * test=develop, fix ut failed on parallel se-resnext
      
      * test=develop, change one more PADDLE_ENFORCE
      
      * support auto prune in dygraph mode
      
      * test=develop, support auto prune
      
      * test=develop, merge develop conflict
      
      * test=develop, fix test_layer and test_tracer ut
      
      * test=develop, fix bug which may cause stop_gradient disabled with a list of backward inputs
  34. 17 Sep, 2019 · 1 commit
    • Feature/add transform data dygraph (#19707) · cc311bdf
      Jiabin Yang committed
      * refactor dygraph,test=develop
      
      * fix failed unittest,test=develop
      
      * polish code,test=develop
      
      * check windows ci error,test=develop
        try to fix windows ci error by np.allclose,test=develop
      
      * polish vlog and profiler, test=develop
      
      * try to fix preceding ops order,test=develop
      
      * test transformer in windows ci, test=develop
      
      * use python c-api to speed up tracer.trace,test=develop
      
      * test=develop, fix docker with paddle nccl problem
      
      * test=develop, add ut for debug string and gradient_accumulator
      
      * test=develop, add tests for layer/gradient_accumulator/prepared_op
      
      * test=develop, fix compile error for test_prepared_op
      
      * test=develop, add more ut for dygraph
      
      * test=develop, create API.spec for dygraph api change
      
      * add transform_data to dygraph
      
      * test=develop, refactor name to make it easier to understand

      * test=develop, refactor name to make it easier to understand

      * add test and change input to const ref for safety

      * test=develop, fix multi-gpu failed problem, add Tracer tests, change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ
      
      * add ut for data transform
      
      * refine ut for data_transform
      
      * test=develop, fix ut failed on parallel se-resnext
      
      * test=develop, change one more PADDLE_ENFORCE
      
      * add test_tracer on multiple devices
      
      * test=develop, change place to mutable for data transform
      
      * test=develop, add transform data on same place test and remove useless log
      
      * test=develop, Add TODO for data layout and ut for conv2d with no bias
  35. 05 Sep, 2019 · 1 commit
    • Refactor dygraph (#19107) · e9233d1c
      Jiabin Yang committed
      * refactor dygraph,test=develop
      
      * fix failed unittest,test=develop
      
      * polish code,test=develop
      
      * check windows ci error,test=develop
        try to fix windows ci error by np.allclose,test=develop
      
      * polish vlog and profiler, test=develop
      
      * try to fix preceding ops order,test=develop
      
      * test transformer in windows ci, test=develop
      
      * use python c-api to speed up tracer.trace,test=develop
      
      * test=develop, fix docker with paddle nccl problem
      
      * test=develop, add ut for debug string and gradient_accumulator
      
      * test=develop, add tests for layer/gradient_accumulator/prepared_op
      
      * test=develop, fix compile error for test_prepared_op
      
      * test=develop, add more ut for dygraph
      
      * test=develop, create API.spec for dygraph api change
      
      * test=develop, refactor name to make it easier to understand

      * test=develop, refactor name to make it easier to understand

      * test=develop, fix multi-gpu failed problem, add Tracer tests, change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ
      
      * test=develop, fix ut failed on parallel se-resnext
      
      * test=develop, change one more PADDLE_ENFORCE
  36. 05 Jun, 2019 · 1 commit
    • Feature/Fix recurrent usage of Varbase in Dygraph (#17838) · 3d3f5506
      Jiabin Yang committed
      * for debug
      
      * test=develop, memory optimize for dygraph using shared_ptr
      
      * test=develop, fix travis ci showed error
      
      * test=develop, fix bug for recurrent usage of varbase
      
      * test=develop, init varbase when it need to be Add
      
      * test=develop, fix problem of recurrent gradient
      
      * test=develop, add gradient test for recurrent varbase usage
  37. 04 Jun, 2019 · 1 commit
  38. 29 May, 2019 · 1 commit
  39. 27 May, 2019 · 1 commit