1. 28 4月, 2022 1 次提交
    • Z
      [cherry-pick] Optimize performance of dygraph (#42196) (#42329) · 2ea56c90
      zyfncg 提交于
      * Optimize performance of dygraph (v4)  (#42196)
      
      * optimize performance of dygraph
      
      * optimize performance of dygraph and elementwise_add
      
      * optimize the trace op
      
      * fix bug
      
      * fix bug
      
      * fix unittest bug
      
      * fix code format
      
      * fix cherry-pick problem
      2ea56c90
  2. 28 3月, 2022 1 次提交
  3. 21 2月, 2022 1 次提交
    • C
      Update record interface using part2 (#39694) · c984cd85
      chenjian 提交于
      * fix RecordEvent interface
      
      * modify default level to 4
      
      * update interface use
      
      * add const default trace level
      
      * update record event interface using
      
      * update record event interface using
      
      * update operator.cc
      
      * update part2
      
      * update part1
      
      * fix include profiler.h header in ps server
      
      * fix include profiler.h header in ps server
      
      * fix profiler.h header
      c984cd85
  4. 20 2月, 2022 1 次提交
  5. 16 2月, 2022 1 次提交
    • J
      EagerTensor to EagerVariable (#39447) · 831fd86e
      Jiabin Yang 提交于
      * merge legacy to fluid
      
      * Remove legacy code
      
      * Remove legacy code
      
      * Remove DataType test
      
      * Using Tensor directly instead of using EagerTensor
      
      * support gradient_accumulation
      
      * make test_imperative_lod_tensor_to_selected_rows longer
      
      * make test_imperative_lod_tensor_to_selected_rows longer
      
      * refine code
      
      * Rename all EagerTensor to Tensor
      
      * Rename some EagerTensor to Tensor
      
      * rename EagerTensor to EagerVariable
      
      * add more test
      
      * merge develop and refine code
      831fd86e
  6. 15 2月, 2022 1 次提交
    • A
      [PTen]Migrate proto::VarType outside of Pten (#39411) · 7e7e9404
      Aurelius84 提交于
      * #1 migrate dist-related type()-> dtype()
      
      * move datatype function from pten -> fluid/framework
      
      * change type() in imperative into convert(dtype())
      
      * modify xx_tensor->type into xx_tensor->dtype
      
      * change the set_type interface and the caller
      
      * modify xx_tensor.type into xx_tensor.dtype
      
      * fix mutable_data(place, dtype())
      
      * change caller of mutable_data in pten and distributed
      
      * change the caller of mutable_data in fluid/framework
      
      * change the caller of mutable_data in imperative directory
      
      * mutable_data: inference
      
      * update the call of mutable_data
      
      * transfer MakePenScalarArray MakePtenScalar ResetHolderWithType
      
      * pass the compile. the next step is remove VarType in Pten
      
      * fix all and remove VarType from pten. success in linux. Next task is other platform
      
      * fix conflict with develop
      
      * fix compiled error
      
      * Fix reset conversion
      
      * fix conflict
      
      * fix compiled problem
      
      * fix typo
      
      * Fix << in tensor_utils.cc
      
      * fix type->dtype
      
      * fix unittest
      
      * fix tensor init constructor
      
      * fix DataTypeSize for BFloat16
      
      * fix code style
      
      * fix npu compiled error
      
      * fix npu
      
      * compile npu sucessfully
      
      * fix conflict
      
      * fix conflict
      Co-authored-by: Nxiongkun <xiongkun03@baidu.com>
      7e7e9404
  7. 11 2月, 2022 1 次提交
  8. 02 2月, 2022 1 次提交
  9. 25 1月, 2022 1 次提交
    • W
      [Move selected_rows PR #3] Change the relationship of [include/Cmake]. (#39128) · 2bafd338
      Weilong Wu 提交于
      * Added selected_rows and rw_lock to pten
      
      * Renamed the unit test target to fix CI
      
      * Removed Class SelectedRows in Fluid, changed include/cmake relationship, use pten::SelectedRows in Fluid
      
      * Remove rw_lock.h,rw_lock_test.cc in fluid
      
      * Use pten::RWLock and pten::AutoRDLock, fix CI
      
      * Use pten::SelectedRows
      
      * Use pten::SelectedRows
      
      * Fix to pass NPU CI
      
      * Use pten::SelectedRows, to pass NPU CI
      
      * To fix NPU CI
      
      * To fix NPU CI again
      2bafd338
  10. 20 1月, 2022 1 次提交
  11. 17 1月, 2022 1 次提交
    • W
      [Pten] Replace platform::Place to pten::Place. (#38899) · c48a9ad5
      Wilber 提交于
      * add pten::Place data structure.
      
      * update ci problem
      
      * fix ci problem
      
      * update
      
      * using platform::Place=pten::Place
      
      * remove BOOST_GET_CONST for CPUPlace and GPUPlace
      
      * compile pass 25%.
      
      * compile pass 45%
      
      * compile pass 60%
      
      * remove boost_get for xpu npu mlu and ipu
      
      * compile pass on cpu and gpu.
      
      * fix compile problem
      
      * fix compile error.
      
      * update
      
      * fix ci problem
      
      * update
      
      * ci approve
      
      * fix ci problem
      
      * fix ci eager test problem
      
      * remove BOOST_GET_CONST
      
      * fix npu compile
      c48a9ad5
  12. 15 1月, 2022 1 次提交
  13. 22 11月, 2021 1 次提交
  14. 16 11月, 2021 1 次提交
  15. 11 11月, 2021 2 次提交
    • W
      [Bug fixes] Add default arg to enhance varbase ClearGradient func (#36837) · 63f5c2d4
      Weilong Wu 提交于
      * Add default arg to enhance varbase ClearGradient func
      
      * Removed default arg, use a Flag to enhance varbase ClearGradient func
      
      * Renamed Flags to FLAGS_real_release
      
      * Use default arg to enhance varbase ClearGradient func and expose two func to set/get gradient isEmpty
      
      * Removed DECLARE_bool statement
      
      * Polished Code
      63f5c2d4
    • W
      [New features] Support VarBase to expose func (#36965) · 52645667
      Weilong Wu 提交于
      * Expose func for varbase
      
      * Expose func for varbase and enhance varbase init func
      
      * Change func name and add test case for _CopyGradientWith
      
      * Rename func
      
      * Add test cases to increase coverage
      
      * Refine the logic of _to func
      
      * Replace numel() with _numel(), Add test code
      52645667
  16. 10 11月, 2021 1 次提交
    • C
      [PTen] Compatible runtime performance optimization (#36946) · 76d2fd1d
      Chen Weihang 提交于
      * resolve conflit with develop
      
      * cache kernel context in tracer for perf up
      
      * replace densetensor when build kernel context
      
      * fix detail compile error
      
      * append impl to static mode
      
      * fix conflit error
      
      * clear attrs after run kernel
      
      * fix coverage failed
      
      * fix cycle compile error
      
      * remove multi-in&out adapt code
      
      * remove tensor meta utils
      
      * clear data when throw exception
      76d2fd1d
  17. 12 8月, 2021 1 次提交
  18. 23 6月, 2021 1 次提交
    • W
      optimize attr default value (#33357) · 5d2eb678
      wanghuancoder 提交于
      * optimize attr default value, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * fix bug in AttrReader, test=develop
      
      * fix bug, test=develop
      
      * fix double_grad, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * fix checker null, test=develop
      
      * for test, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      5d2eb678
  19. 25 4月, 2021 1 次提交
  20. 15 4月, 2021 1 次提交
    • W
      Customizable Python Layer in Dygraph (#32130) · 29f65225
      WeiXin 提交于
      * custom python backward
      
      * polish up the code
      
      * polish up the code
      
      * polish up the code.
      
      * Fix code format and comments.
      
      * Delete redundant files.
      
      * add unnittest.
      
      * edit unnittest.
      
      * edit unnittest.
      
      * Remove redundant header files.
      
      * Improve coverage and remove redundant code.
      
      * support saving for backward.
      
      * polish code according to comments.
      
      * Add support type for PyLayer.
      
      * Modify the DOC.
      
      * polish Doc.
      
      * polish Doc.
      
      * polish Doc.
      
      * polish Doc.
      
      * polish Doc.
      
      * polish Doc.
      
      * polish code and make the code robust.
      
      * Modify the code format.
      29f65225
  21. 04 2月, 2021 1 次提交
  22. 20 1月, 2021 1 次提交
    • W
      add some RecordEvent, for dygraph timeline (#30299) · d1b25ed9
      wanghuancoder 提交于
      * add some RecordEvent, for dygraph timeline, test=develop
      
      * change GpuMemcpySync to memory::Copy, test=develop
      
      * fix compile problem, test=develop
      
      * fix compile problem, test=develop
      
      * fix, test=develop
      
      * fix, test=develop
      d1b25ed9
  23. 15 1月, 2021 1 次提交
    • P
      Add Inplace strategy (Output reuse Input Varbase) in dygraph (#30103) · 13d75736
      pangyoki 提交于
      * add view strategy on squeeze,unsqueeze,reshape,flatten
      
      * add squeeze unittest
      
      * add unittests
      
      * use View strategy as name rather than Reuse Allacation
      
      * fix view api doc
      
      * fix format
      
      * use core.ops when input of reshape2 is Tensor
      
      * fix test_cross_entropy_loss error because of reshape2
      
      * fix test_cross_entropy_loss error because of reshape2
      
      * add inplace strategy
      
      * add elementwise_add sub
      
      * let backward op not use inplace
      
      * grad op do not use inplace
      
      * fix memory increase error and add leaf error message
      
      * delete selected_rows
      
      * change op_function
      
      * little change
      
      * solve HandleViewBetweenInputAndOutput
      
      * add unittest and leaf error message
      
      * merge view error
      
      * optimize op_function_generator format and support sum inplace op
      
      * fix format of basic_engine
      
      * fix format for framework
      
      * little change of variable wrapper
      
      * add reshape, squeeze, unsqueeze, scatter api
      
      * add relu elu tanh softmax inplace api
      
      * fix test_squeeze_op unittest
      
      * fix test_relu_op unittest
      
      * fix comment problems
      
      * delete sample code of inplace api
      
      * add reference of grad_pending_nodes in basic_engine
      
      * fix unittest name
      
      * add inplace apis into wlist
      
      * fix error message
      
      * add PADDLE_ENFORCE for set grad op twice
      
      * fix head file error
      13d75736
  24. 08 1月, 2021 2 次提交
    • L
      Fix dtype of ungenerated grad var (#28511) · 8696335f
      Leo Chen 提交于
      * fix dtype of ungenerated grad var
      
      * update ut
      
      * refine code
      
      * set default dtype
      
      * fix could_use_cudnn bug
      
      * remove debug code
      
      * re-implement
      
      * fix bug
      8696335f
    • L
      Add callback after TensorCopy (#30123) · 1f97d61c
      Leo Chen 提交于
      * change to tensor copy sync
      
      * change to tensor copy sync
      
      * make copy_to safe when use TensorCopy
      
      * refine code
      
      * add ut
      
      * add cudapinned garbagecollector
      
      * add testcase: cpu place -> cuda pinned place
      1f97d61c
  25. 07 1月, 2021 1 次提交
  26. 25 12月, 2020 1 次提交
    • C
      [Complex] Handle complex to real after type promotion (#29855) · a6072055
      Chen Weihang 提交于
      * try to add fwd op input dtypes
      
      * refactor base impl
      
      * return tmp_ins after dygraph prepare data
      
      * fix typo found in debug
      
      * polish comment & add complex net test
      
      * revert detail change
      
      * fix unittest failed
      
      * add complex kernel condition control
      
      * fix xpu test failed & polish comment
      
      * polish details by review comments
      a6072055
  27. 09 12月, 2020 1 次提交
  28. 01 12月, 2020 1 次提交
  29. 30 11月, 2020 1 次提交
    • L
      Check whether there is any inplace operation affecting gradient calculation. (#27901) · 865a4598
      liym27 提交于
      * Add a class TensorInplaceVersion to count the inplace version and put it in framework::Tensor instead of Allocation or Variable.
      
      * Add a new attribute `_inplace_version` for VarBase.
      
      * Raise exception if an inplace operation can result in incorrect gradient computation.
      
      * Add a new interface _bump_inplace_version() for VarBase to bump the version whenever the Tensor is modified through an inplace operation.
      
      * For api assign, call _bump_inplace_version() when it's an inplace operation inn dynamic mode.
      
      * Use original var_wrapper if the inplace_version is not changed.
      
      * Replace SnapshotVarWrapperList with SnapshotVarWrapper to optimize performane.
      865a4598
  30. 28 7月, 2020 1 次提交
    • A
      Added DNNL cache management for DyGraph (#25624) · e52df3b1
      arlesniak 提交于
      * Added DNNL cache management for DyGraph
      
      * move FLAGS_use_mkldnn to more general CMakeLists, getu use of the flag in ClearGradients
      
      * missing file
      
      * Fixes after review
      
      * Bringing back original idea of place for 'use_mkldnn' flag to be accessible from platform nad imperative.
      
      * Removed duplicate and added docs
      
      * Fixes for CI
      e52df3b1
  31. 15 4月, 2020 1 次提交
  32. 10 4月, 2020 1 次提交
  33. 22 3月, 2020 1 次提交
  34. 20 3月, 2020 1 次提交
    • Z
      Add dygraph double grad implementation (#22939) · a31d7328
      Zeng Jinle 提交于
      * add double grad implementation for dygraph, test=develop
      
      * polish code, add uts, test=develop
      
      * fix place bug, test=develop
      
      * polish codes, add more uts for coverages, test=develop
      
      * add no_grad_set, test=develop
      
      * add star gan ut, test=develop
      
      * follow comments, test=develop
      a31d7328
  35. 09 3月, 2020 1 次提交
  36. 05 3月, 2020 1 次提交
    • H
      reduce default attrs for dynamic graph (#22850) · 5191e544
      hong 提交于
      * reduce default attrs for dynamic graph, test=develop
      
      * add some explanations for explicit attr, test=develop
      
      * tweak explicit attr comments, test=develop
      5191e544
  37. 27 12月, 2019 1 次提交
  38. 05 12月, 2019 1 次提交
    • L
      Split VarBase from Python Variable for Dygraph (#21359) · cdd46d7e
      Leo Chen 提交于
      * test=develop, fix docker with paddle nccl problem
      
      * don't expose numerous Tensor.set(), test=develop
      
      * fix condition, test=develop
      
      * fix float16 bug, test=develop
      
      * feed should be Tensor or np.array, not Variable or number, test=develop
      
      * use forcecast to copy numpy slice to new array, test=develop
      
      * remove float16-uint16 hacking, test=develop
      
      * add variable method to varbase and refactor to_variable to support return varbase
      
      * support kwargs in varbase constructor
      
      * add VarBase constructor to support default python args
      
      * refine varbase initial method
      
      * reset branch
      
      * fix ut for change VarBase error info to PaddleEnforce
      
      * cherry is parameter change before
      
      * overload isinstance to replace too many change of is_variable
      
      * rm useless files
      
      * rm useless code merged by git
      
      * test=develop, fix some ut failed error
      
      * test=develop, fix test_graph_wrapper
      
      * add some tests, test=develop
      
      * refine __getitem__, test=develop
      
      * add tests, test=develop
      
      * fix err_msg, test=develop
      cdd46d7e