1. 27 3月, 2022 2 次提交
    • H
      Move slice to phi (#40736) · b8236b7b
      hong 提交于
      * move slice to pten
      
      * merge develop; test=develop
      
      * fix slice bug;
      
      * update
      
      * update
      
      * fix error
      
      * update
      
      * fix bug
      
      * polish code
      
      * polish code
      
      * polish code
      
      * try to fix windows bug
      
      * add gpu compile flag;
      
      * try to fix
      
      * remov template;
      
      * polish code;
      
      * fix npu bug;
      
      * fix npu bug
      
      * fix npu bug; test=develop
      
      * fix slice bug;
      
      * remove no need dep
      b8236b7b
    • F
      Make StreamSafeCUDAAllocator compatible with NaiveBestFit strategy (#40886) · 0ad2e192
      From00 提交于
      * Make StreamSafeCUDAAllocator compatible with NaiveBestFit strategy
      
      * Set FLAGS_use_stream_safe_cuda_allocator to false
      
      * Update
      
      * Remove unnecessary code
      
      * Fix CI errors
      
      * Add UT
      0ad2e192
  2. 25 3月, 2022 2 次提交
  3. 24 3月, 2022 2 次提交
  4. 23 3月, 2022 2 次提交
  5. 22 3月, 2022 1 次提交
  6. 18 3月, 2022 1 次提交
    • C
      [Phi] move reduce_grad kernel into phi (#40522) · 70726696
      chentianyu03 提交于
      * move reduce_mean_grad kernel into phi
      
      * move reduce_max/min_grad into phi
      
      * remove raw max/min grad kernel
      
      * fix bug
      
      * fix max/min grad error
      
      * move all reduce_grad kernel into one file
      
      * add prod grad kernel
      
      * add infermeta for prod kernel
      70726696
  7. 17 3月, 2022 1 次提交
  8. 15 3月, 2022 1 次提交
    • Y
      [Phi]Move Tanh/BRelu/LeakyRelu/ThresholdedRelu Kernels to Phi (#40385) · d7112180
      YuanRisheng 提交于
      * move activation op
      
      * adjust code format
      
      * fix compile bugs
      
      * fix ci bugs
      
      * code format adjust
      
      * code format adjust2
      
      * activate ci status
      
      * modify according to comment
      
      * move activation kernel
      
      * revert relu6
      
      * reduce add code
      
      * perfect use_phi_functor
      
      * completing func name
      
      * fix bugs when run ci
      
      * fix bugs when run infr
      
      * modifpy infrt get kernel signature
      d7112180
  9. 11 3月, 2022 1 次提交
    • C
      [Phi] Reduce grad (#40263) · f452ad5c
      chentianyu03 提交于
      * add reduce_sum grad kernel
      
      * add reduce_grad
      
      * modify reduce grad
      
      * update reduce grad functions
      
      * fix build error
      
      * add argument mapping
      
      * move cast input after grad
      
      * add dims.size=1 cpu reduce_sum grad compute method
      
      * update reduce grad GPU
      
      * remove raw reduce_sum_grad kernel
      
      * modify header files
      
      * add namespace funcs for reduce_grad_funcstions
      f452ad5c
  10. 09 3月, 2022 1 次提交
  11. 07 3月, 2022 1 次提交
  12. 03 3月, 2022 1 次提交
    • L
      Workqueue threadnames (#40035) · b8a16911
      liutiexing 提交于
      * add align for WorkQueue
      
      * add spinlock
      
      * merge develop
      
      * merge
      
      * Add EventsWaiter
      
      * Revert "Add EventsWaiter"
      
      This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.
      
      * Set thread name for WorkQueue
      
      * Add thread names
      
      * fix ut
      Co-authored-by: Nliutiexing <liutiexing@google.com>
      b8a16911
  13. 28 2月, 2022 3 次提交
  14. 21 2月, 2022 2 次提交
    • C
      [pten]rm reduce_sum and reduce_mean raw kernel (#39484) · 2bb5aae8
      chentianyu03 提交于
      * rm reduce_sum raw kernel
      
      * remove reduce_mean kernel
      
      * remove reduce_mean kernel
      
      * reduce support int and int64_t
      
      * mean support int and int64_t type
      2bb5aae8
    • C
      Update record interface using part2 (#39694) · c984cd85
      chenjian 提交于
      * fix RecordEvent interface
      
      * modify default level to 4
      
      * update interface use
      
      * add const default trace level
      
      * update record event interface using
      
      * update record event interface using
      
      * update operator.cc
      
      * update part2
      
      * update part1
      
      * fix include profiler.h header in ps server
      
      * fix include profiler.h header in ps server
      
      * fix profiler.h header
      c984cd85
  15. 20 2月, 2022 1 次提交
  16. 19 2月, 2022 1 次提交
    • Z
      [Pten] Add selected_rows kernel for Full (#39465) · 79f8eeca
      zyfncg 提交于
      * Add selected_rows kernel for full
      
      * remove fill_constant register in fluid
      
      * fix bug without GPU
      
      * add jit_kernel_helper dependency for fc
      
      * do some refactor
      
      * add unittest for ops signatures
      
      * add coverage unittest
      
      * fix merge conflict
      
      * fix full selectew_rows bug
      79f8eeca
  17. 17 2月, 2022 1 次提交
  18. 16 2月, 2022 1 次提交
    • Y
      [Pten]Remove reshape and elementwise_add's registry code in Fluid (#39317) · c6478270
      YuanRisheng 提交于
      * remove reshape and elementwise_add registry
      
      * delete code
      
      * fix bugs when run ci ut
      
      * remove log
      
      * fix bugs when run unit test
      
      * fix bugs when run unit test
      
      * fix bugs when run cinn
      
      * fix bugs when run ci-mac-python3
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix bugs when run kunlun
      
      * fix bugs when compile
      
      * update code according comment
      c6478270
  19. 15 2月, 2022 1 次提交
    • A
      [PTen]Migrate proto::VarType outside of Pten (#39411) · 7e7e9404
      Aurelius84 提交于
      * #1 migrate dist-related type()-> dtype()
      
      * move datatype function from pten -> fluid/framework
      
      * change type() in imperative into convert(dtype())
      
      * modify xx_tensor->type into xx_tensor->dtype
      
      * change the set_type interface and the caller
      
      * modify xx_tensor.type into xx_tensor.dtype
      
      * fix mutable_data(place, dtype())
      
      * change caller of mutable_data in pten and distributed
      
      * change the caller of mutable_data in fluid/framework
      
      * change the caller of mutable_data in imperative directory
      
      * mutable_data: inference
      
      * update the call of mutable_data
      
      * transfer MakePenScalarArray MakePtenScalar ResetHolderWithType
      
      * pass the compile. the next step is remove VarType in Pten
      
      * fix all and remove VarType from pten. success in linux. Next task is other platform
      
      * fix conflict with develop
      
      * fix compiled error
      
      * Fix reset conversion
      
      * fix conflict
      
      * fix compiled problem
      
      * fix typo
      
      * Fix << in tensor_utils.cc
      
      * fix type->dtype
      
      * fix unittest
      
      * fix tensor init constructor
      
      * fix DataTypeSize for BFloat16
      
      * fix code style
      
      * fix npu compiled error
      
      * fix npu
      
      * compile npu sucessfully
      
      * fix conflict
      
      * fix conflict
      Co-authored-by: Nxiongkun <xiongkun03@baidu.com>
      7e7e9404
  20. 14 2月, 2022 2 次提交
  21. 11 2月, 2022 2 次提交
  22. 08 2月, 2022 1 次提交
  23. 30 1月, 2022 1 次提交
  24. 28 1月, 2022 2 次提交
  25. 26 1月, 2022 1 次提交
    • L
      [pten] remove deprecated fluid op kernel for pten (#38842) · 3ab9aef1
      Leo Chen 提交于
      * update cmake file to remove fluid kernel
      
      * add pten declaration.h to where pybind.h used
      
      * fix sync_bn and tensorrt_engine
      
      * refine detection_library
      
      * fix interpreter_core
      
      * support eager legacy
      
      * fit eager legacy for pten
      
      * fall back to cpu if not found kernel
      
      * fix compile problem
      
      * fix compile problem
      
      * refine fallback logic
      
      * fit operator.run()
      
      * fix xpu compile
      
      * fit for new_exec
      
      * add REGISTER_OP_WITHOUT_GRADIENT
      
      * un-cache pt_kernel_context
      
      * fix compile
      
      * fix cudnn
      
      * fix compiling with on_infer
      
      * fix mkldnn
      
      * fix isfinite_v2
      
      * fix xpu problem
      
      * fix op_device
      
      * refine fallback for xpu
      
      * fix xpu compile
      
      * merge develop
      
      * refine code format
      
      * fix compile
      
      * fix compile
      
      * add data_transfer
      
      * fix PreparePtenData
      
      * fix cpu context
      
      * merge develop
      
      * fix compile
      
      * fix error device context
      
      * fix xpu
      
      * fix dev_ctx
      3ab9aef1
  26. 25 1月, 2022 1 次提交
    • W
      [Move selected_rows PR #3] Change the relationship of [include/Cmake]. (#39128) · 2bafd338
      Weilong Wu 提交于
      * Added selected_rows and rw_lock to pten
      
      * Renamed the unit test target to fix CI
      
      * Removed Class SelectedRows in Fluid, changed include/cmake relationship, use pten::SelectedRows in Fluid
      
      * Remove rw_lock.h,rw_lock_test.cc in fluid
      
      * Use pten::RWLock and pten::AutoRDLock, fix CI
      
      * Use pten::SelectedRows
      
      * Use pten::SelectedRows
      
      * Fix to pass NPU CI
      
      * Use pten::SelectedRows, to pass NPU CI
      
      * To fix NPU CI
      
      * To fix NPU CI again
      2bafd338
  27. 17 1月, 2022 1 次提交
    • W
      [Pten] Replace platform::Place to pten::Place. (#38899) · c48a9ad5
      Wilber 提交于
      * add pten::Place data structure.
      
      * update ci problem
      
      * fix ci problem
      
      * update
      
      * using platform::Place=pten::Place
      
      * remove BOOST_GET_CONST for CPUPlace and GPUPlace
      
      * compile pass 25%.
      
      * compile pass 45%
      
      * compile pass 60%
      
      * remove boost_get for xpu npu mlu and ipu
      
      * compile pass on cpu and gpu.
      
      * fix compile problem
      
      * fix compile error.
      
      * update
      
      * fix ci problem
      
      * update
      
      * ci approve
      
      * fix ci problem
      
      * fix ci eager test problem
      
      * remove BOOST_GET_CONST
      
      * fix npu compile
      c48a9ad5
  28. 15 1月, 2022 1 次提交
  29. 13 1月, 2022 1 次提交
  30. 10 1月, 2022 1 次提交
    • L
      Profiler skeleton (#38826) · a8afed69
      liutiexing 提交于
      * add align for WorkQueue
      
      * add spinlock
      
      * merge develop
      
      * merge
      
      * Add EventsWaiter
      
      * Revert "Add EventsWaiter"
      
      This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.
      
      * profiler skeleton
      
      * update
      
      * update
      
      * update
      Co-authored-by: Nliutiexing <liutiexing@google.com>
      a8afed69