1. 11 Mar, 2022 · 2 commits
    • [Phi] Remove needless deps in unittests (#40256) · 89ed57e2
      Committed by Chen Weihang
      * remove needless deps in unittests
      
      * add gpu macro
      
      * fix other unittests
      
      * fix kernel name error
      
      * fix test_prepare_op
      
      * fix failed dygraph unittests
      
      * fix gpu failed tests
      
      * fix cinn test failed
      
      * fix cinn test failed
      
      * fix dropout tests
    • [Phi] Reduce grad (#40263) · f452ad5c
      Committed by chentianyu03
      * add reduce_sum grad kernel
      
      * add reduce_grad
      
      * modify reduce grad
      
      * update reduce grad functions
      
      * fix build error
      
      * add argument mapping
      
      * move cast input after grad
      
      * add dims.size=1 cpu reduce_sum grad compute method
      
      * update reduce grad GPU
      
      * remove raw reduce_sum_grad kernel
      
      * modify header files
      
      * add namespace funcs for reduce_grad_functions
  2. 10 Mar, 2022 · 1 commit
  3. 09 Mar, 2022 · 1 commit
  4. 08 Mar, 2022 · 1 commit
  5. 07 Mar, 2022 · 2 commits
  6. 03 Mar, 2022 · 2 commits
  7. 02 Mar, 2022 · 3 commits
  8. 01 Mar, 2022 · 1 commit
  9. 28 Feb, 2022 · 3 commits
  10. 24 Feb, 2022 · 2 commits
  11. 23 Feb, 2022 · 2 commits
  12. 22 Feb, 2022 · 3 commits
  13. 21 Feb, 2022 · 2 commits
    • [pten]rm reduce_sum and reduce_mean raw kernel (#39484) · 2bb5aae8
      Committed by chentianyu03
      * rm reduce_sum raw kernel
      
      * remove reduce_mean kernel
      
      * remove reduce_mean kernel
      
      * reduce support int and int64_t
      
      * mean support int and int64_t type
    • Update record interface using part2 (#39694) · c984cd85
      Committed by chenjian
      * fix RecordEvent interface
      
      * modify default level to 4
      
      * update interface use
      
      * add const default trace level
      
      * update record event interface using
      
      * update record event interface using
      
      * update operator.cc
      
      * update part2
      
      * update part1
      
      * fix include profiler.h header in ps server
      
      * fix include profiler.h header in ps server
      
      * fix profiler.h header
  14. 20 Feb, 2022 · 1 commit
  15. 19 Feb, 2022 · 2 commits
    • [Pten]Unify paddle/pten::framework::ddim into pten::ddim (#39614) · 2fe04264
      Committed by Aurelius84
      * Unify paddle/pten::framework::ddim into pten::ddim
      
      * fix paddle namespace
      
      * compile successfully
      
      * fix npu src file
      
      * fix conflict
      
      * fix conflict
      
      * fix tensorrt compiler error
      
      * fix conflict
      
      * fix conflict
      
      * fix test file conflict
      
      * fix conflict
      
      * fix mlu file conflict
      
      * fix mlu file conflict
      
      * fix cinn header file conflict
      
      * fix conflict
      
      * fix conflict
      
      * fix conflict
      
      * fix conflict
    • fix RecordEvent interface (#39675) · 019a552b
      Committed by chenjian
      * fix RecordEvent interface
      
      * modify default level to 4
      
      * update interface use
      
      * add const default trace level
      
      * update operator.cc
  16. 18 Feb, 2022 · 4 commits
    • [Pten] blas and lapack migration (#39587) · 8c7ee8c2
      Committed by Feiyu Chan
      * move blas related files
      * move lapack related files
    • [AMP] support GPU BF16 amp for dygraph (#39029) · 7d6d3848
      Committed by zhangbo9674
      * support dtype param for auto_cast
      
      * add amp_dtype for tracer
      
      * add unsupported bf16 list
      
      * support bf16 amp for O2
      
      * refine python interface for bfloat16
      
      * refine code
      
      * refine code
      
      * refine unittest
      
      * refine code
      
      * refine code
      
      * add bf16 o1
      
      * refine code by comment
      
      * add gradient accumulator
      
      * add recompute
    • [MLU]add matmul and matmul_v2 op (#39539) · 229ec32a
      Committed by qipengh
      * [MLU]add matmul and matmul_v2 op
      
      * [MLU] fix data_type and del matmul
      
      * [MLU] fix compile error
      
      * [MLU] fix ci_check error
    • [Bug Fix]Fix gradient accumulator (#39577) · a7cbd3ef
      Committed by Jiabin Yang
      * merge legacy to fluid
      
      * Remove legacy code
      
      * Remove legacy code
      
      * Remove DataType test
      
      * Using Tensor directly instead of using EagerTensor
      
      * support gradient_accumulation
      
      * make test_imperative_lod_tensor_to_selected_rows longer
      
      * make test_imperative_lod_tensor_to_selected_rows longer
      
      * refine code
      
      * Rename all EagerTensor to Tensor
      
      * Rename some EagerTensor to Tensor
      
      * rename EagerTensor to EagerVariable
      
      * add more test
      
      * fix different device gradient_accumulator bug
      
      * merge develop
      
      * remove useless tests
  17. 16 Feb, 2022 · 3 commits
    • [MLU] fix TensorAdd for mlu (#39523) · 24b8f63e
      Committed by fwenguang
    • EagerTensor to EagerVariable (#39447) · 831fd86e
      Committed by Jiabin Yang
      * merge legacy to fluid
      
      * Remove legacy code
      
      * Remove legacy code
      
      * Remove DataType test
      
      * Using Tensor directly instead of using EagerTensor
      
      * support gradient_accumulation
      
      * make test_imperative_lod_tensor_to_selected_rows longer
      
      * make test_imperative_lod_tensor_to_selected_rows longer
      
      * refine code
      
      * Rename all EagerTensor to Tensor
      
      * Rename some EagerTensor to Tensor
      
      * rename EagerTensor to EagerVariable
      
      * add more test
      
      * merge develop and refine code
    • [Pten]Remove reshape and elementwise_add's registry code in Fluid (#39317) · c6478270
      Committed by YuanRisheng
      * remove reshape and elementwise_add registry
      
      * delete code
      
      * fix bugs when run ci ut
      
      * remove log
      
      * fix bugs when run unit test
      
      * fix bugs when run unit test
      
      * fix bugs when run cinn
      
      * fix bugs when run ci-mac-python3
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix bugs when run kunlun
      
      * fix bugs when compile
      
      * update code according to comments
  18. 15 Feb, 2022 · 4 commits
    • [PluggableDevice] Add custom runtime support (#38740) · 3e7825f3
      Committed by ronnywang
      * [CustomRuntime] Add DeviceManager
      
      * [CustomRuntime] Add DeviceInterface
      
      * [CustomRuntime] Add Stream, Event, DeviceGuard, CallbackManager
      
      * [CustomRuntime] Add plug-in device
      
      * [CustomRuntime] Memory module support PluggableDevice
      
      * [CustomRuntime] Add WITH_PLUGGABLE_DEVICE cmake option
      
      * update
      
      * [API] update API doc based on comments, test=develop
      Co-authored-by: qili93 <qili93@qq.com>
    • move histogram to pten (#39496) · 556f6eb0
      Committed by hong
      * move histogram to pten; test=develop
      
      * fix format error; test=develop
      
      * fix histogram kernel format; test=develop
    • [Eager] Support SelectedRows MergeAdd case (#39449) · 6549a041
      Committed by Weilong Wu
      * Refactor SelectedRows MergeAdd func by using template
      
      * Add GetInnerMutable func instead of modify GetInnerMutableTensor
      
      * Updated PADDLE_ENFORCE statement
      
      * Remove useless PADDLE_ENFORCE statement
      
      * Polish Code
    • [PTen]Migrate proto::VarType outside of Pten (#39411) · 7e7e9404
      Committed by Aurelius84
      * #1 migrate dist-related type()-> dtype()
      
      * move datatype function from pten -> fluid/framework
      
      * change type() in imperative into convert(dtype())
      
      * modify xx_tensor->type into xx_tensor->dtype
      
      * change the set_type interface and the caller
      
      * modify xx_tensor.type into xx_tensor.dtype
      
      * fix mutable_data(place, dtype())
      
      * change caller of mutable_data in pten and distributed
      
      * change the caller of mutable_data in fluid/framework
      
      * change the caller of mutable_data in imperative directory
      
      * mutable_data: inference
      
      * update the call of mutable_data
      
      * transfer MakePtenScalarArray MakePtenScalar ResetHolderWithType
      
      * pass the compile. the next step is remove VarType in Pten
      
      * fix all and remove VarType from pten. success in linux. Next task is other platform
      
      * fix conflict with develop
      
      * fix compiled error
      
      * Fix reset conversion
      
      * fix conflict
      
      * fix compiled problem
      
      * fix typo
      
      * Fix << in tensor_utils.cc
      
      * fix type->dtype
      
      * fix unittest
      
      * fix tensor init constructor
      
      * fix DataTypeSize for BFloat16
      
      * fix code style
      
      * fix npu compiled error
      
      * fix npu
      
      * compile npu successfully
      
      * fix conflict
      
      * fix conflict
      Co-authored-by: xiongkun <xiongkun03@baidu.com>
  19. 14 Feb, 2022 · 1 commit