1. 13 4月, 2022 1 次提交
  2. 03 4月, 2022 1 次提交
    • H
      Add infer meta (#41054) · 868a3203
      hong 提交于
      * add some infer meta
      
      * fix bug
      
      * fix bugs;
      
      * fix bug and add set data type
      
      * revert infer shape of lookup table
      
      * recover test
      868a3203
  3. 28 3月, 2022 1 次提交
    • H
      Move meshgrid to phi (#40994) · ca871957
      hong 提交于
      * move momentum, rmsprop to phi; test=develop
      
      * update
      
      * update
      
      * update
      
      * update
      
      * udpate; test=develop
      
      * fix xpu npu bugs; test=develop
      
      * fix npu bug; test=develop
      
      * fix windows compile error; test=develop
      
      * fix windows compile error; test=develop
      
      * polish code; test=develop
      
      * fix conflict; test=develop
      
      * add meshgrid;
      
      * update
      
      * polish code
      
      * polish code;
      
      * fix bug
      
      * format; remove useless code
      
      * fix npu bug
      
      * fix bug
      ca871957
  4. 25 3月, 2022 3 次提交
    • D
      fix lars optitmizer bug (#40892) · c006a609
      duanboqiang 提交于
      * fix lars optitmizer bug
      
      * Update optimizer.py
      c006a609
    • A
      [Phi] Migrate Adam and AdamW into Phi (#40351) · 56cd3407
      Aurelius84 提交于
      * [Phi] Migrate Adam and Adamw into Phi
      
      * fix compile error and unittest ok
      
      * fix compile error and unittest ok
      
      * fix undefined reference to fLI::FLAGS
      
      * test depend on operator
      
      * fix cmake
      
      * fix xpu compile
      
      * fix infrt
      
      * fix amp_type_traits
      
      * fix amp_type_traits
      
      * modify according reviewer
      
      * modify according reviewer
      
      * fix dtype float16
      
      * fix typo
      
      * fix Cmake
      
      * fix code style
      56cd3407
    • A
      [NPU] add merged_momentum (#40875) · 2b74b739
      Aganlengzi 提交于
      * [NPU] add merged_momentum
      
      * fix
      
      * fix device
      2b74b739
  5. 14 3月, 2022 1 次提交
  6. 07 3月, 2022 2 次提交
  7. 04 3月, 2022 1 次提交
  8. 02 3月, 2022 2 次提交
  9. 01 3月, 2022 1 次提交
  10. 25 2月, 2022 1 次提交
  11. 22 2月, 2022 1 次提交
    • X
      change Vector to std::vector and provide MixVector class as a helper … (#39559) · 728c0624
      xiongkun 提交于
      * change Vector to std::vector and provide MixVector class as a helper wrapper class
      
      * solve the multi-gpu hang problem
      
      * remove the duplicate template instantialize
      
      * Copy vector to cpu
      
      * add CopyToCPU
      
      * xxx
      
      * final version: fix the problem of all reduce
      
      * remove mixvector dependence
      
      * fix
      
      * merge
      
      * fix code
      
      * fix by CI
      728c0624
  12. 21 2月, 2022 1 次提交
  13. 20 2月, 2022 1 次提交
  14. 19 2月, 2022 2 次提交
    • A
      [Pten]Unify paddle/pten::framework::ddim into pten::ddim (#39614) · 2fe04264
      Aurelius84 提交于
      * Unify paddle/pten::framework::ddim into pten::ddim
      
      * fix paddle namespace
      
      * compile sucessfully
      
      * fix npu src file
      
      * fix conflict
      
      * fix conflict
      
      * fix tensorrt compiler error
      
      * fix conflict
      
      * fix conflict
      
      * fix tesst file conflict
      
      * fix conflict
      
      * fix mlu file conflict
      
      * fix mlu file conflict
      
      * fix cinn header file conflict
      
      * fix conflict
      
      * fix conflict
      
      * fix conflict
      
      * fix conflict
      2fe04264
    • S
      Add the DistributedFusedLamb optimizer (#39148) · 5df3cd61
      sneaxiy 提交于
      * add DistributedFusedLamb op
      
      * polish code
      
      * fix compile error
      
      * compatible with pten changement
      
      * fix rocm compile error
      
      * improve converage
      
      * update upstream/develop
      
      * fix cast_with_ptr.h
      
      * add FLAGS_distributed_lamb_divide_nranks_when_allreduce=1
      
      * fix clip before allreduce
      
      * add use_master_param_norm
      
      * code polish
      
      * fix bug
      
      * fix ROCM ci
      5df3cd61
  15. 15 2月, 2022 2 次提交
    • F
      move algorithm.h (#39502) · 7eb9593e
      Feiyu Chan 提交于
      Move paddle/fluid/operators/math/algorithm.h to paddle/pten/kernels/funcs and rename all references to symbols in it.
      7eb9593e
    • A
      [PTen]Migrate proto::VarType outside of Pten (#39411) · 7e7e9404
      Aurelius84 提交于
      * #1 migrate dist-related type()-> dtype()
      
      * move datatype function from pten -> fluid/framework
      
      * change type() in imperative into convert(dtype())
      
      * modify xx_tensor->type into xx_tensor->dtype
      
      * change the set_type interface and the caller
      
      * modify xx_tensor.type into xx_tensor.dtype
      
      * fix mutable_data(place, dtype())
      
      * change caller of mutable_data in pten and distributed
      
      * change the caller of mutable_data in fluid/framework
      
      * change the caller of mutable_data in imperative directory
      
      * mutable_data: inference
      
      * update the call of mutable_data
      
      * transfer MakePenScalarArray MakePtenScalar ResetHolderWithType
      
      * pass the compile. the next step is remove VarType in Pten
      
      * fix all and remove VarType from pten. success in linux. Next task is other platform
      
      * fix conflict with develop
      
      * fix compiled error
      
      * Fix reset conversion
      
      * fix conflict
      
      * fix compiled problem
      
      * fix typo
      
      * Fix << in tensor_utils.cc
      
      * fix type->dtype
      
      * fix unittest
      
      * fix tensor init constructor
      
      * fix DataTypeSize for BFloat16
      
      * fix code style
      
      * fix npu compiled error
      
      * fix npu
      
      * compile npu sucessfully
      
      * fix conflict
      
      * fix conflict
      Co-authored-by: Nxiongkun <xiongkun03@baidu.com>
      7e7e9404
  16. 11 2月, 2022 1 次提交
  17. 09 2月, 2022 2 次提交
  18. 07 2月, 2022 1 次提交
  19. 27 1月, 2022 1 次提交
  20. 25 1月, 2022 1 次提交
    • W
      [Move selected_rows PR #3] Change the relationship of [include/Cmake]. (#39128) · 2bafd338
      Weilong Wu 提交于
      * Added selected_rows and rw_lock to pten
      
      * Renamed the unit test target to fix CI
      
      * Removed Class SelectedRows in Fluid, changed include/cmake relationship, use pten::SelectedRows in Fluid
      
      * Remove rw_lock.h,rw_lock_test.cc in fluid
      
      * Use pten::RWLock and pten::AutoRDLock, fix CI
      
      * Use pten::SelectedRows
      
      * Use pten::SelectedRows
      
      * Fix to pass NPU CI
      
      * Use pten::SelectedRows, to pass NPU CI
      
      * To fix NPU CI
      
      * To fix NPU CI again
      2bafd338
  21. 24 1月, 2022 2 次提交
    • F
      [Pten] Migration of eigen numeric extensions and functors in paddle/fluid/operatos/eigen (#39124) · a1e40dc6
      Feiyu Chan 提交于
      * migration of functors in paddle/fluid/operators/eigen and paddle/fluid/platform/eigen_ext.h
      * update path of data types like float16.h in includes in extensions.h
      a1e40dc6
    • z8hanghuan's avatar
      support sparse of adam, *test=kunlun (#38483) · e106901e
      z8hanghuan 提交于
      * support sparse of adam, *test=kunlun
      
      * add pre-commit-config.yaml
      
      * support sparse of adam in KL2,*test=kunlun
      
      * support sparse of adam in KL2, *test=kunlun
      
      * modify xpu.cmake, *test=kunlun
      
      * support sparse of adam, rm some wait, *test=kunlun
      
      * support sparse of adam, rm some wait, *test=kunlun
      
      * support sparse of adam, *test=kunlun
      
      * support sparse of adam, *test=kunlun
      
      * support sparse of adam, *test=kunlun
      
      * support sparse of adam, *test=kunlun
      
      * support sparse of adam, *test=kunlun
      e106901e
  22. 21 1月, 2022 1 次提交
  23. 20 1月, 2022 1 次提交
  24. 18 1月, 2022 1 次提交
  25. 17 1月, 2022 1 次提交
  26. 10 1月, 2022 1 次提交
    • Z
      [Unify Tensors PR #5] framework::Tensor inherits from DenseTensor,test=allcases (#38632) · 5c73a6ea
      Zhanlue Yang 提交于
      * Added shared_ptr<Allocation> member & corresponding interfaces to Storage
      
      * Removed original pten::Allocation from Storage and adjusted the interfaces accordingly
      
      * Fixed issues with storage offset
      
      * Used place to malloc allocation for TensorStorage
      
      * [Unify Tensors PR #3]Ported framework::Tensor interfaces to pten::DenseTensor
      
      * Fixed issues with place
      
      * Added comments
      
      * Moved mutable_data with stream argument to DenseTensor
      
      * Added set_offset interface
      
      * Fixed CI issues,test=allcases
      
      * [Unify Tensors PR #4] Port LoDTensor interfaces to DenseTensor
      
      * Removed friend class EigenTensor/EigenMatrix/EigenVector from Tensor
      
      * Modified framework::Tensor to inherit from DenseTensor
      
      * Reverted changes too pten_layout() interface
      
      * Removed friend classes
      
      * Rearranged cfunction calls from tensor.data<void>() to tensor.data()
      
      * Fixed CI issues
      
      * Fixed lite issues
      
      * Fixed data() interface issues,test=allcases
      
      * Resolved IsInitialized() issues
      
      * Fixed ResetHolder() issues
      
      * Fixed MKLDNN & Storage issues
      
      * Resolved ShareBufferWith() issues
      
      * Fixed LoD issues
      5c73a6ea
  27. 07 1月, 2022 1 次提交
  28. 29 12月, 2021 1 次提交
  29. 28 12月, 2021 1 次提交
  30. 24 12月, 2021 1 次提交
  31. 17 12月, 2021 1 次提交
    • S
      Refine some AMP operators for BERT (#37923) · d80fe268
      sneaxiy 提交于
      * support multi precision update for LAMB
      
      * hide some api
      
      * fix ci uts
      
      * fix lamb output of dygraph
      
      * remove some changes to some PR
      
      * try to fix Py3 CI compile error
      
      * fix test_imperative_optimizer, add lars ut, add layer_norm ut
      
      * fix ut, fix format
      
      * fix ut
      
      * fix windows ci
      d80fe268
  32. 03 12月, 2021 1 次提交