1. 28 8月, 2023 2 次提交
    • G
      【inplace api】Batch add inplace api gt_, ge_, lt_, le_, eq_, not_equal_,... · c5fc413a
      GGBond8488 提交于
      【inplace api】Batch add inplace api gt_, ge_, lt_, le_, eq_, not_equal_, logical_and_, logical_or_, logical_xor_, logical_not_, divide_, floor_divide_, bitwise_and_ , bitwise_or_, bitwise_xor_, bitwise_not_ (#55509)
      
      * tmp commit
      
      * add atan2
      
      * add inplace api
      
      * fix error
      
      * add inpalce divide
      
      * add inplace api
      
      * add more inplace
      
      * add more inpalce
      
      * fix logical_not error
      
      * support sinh and cosh in cpu
      
      * support asin, acos, atan, asinh, acosh, atanh in cpu
      
      * fix typro
      
      * fix typro
      
      * mv out atan2 ldexp
      
      * mv out atan2 ldexp
      
      * support sinh and cosh in gpu
      
      * support asin, acos, atan, asinh, acosh, atanh in gpu
      
      * fix ge error
      
      * fix dygraph commpare error
      
      * fix dygraph commpare error
      
      * check complex in python
      
      * fix cast inpalce error
      
      * open inplace test
      
      * fix ops.yaml error
      
      * mv cast inpalce to python
      
      * fix coverage ci
      
      * add last inplace
      
      * fix inplace error
      
      * fix cast error
      
      * fix error
      
      * add nan_to_num_
      
      * fix typro
      
      * fix sparse cast error
      
      * remove gpu 4
      
      * fix static cast error
      
      * tmp commit
      
      * add atan2
      
      * add inplace api
      
      * fix error
      
      * add inpalce divide
      
      * add inplace api
      
      * add more inplace
      
      * add more inpalce
      
      * fix logical_not error
      
      * fix typro
      
      * fix typro
      
      * mv out atan2 ldexp
      
      * mv out atan2 ldexp
      
      * fix ge error
      
      * fix dygraph commpare error
      
      * fix dygraph commpare error
      
      * fix cast inpalce error
      
      * open inplace test
      
      * fix ops.yaml error
      
      * mv cast inpalce to python
      
      * fix coverage ci
      
      * add last inplace
      
      * fix inplace error
      
      * fix cast error
      
      * fix error
      
      * add nan_to_num_
      
      * fix typro
      
      * fix sparse cast error
      
      * remove gpu 4
      
      * fix static cast error
      
      * fix cast error
      
      * fix
      
      * Revert "check complex in python"
      
      This reverts commit c822064261d774dd58ad46a4f90ba8b467700a05.
      
      * add renorm , fix error
      
      * add coverage
      
      * fix cumsum inpalce version error
      
      * add cast inpalce impl
      
      * rm test.log
      
      * fix multiply_dyfunction and add multiply_backward test
      
      * add and use is_same_tensor
      
      * fix typro
      
      * fix sone error
      
      * fix typro
      
      ---------
      Co-authored-by: NScotty <jmhgchn@gmail.com>
      Co-authored-by: NScotty <527407973@qq.com>
      c5fc413a
    • C
      [AutoParallel] Simplify PADDLE_WITH_DISTRIBUTE marco using (#56361) · 62c78e26
      Chen Weihang 提交于
      * simplify with dist marco
      
      * polish error message format
      
      * fix vtable error
      
      * fix cmake error
      
      * fix winsock redefined error
      
      * fix windows compile error
      
      * fix windows conpile failed
      
      * fix merge error
      
      * fix vec compile error
      
      * add port.h into test_cpu_vec
      
      * fix merge error
      
      * try to fix winsock error
      62c78e26
  2. 22 8月, 2023 1 次提交
    • C
      [AutoParallel] Polish dist tensor design (#56368) · 8495377a
      Chen Weihang 提交于
      * polish dist teensor design
      
      * adjust constructor
      
      * polish details
      
      * polish details design
      
      * fix compile error
      
      * refactor init tensor impl
      
      * fix reshard test
      
      * polish details
      
      * add unittest for coverage
      8495377a
  3. 16 8月, 2023 1 次提交
    • C
      [AutoParallel] Dygraph basic impl for semi auto parallel (#55698) · 7039bef3
      Chen Weihang 提交于
      * add phi forward api gen impl
      
      * add phi backward gen code
      
      * polish api code gen impl
      
      * polish code gen impl
      
      * remove auto_paralel namespace
      
      * add dygraph forward impl
      
      * add for_auto_parallel cond
      
      * fix code gen errors
      
      * add dygraph backward impl
      
      * resolve conflict with develop
      
      * refactor dist api gen impl
      
      * revert origin api gen impl
      
      * replace template for override func
      
      * fix dnnl marco error
      
      * revert third_party change
      
      * add with distributed marco
      
      * Update grad_tensor_holder.cc details
      
      * merge dist tensor constructor
      
      * change test tensor to replicate
      
      * fx typo
      
      * resolve conflict with develop
      
      * fix out dim error
      7039bef3
  4. 31 7月, 2023 1 次提交
  5. 12 7月, 2023 1 次提交
  6. 22 9月, 2022 1 次提交
  7. 19 9月, 2022 2 次提交
  8. 16 8月, 2022 1 次提交
    • C
      [Phi] Move amp ops into phi (#45079) · b4f67757
      Chen Weihang 提交于
      * move check finite and unscale kernel into phi
      
      * move infershape into phi
      
      * move update_loss_scaling kernel into phi
      
      * remove original kernels
      
      * move update loss scaling infershape into phi
      
      * add header for xpu and npu
      
      * solve coverage failed
      
      * fix npu test failed
      
      * remove mutable data in cu file
      
      * fix new executor failed
      
      * add valid check for meta tensor output
      b4f67757
  9. 12 7月, 2022 1 次提交
  10. 17 4月, 2022 1 次提交
    • C
      [Perf] Optimize dygraph scheduling performance (#41696) · 7ee31a96
      Chen Weihang 提交于
      * split phi and fluid infermeta context
      
      * resolve conflict
      
      * fix type error
      
      * optimize scheduling perf
      
      * spec small vector size
      
      * replace all grad var name
      
      * fix test failed
      
      * move init defalut signature
      
      * polish details
      
      * polish details
      
      * fix no init bug
      
      * init sig for tests
      
      * add init sig for infer
      
      * fix infrt error
      
      * fix infrt failed
      
      * fix kunlun error
      
      * fix infrt failed
      7ee31a96
  11. 27 3月, 2022 1 次提交
    • J
      Add StringTensor (#39830) · 0695e1ac
      Jack Zhou 提交于
      * add string tensor and case convert kernels
      
      * Add strings empty kernel; Reorganize the structure of case convert kernel
      
      * Add string infermeta
      
      * Update mutable_data of string tensor
      
      * rename kernel name
      
      * add string copy tmp
      
      * Fix strings copy device bug
      
      * add utf8 gpu converter
      
      * add string tensor c++ api
      
      * Remove mutable_data of string tensor
      
      * update string tensor interface
      
      * remove charcases_flag.h
      
      * remove some fluid headers
      
      * Add make_ddim
      
      * __HIPCC__ -> PADDLE_WITH_HIP
      
      * remove fluid headers
      
      * fix cpu compile
      
      * remove std::hash
      
      * Fix cudaMalloc
      
      * Remove strings/impl directory
      
      * Fix infrt/get_phi_kernel_info.py;Add custom_kernels deps
      
      * Add empty kernel test
      
      * Remove some comments
      
      * Modify lower/upper api encoding type: string->bool
      
      * STRING->PSTRING; Add CreateInferLikeMeta
      
      * Add code gen for C++ String API
      
      * remove strings_api_utils.h
      
      * Add ignore file (strings_api.h, strings_api.cc)
      
      * update strings gen script
      
      * change args order of case convert kernels
      
      * Add comments for pstring, StringTensor
      
      * cpstring_internal.h -> cpstring_impl.h
      
      * Update accordding to comments:
      
      1. Remove fluid headers
      2. paddle::platform::errors -> phi::errors
      3. Use 'place.GetType() == phi::AllocationType::GPU' instead of 'paddle::platform::is_cpu_space()'
      4. Use camel code style
      
      * Remove all singletons in strings kernels
      
      * fix rocm compile
      
      * Fix py3 compile
      
      * Fix c++ coverage
      
      * 1. Add pstring proto type
      2. Add StringTensor debug info
      3. Rename case_convert_kernel to strings_lower_upper
      4. Remove serialize derialize strings kernel
      
      * DataLayout::PSTRING -> DataLayout::PSTRING_UNION
      
      * Register pstring data type
      
      * Fix strings api gen
      
      * Fix dense tensor register pstring dtype
      
      * Fix error messages
      
      * remove line
      
      * add pstring unittest
      
      * remove test string api unitest
      
      * remove empty line
      
      * Remove some headers to decrease the size of executable file
      0695e1ac
  12. 15 3月, 2022 1 次提交
    • H
      Move one hot to phi (#39876) · 7701db37
      hong 提交于
      * move one hot to phi; test=develop
      
      * fix bugs; test=develop
      
      * fix bugs; test=develop
      
      * add infer meta; test=develop
      
      * fix bugs; test=develop
      
      * resolve confilct
      
      * resolve confilct
      
      * fix bug;
      
      * fix error; test=develop
      
      * update; test=develop
      
      * polish code; test=develop
      
      * add one api in eager mode; test=develop
      
      * add one hot test; test=develop
      
      * remove use less code; test=develop
      
      * fix bug; test=develop
      
      * polish code; test=develop
      
      * polish code; test=develop
      7701db37
  13. 13 3月, 2022 1 次提交
  14. 08 3月, 2022 1 次提交
  15. 20 2月, 2022 1 次提交
  16. 15 2月, 2022 1 次提交
    • Z
      [Pten] Support SelectedRows in C++ API (#39497) · 5bb3b668
      zyfncg 提交于
      * add data_transform in pten api
      
      * support GetKernelTypeForVar
      
      * fix complie problem of bfloat16
      
      * add scale_sr in api
      
      * suppport select_row in C++ api
      
      * merge code
      5bb3b668
  17. 30 1月, 2022 1 次提交
    • C
      [PTen] Change all InferMeta functions (#39222) · 7e29cea9
      Chen Weihang 提交于
      * change unary infermeta
      
      * change other infermeta
      
      * change all infermeta format
      
      * resolve conflit
      
      * fix test failed
      
      * resolve reshape conflit
      
      * fix compile failed
      
      * adapt auto api gen
      
      * fix reshape failed
      
      * fix concat failed
      
      * resolve conflict
      7e29cea9
  18. 29 1月, 2022 1 次提交
    • C
      [PTen] Tidy pten core headers (#39188) · dd990981
      Chen Weihang 提交于
      * open header for custom kernel
      
      * add core utils
      
      * tidy core code
      
      * tify header
      
      * tidy include
      
      * tidy namespace
      
      * resolve conflit
      
      * fix unittest and coverage
      
      * remove platform using
      
      * resolve conflict
      
      * resolve conflict
      
      * fix digamma namespace error
      
      * fix xpu full kernel error
      
      * fix xpu full kernel error
      
      * polish details
      
      * add place for lib storage
      dd990981
  19. 26 1月, 2022 1 次提交
    • C
      [PTen] Unify InferMeta(Shape) Function in pten and fluid op (#38976) · b75507d3
      Chen Weihang 提交于
      * infermeta context init design
      
      * support infermeta called in fluid op
      
      * add hasattr and attr methods
      
      * add dygraah GetVarPtrs support
      
      * rename arg_map_context to arg_map_utils
      
      * add registry for arg map func
      
      * resolve conflit
      
      * refactor op utils design
      
      * polish meta config
      
      * fix details
      
      * remove hasattr method
      
      * resolve conflit
      
      * revert cmake order change
      
      * revert some change
      
      * change init pos
      
      * fix compile faileed
      
      * fix typo
      
      * fix inference failed
      
      * fix windows ccompile failed
      
      * polish format
      Co-authored-by: NWang Huan <wanghuan29@baidu.com>
      b75507d3
  20. 22 1月, 2022 1 次提交