1. 26 Apr, 2022 1 commit
    • C
      [Cherry-pick] Optimize dygraph performance part2 (#42224) · ab24b9c0
      Chen Weihang committed
      * Add paddle::variant and replace paddle::any (#42139)
      
      * add variant and replace any
      
      * split attribute
      
      * Optimize dygraph GetExpectedKernelType perf (#42154)
      
      * opt dygraph scheduling
      
      * revert part impl
      
      * fix variant compile error (#42203)
      
      * replace any by variant in infermeta (#42181)
  2. 25 Apr, 2022 1 commit
  3. 21 Apr, 2022 1 commit
  4. 20 Apr, 2022 1 commit
    • Y
      [Phi] Support construct Scalar by using Non-CPU Tensor (#41765) (#41963) · 3b25afb2
      YuanRisheng committed
      * support construct scalar using non-cpu tensor
      
      * fix bugs when run unittest
      
      * fix compile bugs
      
      * fix bugs when run ci
      
      * fix compile bugs
      
      * fix bugs when move copy
      
      * perfect unit test
      
      * perfect unittest
      
      * update according to comment
      
      * add target dependency
      
      * deal with conflict
      
      * fix bugs when run unit test
      
      * fix unit test bugs
  5. 18 Apr, 2022 2 commits
    • J
      Add eager string tensor (#41039) (#41839) · 623f8308
      Jack Zhou committed
      * Add core.eager.StringTensor __init__ that accepts pyarray args
      
      * Add the numpy method of core.eager.StringTensor
      
      * revert tensor.to_string modification
      
      * Add ToPyObject for core.eager.StringTensor
      
      * Add debug string for core.eager.StringTensor
      
      * Remove place args of core.eager.StringTensor temporarily
      
      * Fix check string_tensor error
      
      * remove dtype of core.eager.StringTensor
      
      * add core.eager.StringTensor unittest
      
      * remove pstring from VarDesc
      
      * Add InitStringTensorWithStringTensor
      
      * Remove to_string modification
      
      * Remove zero_copy arg from StringTensor creator
    • C
      [Cherry-pick] Organize the API of custom operators (#41882) · 897911fc
      Chen Weihang committed
      * [Phi&CustomOp] Remove deprecated enum PlaceType for custom op & add warning (#41647)
      
      * remove old custom op placetype
      
      * replace dist placetype usage
      
      * add with gpu macro
      
      * fix mutable_data error
      
      * fix set value error
      
      * add comment
      
      * remove all is initialized usage (#41766)
      
      * remove inner_place usage (#41768)
      
      * polish tensor deprecated method warning (#41807)
      
      * [CustomOp] Fix PlaceType related compat error (#41826)
      
      * fix place type related compat error
      
      * fix test failed
      
      * remove dll decl
      
      * revert place type change
      
      * add dll decl
      
      * resolve conflict
  6. 14 Apr, 2022 1 commit
  7. 11 Apr, 2022 2 commits
    • A
      fix running error for ipu (#41533) · 664199aa
      Allen Guo committed
      cherry-pick from #41481
    • C
      [Cherry-pick] Add truncated_normal/unique/swish/unbind yaml and polish Getting tensor place impl (#41539) · b2e095c4
      Chen Weihang committed
      
      * [Phi] Polish truncated normal kernel and add yaml (#41280)
      
      * polish truncated normal kernel
      
      * add yaml
      
      * add truncated normal kernel and add yaml
      
      * polish unittests and yaml
      
      * import dygraph method
      
      * add unique yaml and final state api (#41460)
      
      * fix get tensor backend set bug (#41478)
      
      * [Phi] Add unbind yaml and final state api (#41277)
      
      * add unbind yaml
      
      * fix unittest
      
      * [Phi] Add swish yaml and final state api (#41479)
      
      * add swish yaml and final state api
      
      * skip mkldnn test
      
      * fix grad mkldnn test
      
      * add cherry-pick lost code
  8. 04 Apr, 2022 1 commit
  9. 02 Apr, 2022 1 commit
  10. 01 Apr, 2022 5 commits
    • W
      [Eager] Support pinned (#41035) · f3270fc8
      wanghuancoder committed
      * support pinned, test=develop
      
      * support async_write, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine,test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
    • C
      [Phi] Move softmax with cross entropy kernel into phi (#40832) · e6ec98fe
      Chen Weihang committed
      * add cross_entropy_with_softmax phi kernel
      
      * remove softmax_with_cross_entropy kernel
      
      * add softmax_with_cross_entropy grad kernel
      
      * remove original op kernel
      
      * refine cross entropy impl
      
      * fix pointer error
      
      * revert kernel cu change
      
      * fix xpu failed
      
      * fix cinn failed
      
      * fix npu failed
      
      * add forward sig
      
      * add check_nan_inf for pt kernel
      
      * remove repeat cmake item
      
      * fix unittest error
    • C
      [Phi]Interpolate kernels into phi (#40855) · d65a7a46
      chentianyu03 committed
      * add interpolate cpu kernel
      
      * fix nullptr bug
      
      * add interpolate gpu kernel
      
      * fix unit test error
      
      * remove raw kernels
      
      * add cuda kernel impl
      
      * add infermeta
      
      * recover accidentally deleted kernels in interpolate op
      
      * fix grad x_grad name error
      
      * remove interpolate_v2_op.h
      
      * rm unused codes
      
      * fix xpu build error
      
      * fix build error
      
      * fix namespace error
      
      * add register header for npu
      
      * fix infermeta error
      
      * modify by review
      
      * add the missing args in test_trt_convert_nearest_interp_v2
    • L
      [KP] fix bug in activation xpu kp kernel (#41219) · 705776ca
      Liu-xiandong committed
      * fix bug in activation xpu kp kernel
      
      * delete useless comment
    • Z
      3a29e4f8
  11. 31 Mar, 2022 3 commits
  12. 30 Mar, 2022 1 commit
    • P
      support view strategy in dygraph eager_final state (#40891) · 495ca4aa
      pangyoki committed
      * support view strategy in eager_final state
      
      * perfect reshape kernel
      
      * fix bugs of sig
      
      * add unittest for reshape_sig
      
      * fix bugs when run coverage
      
      * fix inplace bug in final_state eager_gen
      
      * fix python_c_gen
      
      * support view strategy for final state
      
      * fix order of out and xshape in reshape
      
      * fix Coverage_CI unittest timeout error
      
      * support reshape view
      
      * fix reshape_sig
      
      * fix yml and api_base
      Co-authored-by: YuanRisheng <yuanrisheng@baidu.com>
  13. 27 Mar, 2022 1 commit
    • J
      Add StringTensor (#39830) · 0695e1ac
      Jack Zhou committed
      * add string tensor and case convert kernels
      
      * Add strings empty kernel; Reorganize the structure of case convert kernel
      
      * Add string infermeta
      
      * Update mutable_data of string tensor
      
      * rename kernel name
      
      * add string copy tmp
      
      * Fix strings copy device bug
      
      * add utf8 gpu converter
      
      * add string tensor c++ api
      
      * Remove mutable_data of string tensor
      
      * update string tensor interface
      
      * remove charcases_flag.h
      
      * remove some fluid headers
      
      * Add make_ddim
      
      * __HIPCC__ -> PADDLE_WITH_HIP
      
      * remove fluid headers
      
      * fix cpu compile
      
      * remove std::hash
      
      * Fix cudaMalloc
      
      * Remove strings/impl directory
      
      * Fix infrt/get_phi_kernel_info.py;Add custom_kernels deps
      
      * Add empty kernel test
      
      * Remove some comments
      
      * Modify lower/upper api encoding type: string->bool
      
      * STRING->PSTRING; Add CreateInferLikeMeta
      
      * Add code gen for C++ String API
      
      * remove strings_api_utils.h
      
      * Add ignore file (strings_api.h, strings_api.cc)
      
      * update strings gen script
      
      * change args order of case convert kernels
      
      * Add comments for pstring, StringTensor
      
      * cpstring_internal.h -> cpstring_impl.h
      
      * Update according to comments:
      
      1. Remove fluid headers
      2. paddle::platform::errors -> phi::errors
      3. Use 'place.GetType() == phi::AllocationType::GPU' instead of 'paddle::platform::is_cpu_space()'
      4. Use camel code style
      
      * Remove all singletons in strings kernels
      
      * fix rocm compile
      
      * Fix py3 compile
      
      * Fix c++ coverage
      
      * 1. Add pstring proto type
      2. Add StringTensor debug info
      3. Rename case_convert_kernel to strings_lower_upper
      4. Remove serialize deserialize strings kernel
      
      * DataLayout::PSTRING -> DataLayout::PSTRING_UNION
      
      * Register pstring data type
      
      * Fix strings api gen
      
      * Fix dense tensor register pstring dtype
      
      * Fix error messages
      
      * remove line
      
      * add pstring unittest
      
      * remove test string api unittest
      
      * remove empty line
      
      * Remove some headers to decrease the size of executable file
  14. 25 Mar, 2022 1 commit
  15. 24 Mar, 2022 2 commits
  16. 23 Mar, 2022 1 commit
  17. 22 Mar, 2022 1 commit
  18. 21 Mar, 2022 1 commit
    • H
      Add yaml config part0 (#40020) · cc853e95
      hong committed
      * add add yaml
      
      * add elementwise add yaml; test=develop
      
      * add norm
      
      * update
      
      * add some yaml config; test=develop
      
      * fix bug; test=develop
      
      * fix compare error; test=develop
      
      * revert eager_gen.py
      
      * update; test=deveop
      
      * remove useless code; test=deveop
      
      * fix bug; test=develop
      
      * fix test error; test=develop
      
      * remove int_type; test=develop
      
      * fix type error; test=develop
      
      * format; test=develop
      
      * remove type register; test=develop
      
      * polish code; test=develop
      
      * fix ci error; test=develop
  19. 18 Mar, 2022 2 commits
  20. 17 Mar, 2022 2 commits
    • C
      [Phi] Move assign kernel into phi (#40022) · 1904572a
      Chen Weihang committed
      * move assign kernel init commit
      
      * change vec<tensor> to vec<tensor*>
      
      * support tensor array
      
      * support api declare
      
      * fix test_list failed
      
      * fix npu and xpu failed
      
      * fix infrt failed
      
      * remove assign array size in operator
      
      * move assign sr header into sr dir
      
      * add infermeta for assign
      
      * test op success
      
      * fix test_list failed
      
      * fix kunlun failed
      
      * add set host allocator in tests
      
      * support tensor array in arg ctx
      
      * open set layout in share_meta
      
      * fix meta tensor layout error
      
      * fix test failed
    • Z
      Optimize the performance of C++ API (#40640) · add304ed
      zyfncg committed
      * Optimize performance
      
      * optimize c++ api performance
      
      * remove unused code
      
      * fix paddle throw
      
      * update format
  21. 15 Mar, 2022 2 commits
    • C
      [Phi]move reduce_min/any/all kernel (#40374) · c46e661d
      chentianyu03 committed
      * add reduce_min kernel
      
      * remove raw reduce_min kernel
      
      * add reduce min
      
      * add reduce any all impl
      
      * add bool reduce Kernel
      
      * remove raw any/all kernel
      
      * add any all kernel
      
      * rm comment
    • H
      Move one hot to phi (#39876) · 7701db37
      hong committed
      * move one hot to phi; test=develop
      
      * fix bugs; test=develop
      
      * fix bugs; test=develop
      
      * add infer meta; test=develop
      
      * fix bugs; test=develop
      
      * resolve conflict
      
      * resolve conflict
      
      * fix bug;
      
      * fix error; test=develop
      
      * update; test=develop
      
      * polish code; test=develop
      
      * add one api in eager mode; test=develop
      
      * add one hot test; test=develop
      
      * remove useless code; test=develop
      
      * fix bug; test=develop
      
      * polish code; test=develop
      
      * polish code; test=develop
  22. 14 Mar, 2022 1 commit
  23. 13 Mar, 2022 1 commit
  24. 12 Mar, 2022 1 commit
  25. 11 Mar, 2022 3 commits
  26. 10 Mar, 2022 1 commit
    • Z
      [Phi]Move topk kernel to phi (#40064) · 329b095e
      Zhang Zheng committed
      * first commit
      
      * cpu kernel
      
      * first version
      
      * fix compile error
      
      * fix compile error
      
      * delete v2
      
      * fix
      
      * fix
      
      * add alias
      
      * fix
      
      * fix
      
      * fix
      
      * fix error
      
      * fix
      
      * fix
      
      * fix
      
      * fix format