1. 19 Apr, 2022 1 commit
  2. 17 Apr, 2022 1 commit
    • C
      [Perf] Optimize dygraph scheduling performance (#41696) · 7ee31a96
      Chen Weihang committed
      * split phi and fluid infermeta context
      
      * resolve conflict
      
      * fix type error
      
      * optimize scheduling perf
      
      * spec small vector size
      
      * replace all grad var name
      
      * fix test failed
      
      * move init default signature
      
      * polish details
      
      * polish details
      
      * fix no init bug
      
      * init sig for tests
      
      * add init sig for infer
      
      * fix infrt error
      
      * fix infrt failed
      
      * fix kunlun error
      
      * fix infrt failed
  3. 15 Apr, 2022 1 commit
    • J
      Add eager string tensor (#41039) · a22b68b8
      Jack Zhou committed
      * Add core.eager.StringTensor __init__ which pyarray args can be passed
      
      * Add the numpy method of core.eager.StringTensor
      
      * revert tensor.to_string modification
      
      * Add ToPyObject for core.eager.StringTensor
      
      * Add debug string for core.eager.StringTensor
      
      * Remove place args of core.eager.StringTensor temporarily
      
      * Fix check string_tensor error
      
      * remove dtype of core.eager.StringTensor
      
      * add core.eager.StringTensor unittest
      
      * remove pstring from VarDesc
      
      * Add InitStringTensorWithStringTensor
      
      * Remove to_string modification
      
      * Remove zero_copy arg from StringTensor creator
  4. 14 Apr, 2022 2 commits
  5. 13 Apr, 2022 4 commits
  6. 12 Apr, 2022 1 commit
    • C
      [CustomOp] Add context pool unittests (#41085) · 59ec9599
      Chen Weihang committed
      * add context pool unittests
      
      * fix timeout
      
      * polish details
      
      * change option pos
      
      * add dll decl for windows
      
      * fix pre-commit error
      
      * move dll_decl and export DeviceContext
      
      * replace lost dll_decl.h
  7. 08 Apr, 2022 1 commit
  8. 07 Apr, 2022 1 commit
  9. 04 Apr, 2022 1 commit
  10. 02 Apr, 2022 1 commit
  11. 01 Apr, 2022 5 commits
    • W
      [Eager] Support pinned (#41035) · f3270fc8
      wanghuancoder committed
      * support pinned, test=develop
      
      * support async_write, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine,test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
    • C
      [Phi] Move softmax with cross entropy kernel into phi (#40832) · e6ec98fe
      Chen Weihang committed
      * add cross_entropy_with_softmax phi kernel
      
      * remove softmax_with_cross_entropy kernel
      
      * add softmax_with_cross_entropy grad kernel
      
      * remove original op kernel
      
      * refine cross entropy impl
      
      * fix pointer error
      
      * revert kernel cu change
      
      * fix xpu failed
      
      * fix cinn failed
      
      * fix npu failed
      
      * add forward sig
      
      * add check_nan_inf for pt kernel
      
      * remove repeat cmake item
      
      * fix unittest error
    • C
      [Phi] Interpolate kernels into phi (#40855) · d65a7a46
      chentianyu03 committed
      * add interpolate cpu kernel
      
      * fix nullptr bug
      
      * add interpolate gpu kernel
      
      * fix unit test error
      
      * remove raw kernels
      
      * add cuda kernel impl
      
      * add infermeta
      
      * recover accidentally deleted kernels in interpolate op
      
      * fix grad x_grad name error
      
      * remove interpolate_v2_op.h
      
      * rm unused codes
      
      * fix xpu build error
      
      * fix build error
      
      * fix namespace error
      
      * add register header for npu
      
      * fix infermeta error
      
      * modify by review
      
      * add the missing args in test_trt_convert_nearest_interp_v2
    • L
      [KP] fix bug in activation xpu kp kernel (#41219) · 705776ca
      Liu-xiandong committed
      * fix bug in activation xpu kp kernel
      
      * delete useless comment
    • Z
      3a29e4f8
  12. 31 Mar, 2022 3 commits
  13. 30 Mar, 2022 1 commit
    • P
      support view strategy in dygraph eager_final state (#40891) · 495ca4aa
      pangyoki committed
      * support view strategy in eager_final state
      
      * perfect reshape kernel
      
      * fix bugs of sig
      
      * add unittest for reshape_sig
      
      * fix bugs when running coverage
      
      * fix inplace bug in final_state eager_gen
      
      * fix python_c_gen
      
      * support view strategy for final state
      
      * fix order of out and xshape in reshape
      
      * fix Coverage_CI unittest timeout error
      
      * support reshape view
      
      * fix reshape_sig
      
      * fix yml and api_base
      Co-authored-by: YuanRisheng <yuanrisheng@baidu.com>
  14. 27 Mar, 2022 1 commit
    • J
      Add StringTensor (#39830) · 0695e1ac
      Jack Zhou committed
      * add string tensor and case convert kernels
      
      * Add strings empty kernel; Reorganize the structure of case convert kernel
      
      * Add string infermeta
      
      * Update mutable_data of string tensor
      
      * rename kernel name
      
      * add string copy tmp
      
      * Fix strings copy device bug
      
      * add utf8 gpu converter
      
      * add string tensor c++ api
      
      * Remove mutable_data of string tensor
      
      * update string tensor interface
      
      * remove charcases_flag.h
      
      * remove some fluid headers
      
      * Add make_ddim
      
      * __HIPCC__ -> PADDLE_WITH_HIP
      
      * remove fluid headers
      
      * fix cpu compile
      
      * remove std::hash
      
      * Fix cudaMalloc
      
      * Remove strings/impl directory
      
      * Fix infrt/get_phi_kernel_info.py;Add custom_kernels deps
      
      * Add empty kernel test
      
      * Remove some comments
      
      * Modify lower/upper api encoding type: string->bool
      
      * STRING->PSTRING; Add CreateInferLikeMeta
      
      * Add code gen for C++ String API
      
      * remove strings_api_utils.h
      
      * Add ignore file (strings_api.h, strings_api.cc)
      
      * update strings gen script
      
      * change args order of case convert kernels
      
      * Add comments for pstring, StringTensor
      
      * cpstring_internal.h -> cpstring_impl.h
      
      * Update according to comments:
      
      1. Remove fluid headers
      2. paddle::platform::errors -> phi::errors
      3. Use 'place.GetType() == phi::AllocationType::GPU' instead of 'paddle::platform::is_cpu_space()'
      4. Use camel code style
      
      * Remove all singletons in strings kernels
      
      * fix rocm compile
      
      * Fix py3 compile
      
      * Fix c++ coverage
      
      * 1. Add pstring proto type
      2. Add StringTensor debug info
      3. Rename case_convert_kernel to strings_lower_upper
      4. Remove serialize/deserialize strings kernel
      
      * DataLayout::PSTRING -> DataLayout::PSTRING_UNION
      
      * Register pstring data type
      
      * Fix strings api gen
      
      * Fix dense tensor register pstring dtype
      
      * Fix error messages
      
      * remove line
      
      * add pstring unittest
      
      * remove test string api unittest
      
      * remove empty line
      
      * Remove some headers to decrease the size of executable file
  15. 25 Mar, 2022 1 commit
  16. 24 Mar, 2022 2 commits
  17. 23 Mar, 2022 1 commit
  18. 22 Mar, 2022 1 commit
  19. 21 Mar, 2022 1 commit
    • H
      Add yaml config part0 (#40020) · cc853e95
      hong committed
      * add add yaml
      
      * add elementwise add yaml; test=develop
      
      * add norm
      
      * update
      
      * add some yaml config; test=develop
      
      * fix bug; test=develop
      
      * fix compare error; test=develop
      
      * revert eager_gen.py
      
      * update; test=develop
      
      * remove useless code; test=develop
      
      * fix bug; test=develop
      
      * fix test error; test=develop
      
      * remove int_type; test=develop
      
      * fix type error; test=develop
      
      * format; test=develop
      
      * remove type register; test=develop
      
      * polish code; test=develop
      
      * fix ci error; test=develop
  20. 18 Mar, 2022 2 commits
  21. 17 Mar, 2022 2 commits
    • C
      [Phi] Move assign kernel into phi (#40022) · 1904572a
      Chen Weihang committed
      * move assign kernel init commit
      
      * change vec<tensor> to vec<tensor*>
      
      * support tensor array
      
      * support api declare
      
      * fix test_list failed
      
      * fix npu and xpu failed
      
      * fix infrt failed
      
      * remove assign array size in operator
      
      * move assign sr header into sr dir
      
      * add infermeta for assign
      
      * test op success
      
      * fix test_list failed
      
      * fix kunlun failed
      
      * add set host allocator in tests
      
      * support tensor array in arg ctx
      
      * open set layout in share_meta
      
      * fix meta tensor layout error
      
      * fix test failed
    • Z
      Optimize the performance of C++ API (#40640) · add304ed
      zyfncg committed
      * Optimize performance
      
      * optimize c++ api performance
      
      * remove unused code
      
      * fix paddle throw
      
      * update format
  22. 15 Mar, 2022 2 commits
    • C
      [Phi]move reduce_min/any/all kernel (#40374) · c46e661d
      chentianyu03 committed
      * add reduce_min kernel
      
      * remove raw reduce_min kernel
      
      * add reduce min
      
      * add reduce any all impl
      
      * add bool reduce Kernel
      
      * remove raw any/all kernel
      
      * add any all kernel
      
      * rm comment
    • H
      Move one hot to phi (#39876) · 7701db37
      hong committed
      * move one hot to phi; test=develop
      
      * fix bugs; test=develop
      
      * fix bugs; test=develop
      
      * add infer meta; test=develop
      
      * fix bugs; test=develop
      
      * resolve conflict
      
      * resolve conflict
      
      * fix bug;
      
      * fix error; test=develop
      
      * update; test=develop
      
      * polish code; test=develop
      
      * add one api in eager mode; test=develop
      
      * add one hot test; test=develop
      
      * remove useless code; test=develop
      
      * fix bug; test=develop
      
      * polish code; test=develop
      
      * polish code; test=develop
  23. 14 Mar, 2022 1 commit
  24. 13 Mar, 2022 1 commit
  25. 12 Mar, 2022 1 commit
  26. 11 Mar, 2022 1 commit
    • C
      [Phi] Reduce grad (#40263) · f452ad5c
      chentianyu03 committed
      * add reduce_sum grad kernel
      
      * add reduce_grad
      
      * modify reduce grad
      
      * update reduce grad functions
      
      * fix build error
      
      * add argument mapping
      
      * move cast input after grad
      
      * add dims.size=1 cpu reduce_sum grad compute method
      
      * update reduce grad GPU
      
      * remove raw reduce_sum_grad kernel
      
      * modify header files
      
      * add namespace funcs for reduce_grad_functions