1. 20 Jul, 2023 1 commit
    • H
      [NewIR]Change feed list to variable list && support GPU (#55401) · 75517841
      hong committed
      * add feed with place op
      
      * remove useless unittest
      
      * update mkldnn
      
      * update
      
      * new ir support builtin slice op
      
      * fix phi kernel adaptor bug
      
      * add enable_static
      
      * remove useless test case
      
      * change feed list to single variable
      
      * support gpu
      
      * fix bug
      
      * remove template
      
      * add more data type
      
      * fix compile bug
  2. 13 Jul, 2023 2 commits
  3. 12 Jul, 2023 1 commit
  4. 11 Jul, 2023 1 commit
  5. 03 Jul, 2023 1 commit
  6. 26 Jun, 2023 1 commit
    • S
      remove ops from OpsWithFluidKernelNeedMoveToPhi set (#54007) · 733eca85
      Sonder committed
      * remove ops from OpsWithFluidKernelNeedMoveToPhi set
      
      * open static build flag
      
      * OpsWithFluidKernelNeedMoveToPhi
      
      * open new_executor_static_build
      
      * add infermeta for cudnn_lstm
      
      * fix
      
      * update
      
      * fix
      
      * update
      
      * update
      
      * update
      
      * fix pow2 decay
      
      * fix pow2 decay
      
      * recover analysis_predictor.cc
      
      * fix pow2 decay
      
      * fix cudnn lstm
      
      * add output register info for svd
      
      * fix pow2_decay_with_linear_warmup_kernel
      
      * recover test lstm cudnn
      
      * recover svd register codes
      
      * fix register info
      
      * fix reduce sum register info
      
      * add output info for adadelta
      
      * add output info for adadelta
      
      * add output info for adamax
      
      * fix complex abs register info
      
      * add register info for cudnn_lstm_grad
      
      * recover
      
      * fix lstm cudnn
      
      * fix
      
      * fix xpu output register info
      
      * remove std::cout
      
      * add backend
      
      * remove output info in pow2_decay_with_linear_warmup_kernel
      
      * add judgment in TensorShouldBeFakeInitialized
      
      * recover power_
      
      * close new_executor_static_build
      
      * fix set_value_xpu
  7. 16 Jun, 2023 1 commit
  8. 14 Jun, 2023 1 commit
  9. 12 Jun, 2023 1 commit
    • H
      log/Log10/log2/log1p support int32/int64/float16/bfloat16 forward (#54089) · 2ddd0473
      Hui Zhang committed
      * fix for log xxx
      
      * add int32/int64 for cpu/gpu; add float16/bfloat16 for cpu forward
      
      * fix docstring
      
      * fix bug
      
      * fix bugs
      
      * fix bugs
      
      * fix bugs
      
      * fix bugs
      
      * fix bug
      
      * using cast
      
      * fix test
      
      * fix api
      
      * fix other bugs
      
      * fix ci bug for not using dygraph guard
      
      * add bfloat16 test
      
      * fix ut
      
      * bf16
  10. 05 Jun, 2023 1 commit
  11. 02 Jun, 2023 2 commits
  12. 01 Jun, 2023 1 commit
  13. 26 May, 2023 1 commit
    • Y
      [PHI Decoupling]Create PHI shared lib (#53735) · da50a009
      YuanRisheng committed
      * create phi so
      
      * fix ci bugs
      
      * fix py3 bugs
      
      * add file
      
      * fix py3 bugs
      
      * fix windows bugs
      
      * perfect so
      
      * fix py3 bugs
      
      * delete all static target in phi
      
      * fix windows bugs
      
      * fix py3 bugs
      
      * fix ci bugs
      
      * fix windows bugs
      
      * fix bugs: gflags can't be linked by dynamic and static lib
      
      * fix bugs that can not load 3rd party
      
      * fix ci bugs
      
      * fix compile bugs
      
      * fix py3 bugs
      
      * fix conflict
      
      * fix xpu bugs
      
      * fix mac compile bugs
      
      * fix psgpu bugs
      
      * fix inference failed
      
      * deal with conflict
      
      * fix LIBRARY_PATH bug
      
      * fix windows bugs
      
      * fix onednn error
      
      * fix windows compile bugs
      
      * fix windows compile bugs
      
      * fix test_cuda_graph_static_mode_error aborted
      
      * fix windows bugs
      
      * fix mac-python3 error
      
      * fix hip compile bugs
      
      * change mode to static
      
      * change to static mode
      
      * fix ci bugs
      
      * fix py3 bugs
      
      * fix windows bugs
      
      * fix bugs
      
      * add static flag
      
      * add PADDLE_API
      
      * change position of PADDLE_API
      
      * fix windows bugs
      
      * change mode to dynamic lib
      
      * fix windows static bugs
      
      * deal with conflict
      
      * fix windows unit bug
      
      * fix coverage
      
      * deal with conflict
      
      * fix windows-inference
      
      * fix py3 bugs
      
      * fix bugs when compile type_info
      
      * fix compile bugs
      
      * fix py3 bugs
      
      * fix windows bugs
      
      * fix windows openblas
      
      * fix xpu bugs
      
      * fix enforce_test in windows
      
      * update code according to review comments
      
      * fix windows cmake bug
      
      * fix windows bugs
      
      * fix windows bugs
      
      * delete cinn unittest
      
      * fix cinn bugs
      
      ---------
      Co-authored-by: lzydev <1528794076@qq.com>
  14. 25 May, 2023 2 commits
  15. 24 May, 2023 2 commits
  16. 19 May, 2023 3 commits
  17. 17 May, 2023 1 commit
  18. 16 May, 2023 2 commits
    • C
      Add huber_loss tests (#53535) · 74b91bce
      co63oc committed
    • G
      [phi] move stft to phi - Step 1 (#53517) · 00c21abc
      gouzil committed
      * [phi]mv StftKernel to phi
      
      * [phi] fix KernelSignature
      
      * [phi]fix arr error
      
      * [phi] Disable check_dygraph
      
      * [phi]fix include
      
      * [phi] rewrite mutable_data, add output register
      
      * [phi] fix Alloc
      
      * [phi] fix Alloc again
      
      * [phi] fix mutable_data
      
      * [phi] fix onesided_out Resize
  19. 15 May, 2023 3 commits
  20. 12 May, 2023 3 commits
    • P
      【Hackathon 4 No.20】Add i0 / i0e to paddle (#52058) · ce256f75
      PommesPeter committed
      * added base code for i0 and i0e
      
      * added grad base code for i0 and i0e
      
      * added i0 and i0e python code
      
      * added ops and backward yaml config
      
      * added i0 and i0e cpu kernel, but not tested.
      
      * added i0 and i0e code and unittest files
      
      * added test files
      
      * added i0/i0e gpu implementation code
      
      * updated code style
      
      * updated code style
      
      * fixed unittest code
      
      * updated i0 with eigen3
      
      * fixed bug and added more test cases
      
      * refactor: fixed static graph bug
      
      * refactor: removed i0 and i0e from op_compat
      
      * refactor: updated code style
      
      * refactor: updated op_compat.yaml
      
      * refactor: updated op_compat.yaml
      
      * refactor: fixed op name mapping and optimize unittest case
      
      * refactor: manually implement i0 / i0e
      
      * refactor: added grad kernel for i0 / i0e, not finished
      
      * Update math.py
      
      * refactor: added equation to doc in English and added comments for computing i0 / i0e gradient
      
      * refactor: removed eigen implementation
      
      * refactor: finished i0 / i0e cpu and gpu op
      
      * refactor: updated code style
      
      * fix: found a bug but did not fix it
      
      * fix: incorrect unittest cases
      
      * update: updated code style and remove my file
      
      * update: updated unittest case
      
      * fix: fixed sign error
      
      * fix: fixed mistakes when merging
      
      * refactor: updated code style
      
      * refactor: remove unused code
      
      * refactor: updated code style
    • W
      sequence_mask functionalization (#53478) · d2b1e3c2
      Wang Xin committed
      * sequence_mask functionalization
      
      * fix sequence_mask test
    • H
      move pow2_decay_with_linear_warmup kernel to phi (#53741) · 348565b0
      huangjiyi committed
      * update
      
      * update
  21. 11 May, 2023 1 commit
  22. 09 May, 2023 2 commits
  23. 05 May, 2023 2 commits
  24. 28 Apr, 2023 2 commits
    • B
      Dropout optimize & clean broadcast inT and ElementwiseType (#52969) · d611e48c
      Bo Zhang committed
      * change judgement for DropoutGradGPUKernelDriver
      
      * add UnrollerWithoutVecSize and after this Loaddata to be refined
      
      * pass unittest
      
      * use same unroller with XPU
      
      * BroadcastWithInt64Index
      
      * BroadcastDataLoader template partial specialization
      
      * fix compile errs in ROCms
      
      * clean ElementwiseT and InT for BroadcastKernel
      
      * default axis and clean inT
      
      * remove redundant fast divmod computation
      
      * optimize drop_nd & drop_nd_grad
      
      * optimize BroadcastDataLoader bf16 fp16
      
      * rm InT etc. after merge develop
      
      * delete constexpr for windows ci
      
      * fix conflict
      
      * fix conflict with develop
      
      * fix conflict
      
      * new clean
      
      * clean
    • G
      【0D output】add_0D_output_support (#52857) · ef6e8d09
      GGBond8488 committed
      * add 0d support for dist, trace, paddle.linalg.cond test=allcase
      
      * add_0d_output_support_for_det
      
      * test=allcase
      
      * support_0d_output_for_linalg.norm
      
      * support linalg.norm 0d output, test=allcase
      
      * fix 0D test
      
      * fix zero dim test, test=allcase
      
      * fix 0D test
      
      * fix tests, test=allcase
      
      * fix error, test=allcase
      
      * fix errors, test=allcase
      
      * add static backward, test=allcase
      
      * add static backward test, test=allcase
      
      * fix pr-ci-build error;test=document_fix (#53060)
      
      * [Cherry-Pick] Unique support float16&bfloat16 (#53023)
      
      unique now supports the float16 and bfloat16 data types, and the related unit tests were improved.
      
      * slogdet_support_0D_output
      
      * add new case
      
      * fix tests, test=allcase
      
      * fix p_norm related test, test=allcase
      
      * fix some err, test=allcase
      
      * test=allcase
      
      * move out trace
      
      * open some case, test=allcase
      
      * fix norm all case, test=allcase
      
      * fix some test error, test=allcase
      
      * fix typo, test=allcase
      
      * fix test err, test=allcase
      
      * test=allcase
      
      * test
      
      * fix test error, test=allcase
      
      * fix test error, test=allcase
      
      * fallback norm, test=allcase
      
      ---------
      Co-authored-by: tianshuo78520a <707759223@qq.com>
      Co-authored-by: Zhang Zheng <32410583+ZzSean@users.noreply.github.com>
  25. 27 Apr, 2023 2 commits
    • G
      [phi] Move sequence_pool to phi - Step 3 :sequence_pool_grad_op (#52680) · fe053396
      gouzil committed
      * [phi] move sequence_pool kernel to phi
      
      * mv kernels impl
      
      * fix parameter error
      
      * clean include
      
      * fix compat filename
      
      * [phi] move fluid sequence_pool_grad to phi
      
      * [phi][compat] sig rm GradVarName
      
      * [phi] fix sequence_pool out type
      
      * [phi] rm impl, add const string
      
      * [phi] fix const str
      
      * fix sequence_pooling cmake
      
      * [phi] mv sequence_pooling_test
      
      * [phi] fix grad sig
      
      * [phi] fix sequence_pool is_test error
      
      * [phi] fix sequence_pooling gpu include
      
      * [phi] mv to impl
      
      * [phi] fix SequencePoolFunctor cu include
      
      * [phi] modify out max_index int32_t
      
      * [phi] add pooltype mapping determine
      
      * [phi] fix sequence_pool_sig
      
      * [phi] fix sequence_pool_sig sum
      
      * [phi] try ci
      
      * [phi] fix max_index optional
    • C
      【Hackathon4】No5 nextafter (#52544) · 82ac3913
      cyberslack_lee committed