1. 20 4月, 2023 1 次提交
  2. 10 4月, 2023 1 次提交
    • H
      [Opt Performance] Optimize custom operator performance (#52597) · 01247e33
      HongyuJia 提交于
      * [Opt Performance] Optimize custom operator performance, reconstruct python API auto-gen, add cache and use const inference
      
      * opt AutoGradMeta implementation
      
      * remove profiler codes
      
      * fix unit test
      
      * change year, 2021->2023
      
      * fix int64_t parse bug
      01247e33
  3. 03 4月, 2023 1 次提交
  4. 31 3月, 2023 1 次提交
  5. 27 3月, 2023 1 次提交
  6. 23 3月, 2023 1 次提交
  7. 22 3月, 2023 1 次提交
  8. 21 3月, 2023 1 次提交
  9. 16 3月, 2023 1 次提交
    • H
      [Custom Operator] Custom op support inplace mechanism (#51620) · f824bc0d
      HongyuJia 提交于
      * init unit test commit, contains register thinking
      
      * support inplace
      
      * get inplaced x.grad
      
      * Try support inplace and hook at the same time
      
      * Support inplace, need debug
      
      * Support inplace successfully
      
      * Inplace use Tensor&, consistent with Tensor*
      
      * fix MapPlainOutputs bug
      
      * fix double grad inplace error
      f824bc0d
  10. 07 3月, 2023 1 次提交
  11. 16 2月, 2023 1 次提交
    • H
      [phi decoupling] remove variable.h in phi (#50407) · 905cefd4
      Huang Jiyi 提交于
      * move variable_utils from phi_api_utils to fluid
      
      * fix coment
      
      * update include
      
      * fix bugs
      
      * fix bugs
      
      * fix bugs
      
      * fix bugs
      
      * fix bugs
      
      * update
      
      * update
      
      * fix CI-Windows-OpenBLAS
      
      * fix bugs
      
      * fix bugs
      
      * fix bugs
      
      * update include
      
      * move variable_utils to phi_utils
      
      * fix namespace
      905cefd4
  12. 14 2月, 2023 1 次提交
  13. 11 2月, 2023 1 次提交
    • H
      [Tensor Operator] Overload Tensor Operator (#50098) · 14e45f6b
      HongyuJia 提交于
      * init commit
      
      * fix tensor operator*
      
      * fix compile bug
      
      * bug reproduce
      
      * update commit
      
      * polish codes
      
      * fix compile bug
      
      * test begin
      
      * test begin
      
      * compile finish
      
      * restore origin composite_backward_api
      
      * pass local CI
      
      * fix merge error
      
      * fix merge error
      
      * change py_test from GPU->CPU, test custom op
      
      * polish codes, modify prim unittest
      
      * modify prim unittest
      
      * determine phi_tensor_operants location
      
      * polish codes
      
      * add header file
      
      * solve windows unresolved symbol
      
      * fix some CI error
      
      * add overload defination
      
      * fix CI inference and Windows
      
      * polish codes according to reviewers' opinion
      
      * polish codes according to reviewers' opinion
      14e45f6b
  14. 04 1月, 2023 1 次提交
    • H
      [Unify KernelKey] change OpKernelType->KernelKey (#49138) · 4383494f
      HongyuJia 提交于
      * execute use kernel_key first
      
      * change OpKernelType->KernelKey
      
      * fix py3 compile error, remove redundant header files
      
      * fix build_strategy_test
      
      * fix DataType::RAW
      
      * fix custom_type test: operator_test.cc
      
      * fix transform place
      
      * fix backends_are_same_class
      
      * try fix place TransDataDevice
      
      * support all KernelKey
      
      * fix TransformData
      
      * fix place_are_same_class
      
      * fix merge
      
      * fix test_params_no_grad
      
      * fix specific place of GetExpectedKernelType
      
      * fix specific place of GetExpectedKernelType
      
      * fix GetKernelTypeForVar
      
      * fix dtype error
      
      * fix fetch_v2
      
      * change GetKernelTypeForVar
      
      * fix interpreter
      
      * fix typo error
      
      * polish codes
      
      * polish codes
      
      * polish codes
      
      * fix conflict
      4383494f
  15. 21 12月, 2022 1 次提交
  16. 09 12月, 2022 1 次提交
  17. 22 11月, 2022 1 次提交
  18. 28 9月, 2022 1 次提交
    • C
      Remove the declaration of using Tensor in framework/tensor.h (#46432) · e12a905e
      Chen Weihang 提交于
      * remove needless using tensor
      
      * remove needless using tensor
      
      * resolve conflict
      
      * replace tensor using
      
      * fix format error
      
      * revert needless changing
      
      * fix rocm and npu compile error
      
      * fix cinn compile error
      
      * fix format error
      
      * fix mkldnn format error
      
      * fix mkldnn format error
      
      * fix cinn compile error
      
      * fix cinn compile error
      
      * fix cinn compile error
      
      * resolve conflict
      e12a905e
  19. 21 9月, 2022 1 次提交
  20. 20 9月, 2022 1 次提交
  21. 15 9月, 2022 1 次提交
  22. 26 6月, 2022 1 次提交
  23. 05 6月, 2022 1 次提交
  24. 24 4月, 2022 1 次提交
  25. 20 4月, 2022 1 次提交
  26. 13 4月, 2022 1 次提交
  27. 12 4月, 2022 1 次提交
  28. 30 3月, 2022 1 次提交
  29. 24 3月, 2022 1 次提交
  30. 14 3月, 2022 1 次提交
    • J
      Support custom op and paddle.autograd.bacward in eager (#40423) · 227fa408
      Jiabin Yang 提交于
      * eager, test=develop
      
      * fix bug, test=develop
      
      * eager, test=develop
      
      * merge legacy to fluid
      
      * eager, test=develop
      
      * eager, test=develop
      
      * Refactor TensorAdd func by template and remove gradient_accumulation in eager
      
      * Remove needless target name
      
      * eager, test=develop
      
      * eager, test=develop
      
      * Use overload instead of template
      
      * Remove legacy code
      
      * Remove legacy code
      
      * selectedrows, test=develop
      
      * Remove DataType test
      
      * eager, test=develop
      
      * eager, test=develop
      
      * support gan, test=develop
      
      * Using Tensor directly instead of using EagerTensor
      
      * support gradient_accumulation
      
      * make test_imperative_lod_tensor_to_selected_rows longer
      
      * make test_imperative_lod_tensor_to_selected_rows longer
      
      * refine code
      
      * ptb, test=develop
      
      * Rename all EagerTensor to Tensor
      
      * Rename some EagerTensor to Tensor
      
      * rename EagerTensor to EagerVariable
      
      * eager, test=develop
      
      * eager, test=develop
      
      * eager, test=develop
      
      * eager, test=develop
      
      * add more test
      
      * eager, test=develop
      
      * Support copiable selected rows and merge develop
      
      * save load, eager, test=develop
      
      * save load, eager, test=develop
      
      * refine, test=develop
      
      * remove useless _set_value method
      
      * refine, test=develop
      
      * refine, test=develop
      
      * revert static_runner, test=develop
      
      * EagerTensor to Tensor, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * clear grad, test=develop
      
      * merge, develop
      
      * merge, develop
      
      * merge, test=develop
      
      * merge, test=develop
      
      * Support quant and part of slice
      
      * support legacy static save
      
      * extend slim tests time
      
      * remove imperative on inference
      
      * remove imperative on inference
      
      * merge develop
      
      * fix typo
      
      * fix typo
      
      * split slice related code into 2 part for imperative and eager
      
      * split slice from inference
      
      * split slice from inference
      
      * fix test_tensor_register_hook
      
      * support custom op in eager mode
      
      * fix inference deps error
      
      * split eager utils from custom operator
      
      * fix type match
      
      * fix typo
      Co-authored-by: NWang Huan <wanghuan29@baidu.com>
      Co-authored-by: NWeilong Wu <veyron_wu@163.com>
      Co-authored-by: Nwanghuancoder <wanghuancoder@163.com>
      227fa408
  31. 28 2月, 2022 1 次提交
  32. 20 2月, 2022 1 次提交
  33. 19 2月, 2022 1 次提交
    • A
      [Pten]Unify paddle/pten::framework::ddim into pten::ddim (#39614) · 2fe04264
      Aurelius84 提交于
      * Unify paddle/pten::framework::ddim into pten::ddim
      
      * fix paddle namespace
      
      * compile sucessfully
      
      * fix npu src file
      
      * fix conflict
      
      * fix conflict
      
      * fix tensorrt compiler error
      
      * fix conflict
      
      * fix conflict
      
      * fix tesst file conflict
      
      * fix conflict
      
      * fix mlu file conflict
      
      * fix mlu file conflict
      
      * fix cinn header file conflict
      
      * fix conflict
      
      * fix conflict
      
      * fix conflict
      
      * fix conflict
      2fe04264
  34. 15 2月, 2022 1 次提交
    • A
      [PTen]Migrate proto::VarType outside of Pten (#39411) · 7e7e9404
      Aurelius84 提交于
      * #1 migrate dist-related type()-> dtype()
      
      * move datatype function from pten -> fluid/framework
      
      * change type() in imperative into convert(dtype())
      
      * modify xx_tensor->type into xx_tensor->dtype
      
      * change the set_type interface and the caller
      
      * modify xx_tensor.type into xx_tensor.dtype
      
      * fix mutable_data(place, dtype())
      
      * change caller of mutable_data in pten and distributed
      
      * change the caller of mutable_data in fluid/framework
      
      * change the caller of mutable_data in imperative directory
      
      * mutable_data: inference
      
      * update the call of mutable_data
      
      * transfer MakePenScalarArray MakePtenScalar ResetHolderWithType
      
      * pass the compile. the next step is remove VarType in Pten
      
      * fix all and remove VarType from pten. success in linux. Next task is other platform
      
      * fix conflict with develop
      
      * fix compiled error
      
      * Fix reset conversion
      
      * fix conflict
      
      * fix compiled problem
      
      * fix typo
      
      * Fix << in tensor_utils.cc
      
      * fix type->dtype
      
      * fix unittest
      
      * fix tensor init constructor
      
      * fix DataTypeSize for BFloat16
      
      * fix code style
      
      * fix npu compiled error
      
      * fix npu
      
      * compile npu sucessfully
      
      * fix conflict
      
      * fix conflict
      Co-authored-by: Nxiongkun <xiongkun03@baidu.com>
      7e7e9404
  35. 09 2月, 2022 1 次提交
  36. 08 2月, 2022 1 次提交
  37. 07 2月, 2022 1 次提交
  38. 29 1月, 2022 1 次提交
    • C
      [PTen] Tidy pten core headers (#39188) · dd990981
      Chen Weihang 提交于
      * open header for custom kernel
      
      * add core utils
      
      * tidy core code
      
      * tify header
      
      * tidy include
      
      * tidy namespace
      
      * resolve conflit
      
      * fix unittest and coverage
      
      * remove platform using
      
      * resolve conflict
      
      * resolve conflict
      
      * fix digamma namespace error
      
      * fix xpu full kernel error
      
      * fix xpu full kernel error
      
      * polish details
      
      * add place for lib storage
      dd990981
  39. 28 1月, 2022 1 次提交
  40. 27 1月, 2022 1 次提交