1. 10 4月, 2023 1 次提交
    • H
      [Opt Performance] Optimize custom operator performance (#52597) · 01247e33
      HongyuJia 提交于
      * [Opt Performance] Optimize custom operator performance, reconstruct python API auto-gen, add cache and use const inference
      
      * opt AutoGradMeta implementation
      
      * remove profiler codes
      
      * fix unit test
      
      * change year, 2021->2023
      
      * fix int64_t parse bug
      01247e33
  2. 03 4月, 2023 1 次提交
  3. 31 3月, 2023 1 次提交
  4. 27 3月, 2023 1 次提交
  5. 23 3月, 2023 1 次提交
  6. 22 3月, 2023 1 次提交
  7. 21 3月, 2023 1 次提交
  8. 20 3月, 2023 1 次提交
  9. 16 3月, 2023 1 次提交
    • H
      [Custom Operator] Custom op support inplace mechanism (#51620) · f824bc0d
      HongyuJia 提交于
      * init unit test commit, contains register thinking
      
      * support inplace
      
      * get inplaced x.grad
      
      * Try support inplace and hook at the same time
      
      * Support inplace, need debug
      
      * Support inplace successfully
      
      * Inplace use Tensor&, consistent with Tensor*
      
      * fix MapPlainOutputs bug
      
      * fix double grad inplace error
      f824bc0d
  10. 07 3月, 2023 1 次提交
  11. 31 1月, 2023 1 次提交
  12. 09 1月, 2023 1 次提交
  13. 30 12月, 2022 1 次提交
    • H
      [Custom device] Add custom_cpu testcase of custom_relu (#49300) · 69c7edcf
      HongyuJia 提交于
      * add custom_cpu testcase
      
      * update test_custom_device_setup
      
      * update path to custom_runtime
      
      * fix cmd wait
      
      * test Linux only
      
      * setup once
      
      * integrate to one run_cmd
      
      * add pip install
      
      * change timeout
      
      * add debug string
      
      * add debug string
      
      * add debug string
      
      * use os.system and change module name
      
      * add runtime
      
      * add more debug message
      
      * continue debug
      
      * timestamp
      
      * fix testcase import bug
      
      * remove error message
      
      * set TIMEOUT property
      69c7edcf
  14. 28 11月, 2022 1 次提交
  15. 22 11月, 2022 1 次提交
  16. 25 8月, 2022 1 次提交
    • H
      Fix relu python call (#45082) · 839fac65
      hong 提交于
      * add python final state
      
      * fix bug
      
      * fix bugs
      
      * fix bug
      
      * fix bug
      
      * revert impl, final state mul not support selected rows
      
      * fix softmax use cudnn error
      
      * add softlable false unitest
      
      * revert loss.py
      839fac65
  17. 04 8月, 2022 1 次提交
  18. 04 7月, 2022 1 次提交
  19. 05 6月, 2022 1 次提交
  20. 14 4月, 2022 2 次提交
  21. 13 4月, 2022 1 次提交
  22. 12 4月, 2022 2 次提交
  23. 14 3月, 2022 1 次提交
    • J
      Support custom op and paddle.autograd.bacward in eager (#40423) · 227fa408
      Jiabin Yang 提交于
      * eager, test=develop
      
      * fix bug, test=develop
      
      * eager, test=develop
      
      * merge legacy to fluid
      
      * eager, test=develop
      
      * eager, test=develop
      
      * Refactor TensorAdd func by template and remove gradient_accumulation in eager
      
      * Remove needless target name
      
      * eager, test=develop
      
      * eager, test=develop
      
      * Use overload instead of template
      
      * Remove legacy code
      
      * Remove legacy code
      
      * selectedrows, test=develop
      
      * Remove DataType test
      
      * eager, test=develop
      
      * eager, test=develop
      
      * support gan, test=develop
      
      * Using Tensor directly instead of using EagerTensor
      
      * support gradient_accumulation
      
      * make test_imperative_lod_tensor_to_selected_rows longer
      
      * make test_imperative_lod_tensor_to_selected_rows longer
      
      * refine code
      
      * ptb, test=develop
      
      * Rename all EagerTensor to Tensor
      
      * Rename some EagerTensor to Tensor
      
      * rename EagerTensor to EagerVariable
      
      * eager, test=develop
      
      * eager, test=develop
      
      * eager, test=develop
      
      * eager, test=develop
      
      * add more test
      
      * eager, test=develop
      
      * Support copiable selected rows and merge develop
      
      * save load, eager, test=develop
      
      * save load, eager, test=develop
      
      * refine, test=develop
      
      * remove useless _set_value method
      
      * refine, test=develop
      
      * refine, test=develop
      
      * revert static_runner, test=develop
      
      * EagerTensor to Tensor, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * clear grad, test=develop
      
      * merge, develop
      
      * merge, develop
      
      * merge, test=develop
      
      * merge, test=develop
      
      * Support quant and part of slice
      
      * support legacy static save
      
      * extend slim tests time
      
      * remove imperative on inference
      
      * remove imperative on inference
      
      * merge develop
      
      * fix typo
      
      * fix typo
      
      * split slice related code into 2 part for imperative and eager
      
      * split slice from inference
      
      * split slice from inference
      
      * fix test_tensor_register_hook
      
      * support custom op in eager mode
      
      * fix inference deps error
      
      * split eager utils from custom operator
      
      * fix type match
      
      * fix typo
      Co-authored-by: NWang Huan <wanghuan29@baidu.com>
      Co-authored-by: NWeilong Wu <veyron_wu@163.com>
      Co-authored-by: Nwanghuancoder <wanghuancoder@163.com>
      227fa408
  24. 22 2月, 2022 3 次提交
  25. 20 2月, 2022 1 次提交