1. 15 6月, 2022 1 次提交
  2. 13 6月, 2022 1 次提交
  3. 10 6月, 2022 1 次提交
  4. 07 6月, 2022 3 次提交
  5. 06 6月, 2022 1 次提交
  6. 05 6月, 2022 1 次提交
  7. 04 6月, 2022 1 次提交
  8. 31 5月, 2022 1 次提交
  9. 30 5月, 2022 1 次提交
  10. 27 5月, 2022 1 次提交
  11. 26 5月, 2022 1 次提交
  12. 25 5月, 2022 1 次提交
  13. 20 5月, 2022 3 次提交
  14. 19 5月, 2022 1 次提交
    • Z
      [Phi] Change the output format of C++ backward api (Part2) (#42545) · 4427f1b1
      zyfncg 提交于
      * change the output format of C++ backward api
      
      * fix merge conflict
      
      * fix sparse api code auto-gen
      
      * fix eager_gen bug
      
      * fix bug of output is null
      
      * fix bug of conv2d_grad_impl
      
      * fix optional grad
      
      * fix bug of eager-gen double_grad
      
      * fix bug
      
      * fix multiply_double_grad bug
      
      * fix bug of higher order derivative
      
      * fix bug of FillZeroForEmptyGradInput
      
      * remove redundant vector in grad_node
      
      * fix bug of test_deformable_conv_v1_op
      
      * fix bug of test_deformable_conv_v1_op
      
      * some refacotr
      4427f1b1
  15. 16 5月, 2022 1 次提交
  16. 12 5月, 2022 1 次提交
  17. 10 5月, 2022 2 次提交
  18. 09 5月, 2022 1 次提交
  19. 01 5月, 2022 1 次提交
  20. 28 4月, 2022 1 次提交
  21. 27 4月, 2022 1 次提交
    • Z
      Optimize performance of dygraph (v4) (#42196) · 37e2f027
      zyfncg 提交于
      * optimize performance of dygraph
      
      * optimize performance of dygraph and elementwise_add
      
      * optimize the trace op
      
      * fix bug
      
      * fix bug
      
      * fix unittest bug
      
      * fix code format
      37e2f027
  22. 25 4月, 2022 2 次提交
  23. 18 4月, 2022 2 次提交
  24. 16 4月, 2022 1 次提交
  25. 14 4月, 2022 2 次提交
  26. 13 4月, 2022 1 次提交
  27. 12 4月, 2022 2 次提交
    • L
      [KP] Add Logical/compare/bitwise registry & UT (#40802) · 3749198e
      Lijunhui 提交于
      * init commit no push
      
      * collect comile errors
      
      * bitwise UT
      
      * fix compile problem
      
      * cancel comments
      
      * restore miss deletion
      
      * fix compilation
      
      * fix UT
      
      * NO stash in multiple branch at the same times
      
      * fix error
      
      * combine .cu from gpu and kps
      
      * replace gpu by kps
      
      * fix by Chen-weihang
      
      * Revert "Fix kps compile error in Junhui logic compare bitwise"
      
      * fix backend test
      
      * rm comments
      Co-authored-by: NChen Weihang <chenweihang@baidu.com>
      3749198e
    • F
      add a inner loop for index_select_grad_init() in index_select op when dealing... · bc01242b
      FlyingQianMM 提交于
      add a inner loop for index_select_grad_init() in index_select op when dealing with large-shape data (#41563)
      
      * replace for with CUDA_KERNEL_LOOP for index_select_grad_init() in index_select op
      
      * use CUDA_KERNEL_LOOP_TYPE
      
      * fix code style
      
      * replace index_select_grad_init with SetConstant
      bc01242b
  28. 07 4月, 2022 1 次提交
  29. 03 4月, 2022 1 次提交
    • F
      add maximum limit for grid of index_select (#41127) · af8d2482
      FlyingQianMM 提交于
      * limit grid dim for index select
      
      * mv LimitGridDim into gpu_launch_config.h
      
      * fix conflicts
      
      * fix conflicts
      
      * fix code style
      
      * set block to 256
      
      * fix grid setting
      
      * set dtype of block_dim to unsigned int
      af8d2482
  30. 02 4月, 2022 2 次提交