1. 25 5月, 2022 1 次提交
    • X
      [EinsumOp] Optimize the backward speed of EinsumOp (#42663) · 71b046cd
      xiongkun 提交于
      * change logic for optimize
      
      * modifty
      
      * optimize the backward speed of EinsumOp
      
      * add cache optimizer for einsum op
      
      * EinsumOp: fix new dygraph mode error
      
      * fix bug
      
      * change Cache->InnerCache
      
      * fix code
      
      * fix
      
      * add nan inf utils for einsum op
      
      * add as_extra
      
      * Compatible with v2.3 EinsumOp
      
      * remove dispensable
      71b046cd
  2. 24 5月, 2022 1 次提交
  3. 23 5月, 2022 4 次提交
  4. 20 5月, 2022 4 次提交
    • N
      Delete ElementwiseKernel in BroadcastKernel (#42779) · 0d878f1a
      niuliling123 提交于
      0d878f1a
    • L
      use fp32 compute type for cublasGemmStridedBatchedEx with fp16 input/output (#42851) · f36a9464
      Leo Chen 提交于
      * use fp32 compute type for cublasGemmStridedBatchedEx with fp16 input/output
      
      * add flags to control compute type
      
      * default to false
      
      * add unit test
      
      * default to true
      f36a9464
    • Y
      move activation kernel (#42880) · 191c441a
      YuanRisheng 提交于
      191c441a
    • X
      [Hackathon No.5] tril_indices OP (#41639) · 75db5b86
      xiaoguoguo626807 提交于
      * add tril_indices cpu kernal
      
      * modify tril_indice cpu op
      
      * modify bug
      
      * modify bug
      
      * add tril_indices python api
      
      * add tril_indices python api
      
      * resolve conflict
      
      * add tril_indices test
      
      * modify details
      
      * add tril_indices.cu
      
      * pythonapi pass
      
      * save tril_indices
      
      * CPU tril_indices pass
      
      * delete vlog
      
      * modify test_tril_indices_op.py
      
      * delete tril_indices_kernel.cc.swp
      
      * delete tril_indice.cu
      
      * modify code style
      
      * add newline in creation.py
      
      * modify creation.py linux newline
      
      * delete annotation
      
      * check code style
      
      * check .py style add final_state??
      
      * modify code style
      
      * add gpu_tril_indices
      
      * modify gpu_compiled_juage
      
      * modify gpu judge
      
      * code style
      
      * add test example
      
      * modify english document
      
      modify english document
      
      modify english document
      
      modify document
      
      modify document
      
      * modify pram name
      
      * modify pram name
      
      * modify pram
      
      * reduce test ex
      75db5b86
  5. 19 5月, 2022 2 次提交
    • Z
      [Phi] Change the output format of C++ backward api (Part2) (#42545) · 4427f1b1
      zyfncg 提交于
      * change the output format of C++ backward api
      
      * fix merge conflict
      
      * fix sparse api code auto-gen
      
      * fix eager_gen bug
      
      * fix bug of output is null
      
      * fix bug of conv2d_grad_impl
      
      * fix optional grad
      
      * fix bug of eager-gen double_grad
      
      * fix bug
      
      * fix multiply_double_grad bug
      
      * fix bug of higher order derivative
      
      * fix bug of FillZeroForEmptyGradInput
      
      * remove redundant vector in grad_node
      
      * fix bug of test_deformable_conv_v1_op
      
      * fix bug of test_deformable_conv_v1_op
      
      * some refacotr
      4427f1b1
    • Z
      [Phi] Remove shared_storage (#42821) · 7a171e3c
      zyfncg 提交于
      * remove shared_storage
      
      * fix bug
      
      * fix rnn bug
      7a171e3c
  6. 18 5月, 2022 2 次提交
  7. 16 5月, 2022 2 次提交
  8. 12 5月, 2022 2 次提交
  9. 11 5月, 2022 1 次提交
    • Z
      [Phi] Change the output format of C++ backward api (Part1) (#42677) · ba71fbea
      zyfncg 提交于
      * change the output format of C++ backward api
      
      * fix merge conflict
      
      * fix sparse api code auto-gen
      
      * fix eager_gen bug
      
      * fix bug of output is null
      
      * fix bug of conv2d_grad_impl
      
      * fix optional grad
      
      * fix bug of eager-gen double_grad
      
      * fix bug
      
      * fix multiply_double_grad bug
      
      * remove node pruning
      ba71fbea
  10. 10 5月, 2022 3 次提交
  11. 09 5月, 2022 2 次提交
  12. 06 5月, 2022 1 次提交
  13. 05 5月, 2022 1 次提交
  14. 04 5月, 2022 1 次提交
  15. 01 5月, 2022 1 次提交
  16. 29 4月, 2022 2 次提交
  17. 28 4月, 2022 2 次提交
  18. 27 4月, 2022 5 次提交
  19. 26 4月, 2022 3 次提交
    • B
      【PaddlePaddle Hackathon 2】29、为 Paddle 新增 PixelUnshuffle 组网 API (#40728) · 5be9b824
      BrilliantYuKaimin 提交于
      * 增加PixelUnshuffle的形状推断
      
      * 增加PixelUnshuffle的算子注册
      
      * 增加PixelUnshuffle及其梯度的核函数
      
      * 增加PixelUnshuffle算子的描述
      
      * 增加PixelUnshuffle算子的签名
      
      * 在Python层面增加PixelUnshuffle
      
      * 增加PixelUnshuffle的单测
      
      * Update test_pixel_unshuffle.py
      
      * test=document_fix
      
      * Update test_pixel_unshuffle.py
      
      增加对extra_repr的测试
      
      * 修正代码格式
      
      * Update test_pixel_unshuffle.py
      
      修正对extra_repr的测试
      
      * 修改pixel_unshuffle核函数的实现位置
      
      * 修正代码格式
      
      * 完善对输入的检查
      
      * Update test_pixel_unshuffle.py
      
      * 完善pixel_unshuffle的输入检查
      
      * Update pixel_unshuffle_op.cc
      
      * Update unary.cc
      
      * add pixel_unshuffle
      
      * Update test_pixel_unshuffle.py
      
      * Update vision.py
      
      * 调整代码格式
      
      * Update vision.py
      
      * Delete extra spaces
      
      * Update pixel_unshuffle_sig.cc
      
      * Update vision.py
      
      * Update vision.py
      
      * add PixelUnshuffleGradInferMeta
      
      * remove PixelUnshuffleOpArgumentMapping
      
      * Update pixel_unshuffle_op.cc
      
      * 调整pixel_unshuffle及其梯度的核函数的实现位置
      
      * Update pixel_unshuffle_op.cc
      5be9b824
    • Z
      Optimize the performanece of sum api (#42231) · 2fe4bf2f
      zyfncg 提交于
      * optimize the performanece of sum api
      
      * optimize IsDenseTensorInput
      
      * remove debug log
      2fe4bf2f
    • Z
      Add Sparse MaxPool3D (#42130) · 18e9aafb
      zhangkaihuo 提交于
      18e9aafb