1. 01 6月, 2022 2 次提交
  2. 31 5月, 2022 4 次提交
    • C
      [Phi] Polish assign kernel copy impl (#43061) · c9e7c407
      Chen Weihang 提交于
      * fix assign kernel copy impl
      
      * fix test failed
      c9e7c407
    • T
      【PaddlePaddle Hackathon 2】16 新增 API RRelu (#41823) · 21e1d10f
      thunder95 提交于
      * rrelu逻辑部分
      
      * unregistered op kernel (unresolved)
      
      * commit before merge
      
      * 丰富测试用例
      
      * 修复rrelu-sig的bug
      
      * 修复cpu环境测试
      
      * 修改拼写错误
      
      * 修改code format
      
      * 尝试优化测试用例timeout的问题
      
      * 优化测试用例
      
      * 移除seed, 优化随机函数
      
      * update en doc for rrelu
      
      * fix rrelu en docs, test=document_fix
      
      * add paper link for en docs, test=document_fix
      
      * udpate en doc
      
      * add r,test=document_fix
      21e1d10f
    • X
      [EinsumOp] Make EinsumOp support bfloat16. (#43085) · a4bb38cb
      xiongkun 提交于
      * change einsum_v2 as default and add new flags: FLAG_einsum_opt=1|0
      
      * make EInsumOP support bf16
      
      * add unittest for BF16
      
      * add condition for test_BF16
      
      * fix bugs
      
      * fix
      a4bb38cb
    • Z
      add embedding yaml (#43029) · 2785f876
      zyfncg 提交于
      * add embedding yaml
      
      * fix infermeta bug
      
      * fix bug of selected_rows infer_meta
      
      * fix selected_rows
      
      * add unittest
      2785f876
  3. 30 5月, 2022 5 次提交
  4. 27 5月, 2022 2 次提交
  5. 26 5月, 2022 2 次提交
  6. 25 5月, 2022 2 次提交
    • L
      fix maybe-uninitialized warning (#42902) · f1f79b0d
      Leo Chen 提交于
      * fix maybe-uninitialized warning
      
      * fix compile
      
      * fix xpu compile
      
      * fix npu compile
      
      * fix infer compile
      
      * fix compile
      
      * fix compile
      f1f79b0d
    • X
      [EinsumOp] Optimize the backward speed of EinsumOp (#42663) · 71b046cd
      xiongkun 提交于
      * change logic for optimize
      
      * modifty
      
      * optimize the backward speed of EinsumOp
      
      * add cache optimizer for einsum op
      
      * EinsumOp: fix new dygraph mode error
      
      * fix bug
      
      * change Cache->InnerCache
      
      * fix code
      
      * fix
      
      * add nan inf utils for einsum op
      
      * add as_extra
      
      * Compatible with v2.3 EinsumOp
      
      * remove dispensable
      71b046cd
  7. 24 5月, 2022 2 次提交
  8. 23 5月, 2022 4 次提交
  9. 20 5月, 2022 5 次提交
    • N
      Delete ElementwiseKernel in BroadcastKernel (#42779) · 0d878f1a
      niuliling123 提交于
      0d878f1a
    • L
      use fp32 compute type for cublasGemmStridedBatchedEx with fp16 input/output (#42851) · f36a9464
      Leo Chen 提交于
      * use fp32 compute type for cublasGemmStridedBatchedEx with fp16 input/output
      
      * add flags to control compute type
      
      * default to false
      
      * add unit test
      
      * default to true
      f36a9464
    • Y
      move activation kernel (#42880) · 191c441a
      YuanRisheng 提交于
      191c441a
    • W
      [Eager] Make CreateInferMeta more robust (#42871) · d8b69124
      Weilong Wu 提交于
      d8b69124
    • X
      [Hackathon No.5] tril_indices OP (#41639) · 75db5b86
      xiaoguoguo626807 提交于
      * add tril_indices cpu kernal
      
      * modify tril_indice cpu op
      
      * modify bug
      
      * modify bug
      
      * add tril_indices python api
      
      * add tril_indices python api
      
      * resolve conflict
      
      * add tril_indices test
      
      * modify details
      
      * add tril_indices.cu
      
      * pythonapi pass
      
      * save tril_indices
      
      * CPU tril_indices pass
      
      * delete vlog
      
      * modify test_tril_indices_op.py
      
      * delete tril_indices_kernel.cc.swp
      
      * delete tril_indice.cu
      
      * modify code style
      
      * add newline in creation.py
      
      * modify creation.py linux newline
      
      * delete annotation
      
      * check code style
      
      * check .py style add final_state??
      
      * modify code style
      
      * add gpu_tril_indices
      
      * modify gpu_compiled_juage
      
      * modify gpu judge
      
      * code style
      
      * add test example
      
      * modify english document
      
      modify english document
      
      modify english document
      
      modify document
      
      modify document
      
      * modify pram name
      
      * modify pram name
      
      * modify pram
      
      * reduce test ex
      75db5b86
  10. 19 5月, 2022 3 次提交
  11. 18 5月, 2022 4 次提交
  12. 17 5月, 2022 1 次提交
  13. 16 5月, 2022 3 次提交
  14. 13 5月, 2022 1 次提交