1. 31 10月, 2022 1 次提交
  2. 26 10月, 2022 1 次提交
  3. 19 10月, 2022 1 次提交
  4. 17 10月, 2022 1 次提交
  5. 14 10月, 2022 1 次提交
  6. 13 10月, 2022 1 次提交
    • Z
      Revert #46111 (#46961) · cf9ca61d
      Zhang Ting 提交于
      * Revert "【Hackathon No.56&38】deformable_conv_v1 算子实现 float16 数据类型支持&前向运行加速 (#46111)"
      cf9ca61d
  7. 12 10月, 2022 2 次提交
  8. 11 10月, 2022 1 次提交
  9. 10 10月, 2022 1 次提交
  10. 03 10月, 2022 1 次提交
  11. 30 9月, 2022 1 次提交
    • S
      support pure bfloat16 for more ops (#46364) · b7b231a6
      sneaxiy 提交于
      * support pure bfloat16
      
      * support bf16 linear
      
      * update PR to pass CI
      
      * tiny fix where_grad_kernel.cu
      
      * add bfloat16 to selu_grad to pass CI
      
      * fix selu grad compilation error
      b7b231a6
  12. 28 9月, 2022 2 次提交
    • C
      Remove the declaration of using Tensor in framework/tensor.h (#46432) · e12a905e
      Chen Weihang 提交于
      * remove needless using tensor
      
      * remove needless using tensor
      
      * resolve conflict
      
      * replace tensor using
      
      * fix format error
      
      * revert needless changing
      
      * fix rocm and npu compile error
      
      * fix cinn compile error
      
      * fix format error
      
      * fix mkldnn format error
      
      * fix mkldnn format error
      
      * fix cinn compile error
      
      * fix cinn compile error
      
      * fix cinn compile error
      
      * resolve conflict
      e12a905e
    • K
      [NPU] add gpu kernel for transfer layout (#46307) · 526d963e
      kangguangli 提交于
      * add gpu kernel for transfer layout
      
      * comment error throw
      
      * fix: flag setting in testcase; add condition check for raising error
      
      * fix typo
      
      * fix: add error type for PADDLE_THROW
      
      * remove kernel fallback in data_transfer.cc
      
      * remove useless variable definition
      526d963e
  13. 26 9月, 2022 1 次提交
  14. 23 9月, 2022 1 次提交
  15. 21 9月, 2022 1 次提交
  16. 20 9月, 2022 4 次提交
  17. 19 9月, 2022 2 次提交
    • Z
      Fix wrong eigen header include (#46082) · 59a2a987
      zyfncg 提交于
      * fix wrong eigen header include
      
      * fix complie bug
      
      * fix nan_inf_utils_detail
      
      * fix resource_manager
      
      * fix conv_miopen_helper
      59a2a987
    • L
      Performance fix for broadcast kernel [Part3] (#46071) · 46e4fb2a
      limingshu 提交于
      * first commit
      
      * refine code with template argument
      
      * refine code with template argument
      
      * add ternary broadcast test file
      
      * add ternary broadcast test file
      
      * fix accoriding to ci
      
      * fix op-benchmark ci error
      46e4fb2a
  18. 16 9月, 2022 1 次提交
    • S
      Support broadcast elementwise operators with int64 index type (#45741) · 20b5bf84
      sneaxiy 提交于
      * support int64 non-broadcast
      
      * support broadcast case for int64 index
      
      * fix bug
      
      * support more Arity
      
      * remove some codes
      
      * upgrade patchelf to v0.15.0 to pass CI build
      
      * fix bug
      
      * fix patchelf installation
      
      * add debug flags
      
      * remove useless codes
      
      * fix viterbi_decode and set_value op uts
      
      * remove always enable int64
      20b5bf84
  19. 15 9月, 2022 2 次提交
  20. 09 9月, 2022 3 次提交
  21. 07 9月, 2022 2 次提交
  22. 06 9月, 2022 2 次提交
  23. 05 9月, 2022 1 次提交
  24. 04 9月, 2022 1 次提交
  25. 02 9月, 2022 4 次提交
  26. 01 9月, 2022 1 次提交
    • H
      [phi] Migrate uniform_random XPU kernel to PHI (#45583) · ded33b58
      HongyuJia 提交于
      * copy kernel file to phi
      
      * delete some code
      
      * migrate uniform_random, test=kunlun
      
      * fix input error, test=kunlun
      
      * fix gpu register error, test=kunlun
      
      * add include file, test=kunlun
      
      * try fix error from CI, test=kunlun
      
      * polish other PR
      
      * fix CI-coverage error, test=kunlun
      ded33b58