1. 31 Oct, 2022 2 commits
    • optimize: vit 384 (#47432) · 520adc0e
      feng_shuai committed
      * optimize: vit 384
      
      * fix:bug
      
      * fix:bug
      
      * fix:support rocm compile
      
      * refactor:name
      
      * fix:support rocm
      
      * fix:__HIP_NO_HALF_CONVERSIONS__
      
      * optimize: delete scalar
      
      * fix:rocm can't support
      
      * fix:ernie error
    • fix typos for `True` and `False` (#47477) · f5912d0c
      Nyakku Shigure committed
      * fix typo `Fasle`/`Flase` -> `False`
      
      * fix typo `Ture` -> `True`
  2. 27 Oct, 2022 1 commit
  3. 26 Oct, 2022 2 commits
  4. 25 Oct, 2022 1 commit
  5. 24 Oct, 2022 1 commit
  6. 17 Oct, 2022 1 commit
  7. 13 Oct, 2022 1 commit
    • [Kernel Selection] Remove hard code of PADDLE_WITH_MKLDNN (#46606) · ef1c8759
      HongyuJia committed
      * remove PADDLE_WITH_MKLDNN, test white_list=abs
      
      * fix unique_ptr
      
      * fix op.Type()
      
      * remove TODO in kernel_dispatch.h
      
      * remove IndicateVarDataType function, update white_list
      
      * remove mkldnn hard code
      
      * add comments
      
      * fix ==
      
      * update mkldnn_op_list
      
      * delete hard code of OPs
      
      * update mkldnn_op_list
      
      * update mkldnn_op_list, remove interp
      
      * add error check for ExecutionContext
      
      * update mkldnn_op_list, remove transpose2_grad
      
      * remove interpolate mkldnn
      
      * remove fill_constant mkldnn
      
      * opt HasAttr in DygraphExecutionContext
      
      * deprecated commit, test mkldnn_white_list
      
      * deprecated commit, test mkldnn_white_list
      
      * deprecated commit, test mkldnn_black_list
      
      * update mkldnn_op_list, add assert error op
      
      * solve cudnn related op
      
      * fix error
      
      * add mkldnn fallback in phi_utils.cc
      
      * remove mkldnn fallback in phi_utils.cc
      
      * opt code implementation
      
      * polish Copyright License
  8. 11 Oct, 2022 1 commit
  9. 10 Oct, 2022 2 commits
  10. 09 Oct, 2022 1 commit
  11. 30 Sep, 2022 1 commit
    • support pure bfloat16 for more ops (#46364) · b7b231a6
      sneaxiy committed
      * support pure bfloat16
      
      * support bf16 linear
      
      * update PR to pass CI
      
      * tiny fix where_grad_kernel.cu
      
      * add bfloat16 to selu_grad to pass CI
      
      * fix selu grad compilation error
  12. 28 Sep, 2022 1 commit
    • Remove the declaration of using Tensor in framework/tensor.h (#46432) · e12a905e
      Chen Weihang committed
      * remove needless using tensor
      
      * remove needless using tensor
      
      * resolve conflict
      
      * replace tensor using
      
      * fix format error
      
      * revert needless changing
      
      * fix rocm and npu compile error
      
      * fix cinn compile error
      
      * fix format error
      
      * fix mkldnn format error
      
      * fix mkldnn format error
      
      * fix cinn compile error
      
      * fix cinn compile error
      
      * fix cinn compile error
      
      * resolve conflict
  13. 21 Sep, 2022 1 commit
  14. 18 Sep, 2022 1 commit
  15. 15 Sep, 2022 1 commit
  16. 09 Sep, 2022 2 commits
  17. 08 Sep, 2022 2 commits
  18. 07 Sep, 2022 1 commit
  19. 01 Sep, 2022 1 commit
  20. 31 Aug, 2022 1 commit
  21. 23 Aug, 2022 1 commit
  22. 17 Aug, 2022 1 commit
  23. 16 Aug, 2022 1 commit
    • convert multihead to oss (#45019) · f706d95d
      feng_shuai committed
      * convert multihead to oss
      
      * fix:bug
      
      * fix:delete const cast
      
      * fix:don't support bias_qk
      
      * add vit pass
      
      * fix:convert bug and add preln_residual_bias
      
      * support length=-1
      
      * add UT for convert
      
      * add no_bias_qk support for gpu_multihead_op
      
      * delete infer_shape depends on bias_qk
      
      * oss just can be used in T4 and A*
      
      * fix:change api for ROCM CI
  24. 15 Aug, 2022 2 commits
  25. 09 Aug, 2022 1 commit
  26. 05 Aug, 2022 1 commit
  27. 02 Aug, 2022 1 commit
  28. 01 Aug, 2022 1 commit
    • unify gpu context (#44740) · 86763023
      Leo Chen committed
      * remove cudaDeviceContext
      
      * remove more template
      
      * fix rocm compile
      
      * remove alias name CUDADeviceContext
      
      * fix compile
      
      * fix tests
      
      * revert changes
  29. 29 Jul, 2022 3 commits
  30. 26 Jul, 2022 1 commit
  31. 19 Jul, 2022 2 commits