1. 01 3月, 2023 14 次提交
  2. 28 2月, 2023 20 次提交
  3. 27 2月, 2023 6 次提交
    • J
      f78b4079
    • Z
      Add inferface of get registered phi kernels (#50814) · 0f8c304a
      zyfncg 提交于
      * add inferface of get registered phi kernels
      
      * change KernelType to KernelKey
      
      * add test
      
      * refactor code
      0f8c304a
    • H
      [XPU] add fp16 support for shape and lookup_table_v2 op. (#50773) · d2a0577a
      houj04 提交于
      * [XPU] add fp16 support for shape op.
      
      * [XPU] add fp16 support for lookup_table_v2 op.
      
      * update approval list: add qingshu's id.
      d2a0577a
    • Z
      377cbcea
    • 【Hackathon No.68】Remove utils in phi (#50833) · 6c181d1d
      张春乔 提交于
      * remove utils
      
      * remove utils
      
      * remove utils
      
      * remove utils
      
      * Update get_data_from_tensor.h
      
      * Update rnn_functor.h
      
      * Update rnn_grad_kernel.cu.cc
      
      * Update rnn_kernel.cu.cc
      
      * Update rnn_kernel.cc
      
      * Update rnn_grad_kernel.cu.cc
      
      * Update rnn_functor.h
      
      * Update rnn_kernel.cu.cc
      
      * Update rnn_kernel.cc
      
      * remove utils
      
      * Update rnn_functor.h
      
      * remove utils
      
      * remove utils
      
      * remove utils
      
      * remove utils
      
      * remove utils
      
      * Update rnn_functor.h
      
      * Update unsqueeze_op.h
      
      * Update utils.h
      
      * roll back
      
      * Update tensor_utils.h
      
      * Update tensor_utils.h
      
      * Update tensor_utils.h
      
      * Update tensor_utils.h
      
      * Update tensor_utils.h
      
      * use TensorToVector
      
      * use TensorToVector
      
      * use TensorToVector
      
      * use TensorToVector
      
      * use TensorToVector
      
      * Update rnn_kernel.cc
      
      * Update rnn_grad_kernel.cc
      
      * Update rnn_functor.h
      
      * Update rnn_grad_kernel.cu.cc
      
      * Update rnn_kernel.cu.cc
      
      * Update rnn_functor.h
      
      * Update rnn_grad_kernel.cu.cc
      
      * Update rnn_kernel.cu.cc
      
      * Update rnn_functor.h
      
      * Update rnn_grad_kernel.cu.cc
      
      * Update rnn_kernel.cu.cc
      
      * add TensorToVector
      
      * roll back
      
      * Update tensor_utils.h
      
      * Update rnn_functor.h
      
      * Update rnn_grad_kernel.cu.cc
      
      * Update tensor_utils.h
      
      * Update rnn_kernel.cu.cc
      
      * Update rnn_grad_kernel.cc
      
      * Update rnn_kernel.cc
      
      * Update rnn_grad_kernel.cu.cc
      
      * Update rnn_kernel.cu.cc
      
      * Update rnn_grad_kernel.cc
      
      * Update rnn_kernel.cc
      
      * TensorCopySync to phi::Copy
      
      * fix codestyle
      
      * rnn_kernel.cc: add ;
      
      * replace all GetDataFromTensor with phi::GetVectorFromTensor
      
      * delete include of util.h
      6c181d1d
    • W
      [TRT] Add sm version check for TensorRT flash attention and cross attention pass/plugin (#50830) · 38dad3b9
      Wang Bojun 提交于
      * add sm version check
      
      * use GetGPUComputeCapability
      38dad3b9