1. 29 8月, 2023 1 次提交
  2. 16 8月, 2023 1 次提交
  3. 30 3月, 2023 1 次提交
    • H
      register fluid kerenls to phi [part 1] (#52014) · 93d01787
      huangjiyi 提交于
      * update assign_pos
      
      * update attention_lstm
      
      * update barrier
      
      * update batch_fc
      
      * update beam_search
      
      * update beam_search_decode
      
      * update bilateral_slice
      
      * fix bug
      
      * Handle Structure kernel for InterpreterCore::RunOperator
      
      * fix bug
      
      * fix rocm compile
      
      * fix rocm compile
      
      * Revert "fix rocm compile"
      
      * test
      
      * revert test and update cmake
      
      ---------
      Co-authored-by: Nchenruibiao <chenruibiao@baidu.com>
      93d01787
  4. 04 1月, 2023 1 次提交
    • H
      [Unify KernelKey] change OpKernelType->KernelKey (#49138) · 4383494f
      HongyuJia 提交于
      * execute use kernel_key first
      
      * change OpKernelType->KernelKey
      
      * fix py3 compile error, remove redundant header files
      
      * fix build_strategy_test
      
      * fix DataType::RAW
      
      * fix custom_type test: operator_test.cc
      
      * fix transform place
      
      * fix backends_are_same_class
      
      * try fix place TransDataDevice
      
      * support all KernelKey
      
      * fix TransformData
      
      * fix place_are_same_class
      
      * fix merge
      
      * fix test_params_no_grad
      
      * fix specific place of GetExpectedKernelType
      
      * fix specific place of GetExpectedKernelType
      
      * fix GetKernelTypeForVar
      
      * fix dtype error
      
      * fix fetch_v2
      
      * change GetKernelTypeForVar
      
      * fix interpreter
      
      * fix typo error
      
      * polish codes
      
      * polish codes
      
      * polish codes
      
      * fix conflict
      4383494f
  5. 15 9月, 2022 1 次提交
  6. 26 6月, 2022 1 次提交
  7. 24 3月, 2022 1 次提交
    • R
      [MoE]Assign pos op (#40580) · 305f32d1
      Roc 提交于
      * # This is a combination of 10 commits.
      # The first commit's message is:
      add expert count op
      
      add ut for expert_count
      
      # This is the 2nd commit message:
      
      update UT only for cuda
      
      # This is the 3rd commit message:
      
      fix for rocm
      
      # This is the 4th commit message:
      
      update ut
      
      # This is the 5th commit message:
      
      add moe module
      
      # This is the 6th commit message:
      
      add expert count op
      
      add ut for expert_count
      
      # This is the 7th commit message:
      
      update UT only for cuda
      
      # This is the 8th commit message:
      
      update ut
      
      # This is the 9th commit message:
      
      add moe module
      
      # This is the 10th commit message:
      
      make expert count private
      
      * add assign pos op
      
      * fix upper num name
      
      * add api _assign pos
      
      * add ut for assign pos op
      
      * update date
      
      * fix for win
      
      * update for test (timeout)
      
      * fix ut
      
      * update
      
      * fix ut for number count
      Co-authored-by: Nhlygit66666 <2570058140@qq.com>
      305f32d1