1. 30 6月, 2023 2 次提交
  2. 29 6月, 2023 5 次提交
    • H
      Refactor build attribute (#54968) · eef38db1
      hong 提交于
      * update
      
      * refactor build context
      
      * fix bug
      
      * polish code
      
      * change func name
      eef38db1
    • Y
      Fix compiling on XPU related to MPTypeTrait. (#54924) · 7353e9e9
      Yiqun Liu 提交于
      * Fix compiling on XPU related to MPTypeTrait.
      
      * Unify the use of MPTypeTrait.
      
      * Fix compiling error.
      7353e9e9
    • H
      Refactor op info parser (#54859) · f18d538b
      hong 提交于
      * add kernel dialect
      
      * change DenseTensorTypeStorage to DenseTensorType
      
      * add test case`
      
      * add first pd_op to kernel dialect
      
      * lower pd op to kernel dialect
      
      * update
      
      * update
      
      * remove useless code
      
      * add attrite print test
      
      * fix bug
      
      * update
      
      * update
      
      * update
      
      * update
      
      * polish code
      
      * fix bug
      
      * polish  code  and add python test
      
      * add test
      
      * fix test error
      
      * add env flag
      
      * fix bug
      
      * revert test env
      
      * change cc_test_old to cc_test
      
      * fix build_static bug
      
      * fix type test error
      
      * udpate cmake
      
      * disable test in windows
      
      * update
      
      * update
      
      * fix bug
      
      * split file
      
      * fix conflict
      
      * polish code and fix conflict
      
      * support place transformer
      
      * finish bug
      
      * add gpu flags
      
      * fix with cuda macro
      
      * add fetch kernel
      
      * support fetch var in new ir
      
      * fix bug
      
      * polish code
      
      * change array equal to np.testing
      
      * support feed in new ir
      
      * update
      
      * fix bug
      
      * try to hack combine op
      
      * add scope guard
      
      * revert atan2 op
      
      * add scope guard
      
      * update
      
      * polish code
      
      * update
      
      * refactor build kernel context
      
      * fix unitest bug
      
      * polish code
      
      * use original order
      
      * remove useless code
      
      * polish code
      
      * fix bug
      f18d538b
    • N
      Add fused_rope forward op (#54351) · a215c46a
      niuliling123 提交于
      * style
      
      * more
      
      * update ctest
      
      * Update legacy_backward.yaml
      
      * Update legacy_ops.yaml
      
      * Update legacy_ops.yaml
      
      * update
      
      * update
      
      * update for move
      a215c46a
    • H
  3. 28 6月, 2023 7 次提交
  4. 27 6月, 2023 5 次提交
    • Z
      delete swish_raw (#54536) · 0cdaafea
      zhangyuqin1998 提交于
      * delete swish_raw
      
      * fix
      
      * Update activation_kernel.cc
      
      * fix
      0cdaafea
    • J
      [Semi-Auto] SPMD Parallel Rule Base (#53863) · 6863e2ae
      JZ-LIANG 提交于
      * base rule
      
      * add sharidng merge
      
      * add sharidng axis merge
      
      * define unified data class for inferencing dist_attr
      
      * test wrap DistTensorSpec in dygraph mode
      
      * matmul main logic done
      
      * define unified data class for inferencing dist_attr
      
      ---------
      Co-authored-by: NYichen Zhang <zhangyichen03@baidu.com>
      6863e2ae
    • L
      fix bug when place 'use_cudnn' in extra (#54766) · 689e27af
      lzydev 提交于
      689e27af
    • TaoTao Li's avatar
      add all_to_all phi operator (#54797) · 158b7ae5
      TaoTao Li 提交于
      * add all_to_all phi operator, kernel, api
      
      * add all_to_all ut
      
      * tinyfix
      158b7ae5
    • H
      New ir support data transfer (#54763) · b58869fa
      hong 提交于
      * add kernel dialect
      
      * change DenseTensorTypeStorage to DenseTensorType
      
      * add test case`
      
      * add first pd_op to kernel dialect
      
      * lower pd op to kernel dialect
      
      * update
      
      * update
      
      * remove useless code
      
      * add attrite print test
      
      * fix bug
      
      * update
      
      * update
      
      * update
      
      * update
      
      * polish code
      
      * fix bug
      
      * polish  code  and add python test
      
      * add test
      
      * fix test error
      
      * add env flag
      
      * fix bug
      
      * revert test env
      
      * change cc_test_old to cc_test
      
      * fix build_static bug
      
      * fix type test error
      
      * udpate cmake
      
      * disable test in windows
      
      * update
      
      * update
      
      * fix bug
      
      * split file
      
      * fix conflict
      
      * polish code and fix conflict
      
      * support place transformer
      
      * finish bug
      
      * add gpu flags
      
      * fix with cuda macro
      
      * update
      
      * add scope guard
      
      * polish code
      b58869fa
  5. 26 6月, 2023 6 次提交
    • P
      exclude xpu (#54848) · 6962d3e2
      pangengzheng 提交于
      6962d3e2
    • H
      Support feed op new ir (#54840) · 1e323137
      hong 提交于
      * add fetch kernel
      
      * support fetch var in new ir
      
      * fix bug
      
      * polish code
      
      * change array equal to np.testing
      
      * support feed in new ir
      
      * fix bug
      
      * try to hack combine op
      
      * add scope guard
      
      * revert atan2 op
      
      * polish code
      1e323137
    • X
      [XPU] support xpu runtime profiler: follow up (#54690) · 9c3f4b13
      XiaociZhang 提交于
      * [XPU] support xpu runtime profiler: follow up
      
      * fix compile issue
      9c3f4b13
    • C
      support auto generation for gather (#54084) · ffeac6d5
      cyberslack_lee 提交于
      ffeac6d5
    • S
      Support static graph code-gen for bincount (#54686) · b547c4ac
      Sanbu 提交于
      b547c4ac
    • S
      remove ops from OpsWithFluidKernelNeedMoveToPhi set (#54007) · 733eca85
      Sonder 提交于
      * remove ops from OpsWithFluidKernelNeedMoveToPhi set
      
      * open static build flag
      
      * OpsWithFluidKernelNeedMoveToPhi
      
      * open new_executor_static_build
      
      * add infermate for cudnn_lstm
      
      * fix
      
      * update
      
      * fix
      
      * update
      
      * update
      
      * update
      
      * fix pow2 decay
      
      * fix pow2 decay
      
      * recover analysis_predictor.cc
      
      * fix pow2 decay
      
      * fix cudnn lstm
      
      * add output register info for svd
      
      * fix pow2_decay_with_linear_warmup_kernel
      
      * recover test lstm cudnn
      
      * recover svg register codes
      
      * fix register info
      
      * fix reduce sum register info
      
      * add output info for adadelta
      
      * add output info for adadelta
      
      * add output info for adamax
      
      * fix complex abs register info
      
      * add register info for cudnn_lstm_grad
      
      * recover
      
      * fix lstm cudnn
      
      * fix
      
      * fix xpu output registe info
      
      * remove std::cout
      
      * add backend
      
      * remove output info in pow2_decay_with_linear_warmup_kernel
      
      * add judgment in TensorShouldBeFakeInitialized
      
      * recover power_
      
      * close new_executor_static_build
      
      * fix set_value_xpu
      733eca85
  6. 25 6月, 2023 3 次提交
  7. 21 6月, 2023 1 次提交
  8. 20 6月, 2023 7 次提交
  9. 19 6月, 2023 4 次提交