1. 26 Jun, 2023 6 commits
  2. 25 Jun, 2023 3 commits
  3. 22 Jun, 2023 1 commit
  4. 21 Jun, 2023 2 commits
  5. 20 Jun, 2023 7 commits
    • W
      static graph autogen code support for matmul op (#54338) · ad80fbfe
      Wang Xin authored
      * static graph autogen code support for matmul op
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
    • H
      New ir support combine op (#54682) · 0cf841c9
      hong authored
      * add kernel dialect
      
      * change DenseTensorTypeStorage to DenseTensorType
      
      * add test case
      
      * add first pd_op to kernel dialect
      
      * lower pd op to kernel dialect
      
      * update
      
      * update
      
      * remove useless code
      
      * add attribute print test
      
      * fix bug
      
      * update
      
      * update
      
      * update
      
      * update
      
      * polish code
      
      * fix bug
      
      * polish code and add python test
      
      * add test
      
      * fix test error
      
      * add env flag
      
      * fix bug
      
      * revert test env
      
      * change cc_test_old to cc_test
      
      * fix build_static bug
      
      * fix type test error
      
      * update cmake
      
      * disable test in windows
      
      * update
      
      * update
      
      * fix bug
      
      * split file
      
      * fix conflict
      
      * polish code and fix conflict
      
      * polish code
      
      * fix bug
    • H
      [XPU] optimize gelu unittest. (#54737) · 80975d45
      houj04 authored
    • Z
      [IR] Change IR from Static library to dynamic library (#54729) · 24a3cb52
      zhangbo9674 authored
      * new_ir to shared
      
      * refine code
      
      * add ir lib path to env
      
      * refine type
      
      * refine code
      
      * fix bug
      
      * fix bug
      
      * refine code
      
      * refine code
      
      * close win
      
      * refine code
      
      * refine code
      
      * refine code
      
      * add win share
      
      * refine code
      
      * refine code
      
      * refine code
      
      * refine code
      
      * refine code
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * solve conflict
      
      * solve conflict
      
      * fix bug
      
      * refine code
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * refine code
      
      * fix interpretercore program bug
      
      * delete unused code
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix cinn bug
      
      * fix cinn bug
      
      * debug
      
      * fix cinn bug
      
      * delete unused code
      
      * fix cinn bug
      
      * fix cinn bug
      
      * fix bug
      
      * test win openblas
      
      * test win openblas
      
      * fix win openblas bug
      
      * polish code
      
      * fix win open blas bug
      
      * close win dll
      
      * fix flag bug
      
      * test for windows
      
      * fix compile bug
    • J
      24523c16
    • L
      [XPU][PHI Kernels] add unique kernel for xpu (#54758) · f836e7d2
      lijin23 authored
      * add unique kernel for xpu
      
      * add unique kernel for xpu
      
      * update unittest
      
      * add xpu support for unique with axis
    • A
      [AutoTuner] Add compare and record (#54668) · 6fe7b5e2
      Azure authored
      * add auto tuner
      
      * compare and record module
      
      * revert launch main
      
      * add prune rule
      
      * add unit test
      
      * add auto tuner
      
      * revert launch main
      
      * add prune rule
      
      * modify unit test script
      
      * fix bug for dump nodes; fix bug for checking log file
      
      * fix bug
      
      ---------
      Co-authored-by: Ncaozhou <caozhou@radi.ac.cn>
  6. 19 Jun, 2023 4 commits
    • A
      [NewExe]Polish InterpreterCore with PImpl and Derived ProgramInterpreter and NewIRInterpreter (#54651) · f38e126e
      Aurelius84 authored
      * [NewExe]Polish InterpreterCore with PImpl
      
      fix code style
      
      add std::move
      
      * fix conflict
      
      * fix typo
      
      * fix typo
    • C
      [LAUNCH] no endpoints env in dynamic mode (#54636) · cf515d90
      Chitsing KUI authored
      * no endpoints in dy mode
      
      * fix fleet api inconsistent
    • T
      fix python (#54724) · 1375b3f7
      tianshuo78520a authored
    • H
      Support tensor attribute runtime (#54692) · 93f7a02a
      hong authored
      * add kernel dialect
      
      * change DenseTensorTypeStorage to DenseTensorType
      
      * add test case
      
      * add first pd_op to kernel dialect
      
      * lower pd op to kernel dialect
      
      * update
      
      * update
      
      * remove useless code
      
      * add attribute print test
      
      * fix bug
      
      * update
      
      * update
      
      * update
      
      * update
      
      * polish code
      
      * fix bug
      
      * polish code and add python test
      
      * add test
      
      * fix test error
      
      * add env flag
      
      * fix bug
      
      * revert test env
      
      * change cc_test_old to cc_test
      
      * fix build_static bug
      
      * fix type test error
      
      * update cmake
      
      * disable test in windows
      
      * fix inference compile
      
      * update
      
      * support tensor attribute runtime
      
      * add result check
      
      * polish test code
      
      * fix test error
      
      * add scalar test & polish code
      
      * re-open test case
  7. 16 Jun, 2023 6 commits
  8. 15 Jun, 2023 6 commits
    • Y
      fix mac unittest bugs when use static phi (#54656) · b7a6e981
      YuanRisheng authored
    • H
      exp/expm1 support int32/int64/float16 forward (#54556) · 58ae8c7c
      Hui Zhang authored
      * fix for log xxx
      
      * add int32/int64 for cpu/gpu; add float16/bfloat16 for cpu forward
      
      * fix docstring
      
      * fix bug
      
      * fix bugs
      
      * fix bugs
      
      * fix bugs
      
      * fix bugs
      
      * fix bug
      
      * using cast
      
      * fix test
      
      * fix api
      
      * fix other bugs
      
      * fix ci bug for not using dygraph guard
      
      * add bfloat16 test
      
      * fix ut
      
      * bf16
      
      * exp/expm1 support int32/int64
      
      * fix ut
      
      * fix ut
      
      * fix ut
    • H
      [IR] [Baby step] New interpreter support new ir (#54570) · ce0c5c27
      hong authored
      * add kernel dialect
      
      * change DenseTensorTypeStorage to DenseTensorType
      
      * add test case
      
      * add first pd_op to kernel dialect
      
      * lower pd op to kernel dialect
      
      * update
      
      * update
      
      * remove useless code
      
      * add attribute print test
      
      * fix bug
      
      * update
      
      * update
      
      * update
      
      * update
      
      * polish code
      
      * fix bug
      
      * polish code and add python test
      
      * add test
      
      * fix test error
      
      * add env flag
      
      * fix bug
      
      * revert test env
      
      * change cc_test_old to cc_test
      
      * fix build_static bug
      
      * fix type test error
      
      * update cmake
      
      * disable test in windows
      
      * fix inference compile
    • B
      [inference][trt]modify test timeout and test_trt_convert_activation bug fix (#54491) · 1f3dd978
      bukejiyu authored
      * modify tensorrt ci timeout
      
      * activation ci bug fix
      
      * comment out int8 mode test_trt_dynamic_shape_groupnorm
    • C
      fix batch_norm optest code (#54661) · 3a8484c4
      cyber-pioneer authored
    • G
      Fix sync batch norm op under cuda 12 (#54640) · 7fef4ee9
      Ghost Screaming authored
      * Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result
      is wrong.
      
      * Remove climits.
      
      * Fix problem of pickle and NCCL_P2P_DISABLE in distributed testcases in
      cuda12.
      
      * Fix problem of TimeOut of distributed testcases under cuda12.
      
      * Fix bug of test_sync_batch_norm_op_static_build accuracy problem under
      cuda12.
      
      * Remove useless code modification.
  9. 14 Jun, 2023 5 commits