1. 25 5月, 2023 1 次提交
    • Z
      [IR] Refine Builder (#54052) · 3143d8bf
      zhangbo9674 提交于
      * refine code
      
      * delete some unused code
      
      * refine code of build
      
      * refine code of build
      
      * add block
      
      * refine builder
      
      * refine code
      
      * refine code by comment
      
      * fix compiler bug
      3143d8bf
  2. 24 5月, 2023 2 次提交
  3. 23 5月, 2023 2 次提交
    • Z
      [IR] Add op definition auto code generator (#54026) · b49a7e26
      zhangbo9674 提交于
      * Use copy_if_different to avoid recompilation of generated cutlass
      kernels.
      
      * add program parameter dialect_interface
      
      * fix op create bug
      
      * add conv2d
      
      * draft of paddle converter
      
      * fix CI
      
      * fix windows CI
      
      * fix program destructor
      
      * printer draft
      
      * fix bug
      
      * printer draft finish
      
      * fix windows CI
      
      * reserve inplace semantics
      
      * revert program::destroy since no need to do topology sort
      
      * revert
      
      * modify by reviews
      
      * commit printer and resnet50 related ops
      
      * fix
      
      * fix
      
      * fix op definition
      
      * refine op dyn_cast
      
      * fix bug
      
      * refine code
      
      * refine code
      
      * refine code
      
      * refine code
      
      * add code gen
      
      * refine code
      
      * refine code
      
      * refine code
      
      ---------
      Co-authored-by: Numiswing <umiswing@foxmail.com>
      Co-authored-by: Nkangguangli <kangguangli@hotmail.com>
      b49a7e26
    • K
      [NewIR] Program desc convert to IRProgram (#53707) · 07223e34
      kangguangli 提交于
      * Use copy_if_different to avoid recompilation of generated cutlass
      kernels.
      
      * add program parameter dialect_interface
      
      * fix op create bug
      
      * add conv2d
      
      * draft of paddle converter
      
      * fix CI
      
      * fix windows CI
      
      * fix program destructor
      
      * printer draft
      
      * fix bug
      
      * printer draft finish
      
      * fix windows CI
      
      * reserve inplace semantics
      
      * revert program::destroy since no need to do topology sort
      
      * revert
      
      * modify by reviews
      
      * polish
      
      * fix op definition
      
      * fix CI
      
      * refresh file changes
      
      ---------
      Co-authored-by: Numiswing <umiswing@foxmail.com>
      Co-authored-by: Nzhangbo9674 <zhangbo54@baidu.com>
      07223e34
  4. 22 5月, 2023 2 次提交
  5. 20 5月, 2023 1 次提交
  6. 19 5月, 2023 2 次提交
  7. 18 5月, 2023 4 次提交
    • H
      Fused elementwises kernels and ops (#51427) · fb4a6ecf
      Hulek 提交于
      * Fused elementwises kernels and ops
      
      * change fuse pass name
      
      * adjust .pbtxt files
      
      * adjust quantization attributes
      
      * add missing arguments and fix others, review fixed
      
      * simplify fused kernel registration
      
      * fix elementwise unit tests
      
      * reuse one fused elementwise op
      
      * adjust proto
      
      * Add supported datatypes
      
      * Change 'Scale' to 'scale' in tests, change some tests to onednn
      
      * Revert breaking changes
      
      * Fix unit tests
      
      * Delete obsolete test cases
      
      * Delete commented out code
      
      * Fix codestyle
      
      * delete temporary condition
      
      * fix conflicts and delete duplicate fusing
      
      * Fix code after merge
      
      * Move tests to new directory
      
      * fix tests volatility
      
      * Rename test_elementwise_add_onednn_op.py to test_elementwise_add_mkldnn_op.py
      
      * Update CMakeLists.txt add mkldnn op test
      
      ---------
      Co-authored-by: NSilv3S <slawomir.siwek@intel.com>
      fb4a6ecf
    • H
      move fusion_group kernel to phi (#53781) · 26da689d
      huangjiyi 提交于
      26da689d
    • C
      Fix typos, test=document_fix (#53927) · e916e80c
      co63oc 提交于
      e916e80c
    • R
      support auto generate for op layer_norm (#53178) · 4f07b653
      RedContritio 提交于
      * simplify layer_norm_op.cc
      
      * support auto generate for op layer_norm
      
      * update unittest for composite_layer_norm
      
      * remove layer_norm_op.cc from scripts
      
      * replace layer_norm_op with generated_op
      
      * add get_expected_kernel for layer_norm
      
      * update cmake kernel register function for layer_norm_mkldnn_op
      4f07b653
  8. 17 5月, 2023 1 次提交
    • Z
      [IR] Program & Parameter & PaddleDialect (#53557) · 78967ad2
      zhangbo9674 提交于
      * add program parameter dialect_interface
      
      * fix op create bug
      
      * add ir parameter convert pd variable methods
      
      * refine code
      
      * fix bug
      
      * refine by ut
      
      * refine ut
      
      * delete unused code
      
      * refine code
      
      * refine code by comment
      
      * reset WITH_NEW_IR
      
      * refine op attribute map
      
      * refine program and op create
      
      * refine program and op create
      78967ad2
  9. 16 5月, 2023 1 次提交
  10. 12 5月, 2023 2 次提交
  11. 11 5月, 2023 1 次提交
    • G
      [test]mv fluid [controlflow,detection,dlnne,tensorrt] tests to tests (#53470) · 80757527
      gouzil 提交于
      * [test]mv fluid controlflow detection dlnne tensorrt tests to tests
      
      * [test]clean dlnne
      
      * [test] fix test_tensorrt_engine_op
      
      * [test] try fix path error
      
      * [test] RollBACK test_tensorrt_engine_op
      
      * [test] RollBACK test_tensorrt_engine_op
      
      * [test]add todo
      
      * Empty-Commit; test=document_fix
      80757527
  12. 09 5月, 2023 1 次提交
  13. 08 5月, 2023 1 次提交
  14. 06 5月, 2023 3 次提交
    • Z
      [IR] OpTrait & OpInterface & OpInfo (#52846) · d91d758d
      zhangbo9674 提交于
      * add OpTrait OpInterface ValueIterator TypeList
      
      * refine code
      
      * refine code
      
      * refine code
      
      * add opinfo
      
      * add typeid copy constructor
      
      * add trait interface construct method for opinfo_impl
      
      * add trait interface construct method for opinfo_impl
      
      * add trait interface construct method for opinfo_impl
      
      * add trait interface construct method for opinfo_impl
      
      * add trait interface construct method for opinfo_impl
      
      * add create
      
      * add member func for opinfo
      
      * fix compile bug
      
      * add op interface in ircontext
      
      * fix compile bug
      
      * fix compile bug
      
      * refine code
      
      * fix compile bug
      
      * add ut
      
      * refine ut
      
      * refine code of opinfo_impl
      
      * delete unused code
      
      * add dyncast for operation
      
      * refine comment
      
      * refine opinfo_impl
      
      * delete unused code
      
      * refine code by comment
      
      * refine code
      
      * refine code
      
      * refine code for registerOp
      
      * refine opfin create
      
      * refine code of search method of ircontext
      
      * refine op attribute
      
      * change opinfo_map key from type_id to string
      d91d758d
    • C
      XPU Support external stream (#53334) · 99399f32
      csy0225 提交于
      99399f32
    • K
      rem npu in test (#53469) · a4997311
      Kim Yann 提交于
      * rem npu in test
      
      * restore some code
      a4997311
  15. 05 5月, 2023 4 次提交
  16. 04 5月, 2023 2 次提交
  17. 29 4月, 2023 1 次提交
  18. 28 4月, 2023 2 次提交
    • B
      Dropout optimize & clean broadcast inT and ElementwiseType (#52969) · d611e48c
      Bo Zhang 提交于
      * change judgement for DropoutGradGPUKernelDriver
      
      * add UnrollerWithoutVecSize and after this Loaddata to be refined
      
      * pass unittest
      
      * use same unroller with XPU
      
      * BroadcastWithInt64Index
      
      * BroadcastDataLoader template partial specialization
      
      * fix compile errs in ROCms
      
      * clean ElementwiseT and InT for BroadcastKernel
      
      * default axis and clean inT
      
      * remove redundant fast divmod computation
      
      * optimize drop_nd & drop_nd_grad
      
      * optimize BroadcastDataLoader bf16 fp16
      
      * rm InT etc. after merge develop
      
      * delete constexpr for windows ci
      
      * fix conflict
      
      * fix conflic with develop
      
      * fix conflic
      
      * new clean
      
      * clean
      d611e48c
    • G
      [test]mv fluid op cinn to test/cpp/fluid/cinn (#53443) · a53ee944
      gouzil 提交于
      a53ee944
  19. 25 4月, 2023 1 次提交
    • Y
      [PHI]Add flags macro for PHI (#52991) · 22e96bde
      YuanRisheng 提交于
      * add flags for phi
      
      * fix compile bugs
      
      * fix ci bugs
      
      * fix inference bugs
      
      * fix cinn' bugs
      
      * fix cinn bugs
      
      * perfect code according comment
      
      * fix ci bugs
      
      * fix ci bugs
      22e96bde
  20. 24 4月, 2023 1 次提交
  21. 22 4月, 2023 1 次提交
    • W
      [Zero-Dim] support output 0D for... · b406a7db
      wangfengsheng1999 提交于
      [Zero-Dim] support output 0D for is_empty/as_complex/inner/dot/rank/tensordot/squeeze_/static.accuracy/static.auc/metric.accuracy, test=allcase (#52850)
      
      * [Zero-Dim] support output 0D for is_empty/as_complex/, test=allcase
      
      * [Zero-Dim] support output 0D for is_empty/as_complex/, test=allcase
      
      * add test case
      
      * modify dot/metric.accuracy/static.accuracy/static.auc
      
      * modfiy inner/tensordot bug
      
      * test 9 api
      
      * [Zero-Dim] support output 0D for is_empty/as_complex/inner/dot/rank/tensordot/squeeze_/static.accuracy/static.auc/metric.accuracy, test=allcase
      
      * fix bug
      
      * support output 0D for is_empty/as_complex/inner/dot/rank/tensordot/squeeze_/static.accuracy/static.auc/metric.accuracy
      
      * code style
      
      * fix bug
      
      * fix test_dot_op bug
      
      * fix accuracy bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * codestyle
      
      * fix dot bug
      
      * fix dot bug
      
      * fix dot bug
      
      * code style
      
      * fix dot bug
      
      * fix dot bug
      
      * fix dot bug
      
      * fix dot bug
      
      * fix dot bug
      
      * fix dot bug
      
      * modify code
      b406a7db
  22. 21 4月, 2023 1 次提交
    • T
      Mv inference ut (#52987) · 008debe7
      tianshuo78520a 提交于
      * mv inference/api infer_ut
      
      * mv test
      
      * merge develop fix error
      
      * fix
      
      * fix build error
      
      * fix build error
      
      * fix bug
      
      * fix tester_helper.h
      
      * fix analyzer_transformer_profile_tester.cc
      
      * fix
      
      * fix mac
      
      * fix mac
      
      * fix error
      
      * fix
      
      * fix
      008debe7
  23. 14 4月, 2023 2 次提交
  24. 13 4月, 2023 1 次提交
    • H
      [enforce.h Decouple logging.h] Delete glog/logging.h from enforce.h (#52651) · 5664ea26
      HongyuJia 提交于
      * [enforce.h Decouple logging.h] Delete glog/logging.h from enforce.h
      
      * Add logging.h for profiler.cc
      
      * Add logging.h for gloo_utils.h
      
      * Add logging.h for addmm_kernel_impl.h
      
      * Add logging.h for addmm_grad_kernel_impl.h
      
      * Add logging.h for p_send_kernel.cu
      
      * Add logging.h for determinant_grad_kernel_impl.h
      
      * Add logging.h for p_recv_kernel.cu
      
      * Add logging.h for elementwise_grad_base.h
      
      * Add logging.h for transfer_layout_kernel.cc
      
      * Add logging.h for eigvals_kernel.cc and index_select_impl.h
      
      * Add logging.h for all files in kernel directory
      
      * Add logging.h for xpu_info.cc
      
      * Add logging.h for xpu
      5664ea26